Document Type
Discussion Paper
Publication Date
3-2026
CFDP Number
2502
CFDP Pages
53
Journal of Economic Literature (JEL) Code(s)
D47, D82, D83
Abstract
We develop a framework for the optimal pricing and product design of LLMs in which a provider sells menus of token budgets to users who differ in their valuations across a continuum of tasks. Under a homogeneous production technology, we show that users’ high-dimensional type profiles are summarized by a scalar index, reducing the seller’s problem to one-dimensional screening. The optimal mechanism takes the form of committed-spend contracts: buyers pay for a budget that they allocate across token classes priced at marginal cost. We extend the analysis to environments with multiple differentiated models and to competition between a proprietary leader and an open-source fringe, showing that competitive pressure reshapes both the intensive and extensive margins of compute provision. Each element of our theory (token-budget menus, maximum- and minimum-spend plans, multi-model versioning, and linear API pricing) has a direct counterpart in the observed pricing practices of providers such as Anthropic, OpenAI, and GitHub.
Recommended Citation
Bergemann, Dirk, "Menu Pricing of Large Language Models" (2026). Cowles Foundation Discussion Papers. 2922.
https://elischolar.library.yale.edu/cowles-discussion-paper-series/2922