What is Token Pricing?

The per-million-token rate a provider charges for input and output. Expressed as dollars per million tokens. Claude Opus output is $75 per million tokens. Haiku is $5. The gap is fifteen times, which is what pricing tiers exist to navigate.

Token Pricing | The Wise Operator

Providers publish token prices in dollars per million tokens. The numbers look small, but they add up fast. Claude Opus 4.7 at $15 input and $75 output means a single agent turn with 20,000 in and 5,000 out runs about $0.68. At 1,000 turns a day, that is $680 per day per feature, or roughly $20,000 per month. Cheaper tiers exist for a reason.

The Simple Version

Each model family has tiers. Opus is the smartest and most expensive. Sonnet is balanced. Haiku is fast and cheap. The same prompt costs dramatically different amounts depending on which tier it routes to. The operator’s job is matching the task to the cheapest tier that can still do it well.

Why It Matters

Using Opus for tasks Haiku can handle is how startups burn their runway. Most production workloads should be Haiku or Sonnet. Opus should be reserved for work that genuinely needs the extra capability: complex reasoning chains, high-stakes decisions, long autonomous runs. Pricing tiers are a discipline, not a menu.

How It’s Used on This Site

TWO’s digest writer agent runs on Opus because editorial quality benefits from it. The research agent runs on Sonnet because volume matters more than perfection. The tokenizer-level text cleanup runs on Haiku. Three tiers, three costs, one pipeline. Match the task to the tier.

Token Pricing

The Simple Version

Why It Matters

How It’s Used on This Site

Related Terms

Scrolls That Teach This

The Economics of Tokens