AI Infrastructure Economics Series · Tool 3

Enterprise Token Economics Calculator

Model LLM consumption at organizational scale — with heavy-tail user distribution, workload-profile decomposition, agentic amplification, and governance levers. Built for CIOs, CFOs, and infrastructure operators who need to estimate order-of-magnitude token spend before the invoice arrives.

Organization Preset ready

Three reference configurations. Select one as a starting point, then edit any value.

Organization Shape

Total eligible headcount

Adoption at month 12 (%)

Power-user share (%)

Regular-user share (%)

Workload Profiles

Seven canonical work patterns. Defaults are computed server-side; edit any field to override for your environment.

Tier Composition

What fraction of each tier's time goes to each workload profile. Rows should sum to 100%.

Model Pricing

Per-million-token rates across three model tiers. Defaults reflect Anthropic Q2 2026 list prices.

Frontier Opus

Input $/M Output $/M Cache $/M

Mid Sonnet

Input $/M Output $/M Cache $/M

Cheap Haiku

Input $/M Output $/M Cache $/M

Enterprise discount (%)

Seat-license $/user/mo

Seat coverage (% of active)

Governance Levers

Per-user monthly token cap0 = no cap.

Opus → Sonnet routing (%)Share of frontier traffic routed to mid-tier.

Agent session timeout (hours)Caps amplification for agentic profiles. 0 = no cap.

Agentic intensity multiplier1.0 = baseline, 2.0 = double agentic consumption.

Attribution requirementRequired to enforce all other levers.

Headline KPIs

Annual Cost

—

Monthly Cost

—

steady-state

$ / Employee / Mo

—

blended

$ / Power User / Mo

—

top tier

Top-tier Concentration

—

of total spend

Decomposition & Sensitivity

Pareto: User consumption distribution

Users ranked by consumption vs cumulative % of spend.

Cost by workload profile

Where the token budget actually goes.

Cost composition: input / output / cache

Output tokens dominate at scale.

Sensitivity tornado

$ annual delta from each governance lever.

Export

Download configuration and computed results.