Model LLM consumption at organizational scale — with heavy-tail user distribution, workload-profile decomposition, agentic amplification, and governance levers. Built for CIOs, CFOs, and infrastructure operators who need to estimate order-of-magnitude token spend before the invoice arrives.
Three reference configurations. Select one as a starting point, then edit any value.
Seven canonical work patterns. Defaults are computed server-side; edit any field to override for your environment.
What fraction of each tier's time goes to each workload profile. Rows should sum to 100%.
Per-million-token rates across three model tiers. Defaults reflect Anthropic Q2 2026 list prices.
Download configuration and computed results.