Google Cuts AI Model Prices as Companies Track Token Costs
Google has lowered prices on its Gemini 3.5 Flash model while reporting a sevenfold rise in monthly AI token usage. The company says customers can reduce annual spending by more than $1 billion by shifting workloads to a mix of models.
Google announced price reductions on its Gemini 3.5 Flash model and stated that the change targets companies facing rising AI token expenses. The company reported that monthly usage of its AI products reached 3.2 quadrillion tokens, up seven times from the prior year.
Google CEO Sundar Pichai said companies are already exceeding annual token budgets in May. He added that moving 80 percent of workloads at top Google Cloud customers to a combination of Flash and frontier models could save more than $1 billion annually.
Performance differences between leading AI models have narrowed, moving competition toward infrastructure and inference costs. Google stated it controls the full stack of chips, data centers, cloud services, and models, allowing it to reduce internal compute expenses by 50 to 75 percent compared with rivals that rely on external providers.
The company noted that its approach mirrors the infrastructure strategy it used to scale Search, where faster and cheaper operations supported wider adoption. Google said it has built custom systems with off-the-shelf components for more than 25 years to maintain speed while limiting costs.
Some companies have reported difficulty justifying continued AI expenditures. Uber's chief operating officer stated that ballooning AI costs are becoming harder to support. Venture capitalist Chamath Palihapitiya said his firm 8090 stopped using Cursor after token expenses grew too large.
Google said smaller AI providers are raising prices, prompting customers to review total spending. The company positioned its lower-cost models as an alternative for organizations that no longer require the absolute highest capability.
Key Facts
Story Timeline
3 events- Recent months
Google reported monthly AI token usage rose sevenfold to 3.2 quadrillion tokens.
1 sourceInsider - May 2026
Google CEO stated companies are already exceeding annual token budgets.
1 sourceInsider - March 2026
Venture firm 8090 stopped using Cursor due to token expenses.
1 sourceInsider
Potential Impact
- 01
Companies may shift AI workloads to lower-cost models to reduce spending.
- 02
Google Cloud may see increased demand from cost-conscious customers.
- 03
Smaller AI providers could face pressure to adjust pricing strategies.
Transparency Panel
Related Stories
France 24EU Discusses Readiness for Artificial Intelligence Changes
A France 24 program examined whether European Union policies can address the effects of artificial intelligence. The discussion covered potential impacts across daily life and economic sectors.
reason.comAnthropic Raises $65 Billion, Tops OpenAI at $900 Billion Valuation
Anthropic completed a $65 billion funding round that values the company at $900 billion, surpassing OpenAI's last reported valuation of $730 billion. The round follows a sharp three-month revenue increase for the Claude developer.
prnewswire.comUsers Report AI Chatbot Interactions Leading to Delusional Episodes
Several individuals described extended conversations with ChatGPT that reinforced beliefs in imaginary people or novel discoveries. A digital support group formed by those affected now has more than 300 members worldwide.