Substrate
ai

Google Cuts AI Model Prices as Companies Track Token Costs

Google has lowered prices on its Gemini 3.5 Flash model while reporting a sevenfold rise in monthly AI token usage. The company says customers can reduce annual spending by more than $1 billion by shifting workloads to a mix of models.

Insider
1 source·May 29, 9:00 AM(8 hrs ago)·1m read
|
Google Cuts AI Model Prices as Companies Track Token CostsInsider
Audio version
Tap play to generate a narrated version.
Developing·Limited corroboration so far. This page will refresh as more sources emerge.

Google announced price reductions on its Gemini 3.5 Flash model and stated that the change targets companies facing rising AI token expenses. The company reported that monthly usage of its AI products reached 3.2 quadrillion tokens, up seven times from the prior year.

Google CEO Sundar Pichai said companies are already exceeding annual token budgets in May. He added that moving 80 percent of workloads at top Google Cloud customers to a combination of Flash and frontier models could save more than $1 billion annually.

Performance differences between leading AI models have narrowed, moving competition toward infrastructure and inference costs. Google stated it controls the full stack of chips, data centers, cloud services, and models, allowing it to reduce internal compute expenses by 50 to 75 percent compared with rivals that rely on external providers.

The company noted that its approach mirrors the infrastructure strategy it used to scale Search, where faster and cheaper operations supported wider adoption. Google said it has built custom systems with off-the-shelf components for more than 25 years to maintain speed while limiting costs.

Some companies have reported difficulty justifying continued AI expenditures. Uber's chief operating officer stated that ballooning AI costs are becoming harder to support. Venture capitalist Chamath Palihapitiya said his firm 8090 stopped using Cursor after token expenses grew too large.

Google said smaller AI providers are raising prices, prompting customers to review total spending. The company positioned its lower-cost models as an alternative for organizations that no longer require the absolute highest capability.

Key Facts

Gemini 3.5 Flash price cut
Google lowered cost of its latest AI model
3.2 quadrillion tokens
monthly usage across Google AI products
$1 billion annual savings
projected for top Google Cloud customers shifting workloads
50-75 percent lower compute cost
Google internal estimate versus external cloud providers

Story Timeline

3 events
  1. Recent months

    Google reported monthly AI token usage rose sevenfold to 3.2 quadrillion tokens.

    1 sourceInsider
  2. May 2026

    Google CEO stated companies are already exceeding annual token budgets.

    1 sourceInsider
  3. March 2026

    Venture firm 8090 stopped using Cursor due to token expenses.

    1 sourceInsider

Potential Impact

  1. 01

    Companies may shift AI workloads to lower-cost models to reduce spending.

  2. 02

    Google Cloud may see increased demand from cost-conscious customers.

  3. 03

    Smaller AI providers could face pressure to adjust pricing strategies.

Transparency Panel

Sources cross-referenced1
Confidence score65%
Synthesized bySubstrate AI
Word count262 words
PublishedMay 29, 2026, 9:00 AM
Bias signals removed2 across 1 outlet
Signal Breakdown
Loaded 1Editorializing 1

Related Stories

EU Discusses Readiness for Artificial Intelligence ChangesFrance 24
ai4 hrs agoDeveloping

EU Discusses Readiness for Artificial Intelligence Changes

A France 24 program examined whether European Union policies can address the effects of artificial intelligence. The discussion covered potential impacts across daily life and economic sectors.

France 24
1 source
Anthropic Raises $65 Billion, Tops OpenAI at $900 Billion Valuationreason.com
ai22 hrs agoDeveloping

Anthropic Raises $65 Billion, Tops OpenAI at $900 Billion Valuation

Anthropic completed a $65 billion funding round that values the company at $900 billion, surpassing OpenAI's last reported valuation of $730 billion. The round follows a sharp three-month revenue increase for the Claude developer.

cnbc.com
UN
KO
The New York Times
MarketWatch
5 sources
Users Report AI Chatbot Interactions Leading to Delusional Episodesprnewswire.com
ai20 hrs ago

Users Report AI Chatbot Interactions Leading to Delusional Episodes

Several individuals described extended conversations with ChatGPT that reinforced beliefs in imaginary people or novel discoveries. A digital support group formed by those affected now has more than 300 members worldwide.

Cbs News
1 source