Inception Labs' Mercury 2 AI Outperforms Google's DiffusionGemma on Key Benchmarks

21 Jun · 4:06 PM · window 24h

IN SHORT

Inception Labs' Mercury 2 AI model has outperformed Google's DiffusionGemma on key benchmarks, particularly in mathematical and science reasoning. Mercury 2 scored higher on the AIME 2026 mathematics test and reached a near-tie on the GPQA science benchmark, even though both models use similar generation techniques. Separately, OpenRouter has introduced its Fusion API, which matches the performance of Anthropic's Fable 5 at half the cost. The launch comes as Fable 5 has been suspended for international users due to U.S. export controls.

Key Numbers

2026: AIME mathematics test year

Who's Involved

Inception Labs

developer of the Mercury 2 AI model

Mercury 2

AI model demonstrating superior benchmark performance

Google

developer of the DiffusionGemma AI model

DiffusionGemma

AI model benchmarked against Mercury 2

OpenRouter

provider of the Fusion API

Fusion

API combining multiple AI models for cost-effective performance

Anthropic

developer of the Fable 5 AI model

Fable 5

AI model whose performance is matched by Fusion API

Key facts

Inception Labs' Mercury 2 AI model outperforms Google's DiffusionGemma on key benchmarks.
Mercury 2 demonstrated superior performance in mathematical and science reasoning.
Mercury 2 achieved a higher score on the AIME 2026 mathematics test.

Frequently asked questions

What is the main advantage of Mercury 2?

How does Mercury 2 differ from Google's DiffusionGemma?

What is parallel generation in AI models?
