HomeEverything
Equities & FundsCrypto & Digital AssetsAI & TechnologyBusiness & CorporateUS Politics & PolicyGeopolitics & Global RiskMacro, Rates & FXCommodities & EnergyEuropean Politics & MarketsAsia-PacificReal Estate & Property
← All Stories

Claude Fable 5 Performance Unchanged, Safety Classifier Causes Routing Issues

Created at 3 Jul · 9:10 PM1 source↑ Market-relevant
IN SHORT

Claude Fable 5's performance has not degraded, despite user and benchmark concerns. A new safety classifier is rerouting many coding and debugging tasks to a different model, leading to perceived quality drops.

✉Newsletter

PiQ Daily

Pick your topics. Get only what matters, on your cadence.

Key Numbers

86.2BridgeBench debugging score for Fable 5 before July 1
25.9BridgeBench debugging score for Fable 5 after July 1
73.6BridgeBench refactoring score for Fable 5 before July 1
38.4BridgeBench refactoring score for Fable 5 after July 1
75.9BridgeBench hallucination resistance score for Fable 5 before July 1
61.7BridgeBench hallucination resistance score for Fable 5 after July 1
1650Arena.AI frontend code Elo score before July 1
1623Arena.AI frontend code Elo score after July 1

Who's Involved

Claude Fable 5
AI model whose performance is under scrutiny
Anthropic
Developer of Claude models and safety classifiers
BridgeBench AI
AI evaluation platform that reported quality degradation
Arena.AI
LLM benchmarking platform that found minimal performance changes
Claude Opus 4.8
Model used as a fallback by the safety classifier
Claude Fable 5 Performance Unchanged, Safety Classifier Causes Routing Issues

↳ Why This Matters

The distinction between a model's inherent capability and the impact of its safety filters is crucial for users, especially developers, who may be paying for a premium model but receiving responses from a less capable fallback. This highlights the ongoing challenge of balancing AI safety with performance and usability.

Key facts

  • Claude Fable 5's performance has not degraded since its July 1 reinstatement.
  • A new safety classifier is rerouting many coding and debugging tasks to Claude Opus 4.8.
  • BridgeBench AI's scores dropped significantly because rerouted tasks were scored as zero.
  • Arena.AI's blind human-preference votes showed Fable 5's performance remained largely consistent.
  • Developers working in security-adjacent areas are most affected by the classifier's over-aggressiveness.

Concerns that Anthropic's Claude Fable 5 model was significantly degraded after its July 1 reinstatement have been largely attributed to an overzealous safety classifier rather than a decline in the model's capabilities. While benchmarks like BridgeBench AI showed drastic score drops in coding and debugging tasks, these results were skewed because the new classifier rerouted many prompts to Claude Opus 4.8, with BridgeBench scoring these fallbacks as zero.

Conversely, Arena.AI's blind human-preference tests, which rely on perceived quality rather than infrastructure routing, indicated that Fable 5's performance remained largely stable, with some categories even showing slight improvements. Users engaged in creative writing, document analysis, and expert text queries are unlikely to notice a difference.

However, developers, particularly those working in security-adjacent fields involving terms like 'vulnerability' or 'exploit,' are frequently hitting the classifier's fallback mechanism. Anthropic has acknowledged that the new classifiers are prone to false positives and will be refined over time, but has not provided a timeline for these improvements. The aggressive classifier was implemented to address a reported jailbreak technique that allowed Fable 5 to identify and demonstrate software vulnerabilities, which was deemed a national security threat.

Frequently asked questions

No, performance tests suggest Fable 5's core capabilities have not degraded. The perceived drop in quality is due to a new safety classifier rerouting many tasks to a different model.

BridgeBench AI's methodology scored rerouted tasks as zero. The new safety classifier intercepted most coding and debugging prompts, preventing Fable 5 from answering them.

Developers working on security-related coding tasks, or those using keywords that might be flagged as sensitive, are most likely to encounter the classifier's rerouting.

Anthropic has acknowledged the problem of false positives and stated that the classifiers will be refined over time, though no specific timeline has been given.

What Happens Next

01Anthropic will refine its safety classifiers to reduce false positives.
02Anthropic will provide a timeline for classifier improvements.

Get the newsletter.

Pick the topics you actually care about. We'll email when there's news worth your time, on the cadence you choose. Cancel any time from your account.

Cadence

How It Developed

Claude Fable 5 was reinstated online July 1.
Users reported Fable 5 was nerfed and underperforming.
BridgeBench AI reported a severe quality degradation in Fable 5's coding tasks.
Arena.AI found Fable 5's performance mostly flat, with some categories improving.
BridgeBench's scores were impacted by a new safety classifier rerouting tasks.
Arena.AI's human preference votes showed minimal performance differences.
Anthropic acknowledged the new classifiers produce false positives and will be refined.

Sources

T1
Claude Fable 5 Isn't Nerfed. The Router Is Just ParanoidDecrypt

Related Stories

China's GLM 5.2 AI model rivals Anthropic's Claude and OpenAI's GPT 5.5
3 Jul · 11:45 AM
Alibaba bans Anthropic's Claude Code over security and spyware concerns
3 Jul · 6:52 AM
EU Data Rules Slow LLM Rollout, Study Finds
3 Jul · 5:10 AM
ByteDance discovers AI scaling law potentially aiding industry growth
3 Jul · 10:05 PM
Humanoid robot with silicone flesh tells jokes that feel surprisingly human
3 Jul · 9:20 AM