Preseason
MatchesRankingsPrompts
GitHub
Preseason
MatchesRankingsPromptsMethodologyContact

© 2026 Preseason. All rights reserved.

PrivacyTerms
@betocmn
AI / LLM Integration
Methodology

Anthropic Claude API vs Anthropic

ANAnthropic Claude APIvsAnthropicANAnthropic
Anthropic Claude APIAnthropic
35%
65%

Leading: Anthropic (64.9%)

Statistics

MetricValue
Anthropic Claude API wins52
Anthropic wins96
Abstains (no tool)19
Other tool chosen2434
Decisive cases148
Anthropic Claude API win rate (unweighted)35.1%
95% CI27.9% - 43.1%
Anthropic Claude API win rate (weighted)35.1%

Comments

Anthropic Claude API

No comments yet

Verified critics can leave comments here.

Anthropic

No comments yet

Verified critics can leave comments here.

Per-model breakdown

ModelTierAnthropic Claude APIAnthropicNoneOtherA rate
Claude Haiku 4.5Small366004438%
MiniMax M2.7Frontier161808647%
Mistral Small 4Mid0102970%
Kimi K2.7 CodeFrontier02070%
DeepSeek V4 FlashMid01080%
Gemini 3.5 FlashSmall01070%
GLM 5 TurboFrontier0121280%
GPT 5.5Frontier01060%
Kimi K2.5Frontier0111160%
MiniMax M3Frontier01080%
Claude Opus 4.6Frontier000132n/a
Claude Opus 4.8Frontier0008n/a
Claude Sonnet 4.6Frontier000141n/a
DeepSeek R1 0528Frontier002136n/a
DeepSeek V3.2Mid000131n/a
DeepSeek V4 ProFrontier0018n/a
Devstral 2 2512Mid002135n/a
Gemini 2.5 FlashSmall000132n/a
Gemini 2.5 ProFrontier002139n/a
GLM 5.2Frontier0009n/a
GPT 5.3 CodexFrontier000141n/a
GPT 5.4Frontier000132n/a
GPT 5.4 MiniMid001140n/a
Llama 4 MaverickFrontier003133n/a
Llama 4 ScoutSmall002134n/a
MiMo V2 ProFrontier000130n/a
MiMo V2.5 ProFrontier0009n/a
Qwen3 Coder NextMid001137n/a

Per-prompt breakdown

PromptTierAnthropic Claude APIAnthropicNoneOtherA rate
ai-support-agent-platformAdvanced289139376%
ai-support-agent-platformIntermediate424040614%
ai-revenue-ops-copilotIntermediate22514047%
ai-revenue-ops-copilotAdvanced164339980%
ai-revenue-ops-copilotBeginner11803885%
ai-support-agent-platformBeginner11014069%
ai-agent-applicationIntermediate025100%
ai-agent-applicationAdvanced023150%
ai-agent-applicationBeginner025130%