Preseason
MatchesRankingsPrompts
GitHub
Preseason
MatchesRankingsPromptsMethodologyContact

© 2026 Preseason. All rights reserved.

PrivacyTerms
@betocmn
AI / LLM Integration
Methodology

Zendesk vs Llama

ZEZendeskvsLlamaLLLlama
ZendeskLlama
50%
50%
Insufficient data
This matchup has 20 decisive cases (minimum 30 required for publication).

Statistics

MetricValue
Zendesk wins10
Llama wins10
Abstains (no tool)19
Other tool chosen2562
Decisive cases20
Zendesk win rate (unweighted)50.0%
95% CI29.9% - 70.1%
Zendesk win rate (weighted)50.0%

Comments

Zendesk

No comments yet

Verified critics can leave comments here.

Llama

No comments yet

Verified critics can leave comments here.

Per-model breakdown

ModelTierZendeskLlamaNoneOtherA rate
Llama 4 ScoutSmall01021240%
Devstral 2 2512Mid902126100%
Mistral Small 4Mid102106100%
Claude Haiku 4.5Small000140n/a
Claude Opus 4.6Frontier000132n/a
Claude Opus 4.8Frontier0008n/a
Claude Sonnet 4.6Frontier000141n/a
DeepSeek R1 0528Frontier002136n/a
DeepSeek V3.2Mid000131n/a
DeepSeek V4 FlashMid0009n/a
DeepSeek V4 ProFrontier0018n/a
Gemini 2.5 FlashSmall000132n/a
Gemini 2.5 ProFrontier002139n/a
Gemini 3.5 FlashSmall0008n/a
GLM 5 TurboFrontier002129n/a
GLM 5.2Frontier0009n/a
GPT 5.3 CodexFrontier000141n/a
GPT 5.4Frontier000132n/a
GPT 5.4 MiniMid001140n/a
GPT 5.5Frontier0007n/a
Kimi K2.5Frontier001117n/a
Kimi K2.7 CodeFrontier0009n/a
Llama 4 MaverickFrontier003133n/a
MiMo V2 ProFrontier000130n/a
MiMo V2.5 ProFrontier0009n/a
MiniMax M2.7Frontier000120n/a
MiniMax M3Frontier0009n/a
Qwen3 Coder NextMid001137n/a

Per-prompt breakdown

PromptTierZendeskLlamaNoneOtherA rate
ai-support-agent-platformBeginner101140691%
ai-support-agent-platformIntermediate0604280%
ai-support-agent-platformAdvanced0214280%
ai-revenue-ops-copilotIntermediate0114300%
ai-agent-applicationIntermediate00512n/a
ai-agent-applicationAdvanced00317n/a
ai-agent-applicationBeginner00515n/a
ai-revenue-ops-copilotBeginner000407n/a
ai-revenue-ops-copilotAdvanced003419n/a