Preseason

© 2026 Preseason. All rights reserved.

@betocmn

Humanloop vs Langfuse

Humanloop 48% vs Langfuse 52%

Leading: Langfuse (51.5%)

Statistics

Metric | Value
Humanloop wins | 47
Langfuse wins | 50
Abstains (no tool) | 105
Other tool chosen | 2473
Decisive cases | 97
Humanloop win rate (unweighted) | 48.5%
95% CI | 38.8% - 58.3%
Humanloop win rate (weighted) | 48.5%
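
The unweighted win rate and its interval can be reproduced from the two win counts. The sketch below assumes that decisive cases are Humanloop wins plus Langfuse wins, and that the interval is a Wilson score interval at z = 1.96. The page does not name the method; Wilson is an inference from the fact that it reproduces the published bounds. The function name `wilson_interval` is illustrative.

```python
import math


def wilson_interval(wins: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (assumed method)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = wins / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


# Counts taken from the Statistics table.
humanloop_wins, langfuse_wins = 47, 50
decisive = humanloop_wins + langfuse_wins  # abstains and other tools are excluded

rate = humanloop_wins / decisive
low, high = wilson_interval(humanloop_wins, decisive)
print(f"{rate:.1%} [{low:.1%}, {high:.1%}]")  # 48.5% [38.8%, 58.3%]
```

The interval spans 50%, so on these 97 decisive cases neither tool shows a clear lead.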

Per-model breakdown

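Model | Tier | Humanloop | Langfuse | None | Other | A rate (Humanloop)
Devstral 2 2512 | Mid | 19 | 0 | 6 | 110 | 100%
Qwen3 Coder Next | Mid | 0 | 17 | 3 | 121 | 0%
Gemini 2.5 Flash | Small | 14 | 0 | 1 | 112 | 100%
Claude Sonnet 4.6 | Frontier | 0 | 13 | 1 | 130 | 0%
Claude Haiku 4.5 | Small | 4 | 4 | 1 | 128 | 50%
GPT 5.4 Mini | Mid | 0 | 7 | 3 | 133 | 0%
DeepSeek V3.2 | Mid | 4 | 1 | 22 | 101 | 80%
DeepSeek R1 0528 | Frontier | 4 | 0 | 7 | 133 | 100%
Llama 4 Scout | Small | 0 | 4 | 7 | 120 | 0%
MiMo V2 Pro | Frontier | 2 | 0 | 8 | 122 | 100%
Claude Opus 4.8 | Frontier | 0 | 2 | 1 | 9 | 0%
DeepSeek V4 Flash | Mid | 0 | 1 | 1 | 10 | 0%
Mistral Small 4 | Mid | 0 | 1 | 2 | 133 | 0%
Claude Opus 4.6 | Frontier | 0 | 0 | 0 | 132 | n/a
DeepSeek V4 Pro | Frontier | 0 | 0 | 1 | 11 | n/a
Gemini 2.5 Pro | Frontier | 0 | 0 | 9 | 135 | n/a
Gemini 3.5 Flash | Small | 0 | 0 | 1 | 11 | n/a
GLM 5 Turbo | Frontier | 0 | 0 | 19 | 113 | n/a
GLM 5.2 | Frontier | 0 | 0 | 0 | 12 | n/a
GPT 5.3 Codex | Frontier | 0 | 0 | 0 | 144 | n/a
GPT 5.4 | Frontier | 0 | 0 | 0 | 132 | n/a
GPT 5.5 | Frontier | 0 | 0 | 0 | 12 | n/a
Kimi K2.5 | Frontier | 0 | 0 | 3 | 116 | n/a
Kimi K2.7 Code | Frontier | 0 | 0 | 1 | 11 | n/a
Llama 4 Maverick | Frontier | 0 | 0 | 2 | 135 | n/a
MiMo V2.5 Pro | Frontier | 0 | 0 | 0 | 12 | n/a
MiniMax M2.7 | Frontier | 0 | 0 | 5 | 124 | n/a
MiniMax M3 | Frontier | 0 | 0 | 1 | 11 | n/a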

Per-prompt breakdown

Prompt | Tier | Humanloop | Langfuse | None | Other | A rate (Humanloop)
ai-support-agent-platform | Beginner | 3 | 23 | 66 | 339 | 12%
ai-revenue-ops-copilot | Intermediate | 15 | 8 | 4 | 397 | 65%
ai-revenue-ops-copilot | Beginner | 11 | 4 | 10 | 405 | 73%
ai-support-agent-platform | Intermediate | 6 | 7 | 5 | 410 | 46%
ai-revenue-ops-copilot | Advanced | 8 | 0 | 2 | 409 | 100%
ai-support-agent-platform | Advanced | 3 | 5 | 5 | 417 | 38%
ai-engineering-workflow | Beginner | 0 | 2 | 8 | 10 | 0%
ai-agent-application | Intermediate | 1 | 0 | 0 | 19 | 100%
ai-engineering-workflow | Intermediate | 0 | 1 | 0 | 16 | 0%
ai-agent-application | Advanced | 0 | 0 | 0 | 18 | n/a
ai-agent-application | Beginner | 0 | 0 | 5 | 15 | n/a
ai-engineering-workflow | Advanced | 0 | 0 | 0 | 18 | n/a
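
The "A rate" column in the breakdown tables appears to be the share of decisive cases won by Humanloop, the first-listed tool, with "n/a" where neither tool was chosen. That reading is an assumption inferred from the rows, not a definition given on the page. A minimal sketch follows; the helper name `a_rate` is hypothetical.

```python
def a_rate(a_wins: int, b_wins: int) -> str:
    """Share of decisive cases won by tool A, or 'n/a' with no decisive cases."""
    decisive = a_wins + b_wins  # abstains and other-tool picks do not count
    if decisive == 0:
        return "n/a"
    return f"{a_wins / decisive:.0%}"


# (Humanloop wins, Langfuse wins) for three rows from the breakdown tables.
print(a_rate(3, 23))  # ai-support-agent-platform, Beginner -> 12%
print(a_rate(4, 1))   # DeepSeek V3.2 -> 80%
print(a_rate(0, 0))   # GPT 5.4 -> n/a
```

Rows with few decisive cases, such as the 100% rate from a single Humanloop win, say little on their own and are best read next to the raw counts.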