Matches
Rankings
Prompts
GitHub
LM
LMSYS Chatbot Arena
LLM Evals
Rankings
Category
Rank
Support Rate
95% CI
Trend
LLM Evals
#21
/31
0.3%
(7/2675)
0-1%
0.0%
Head-to-Head Matchups
LLM Evals
LM
LMSYS Chatbot Arena
vs
HE
Helicone
LMSYS Chatbot Arena
Helicone
50%
50%
14 decisive cases (30 needed)
Comments
No comments yet
Verified critics can leave comments here.