Preseason
MatchesRankingsPrompts
GitHub
Preseason
MatchesRankingsPromptsMethodologyContact

© 2026 Preseason. All rights reserved.

PrivacyTerms
@betocmn
AI / LLM Integration
Methodology

Einstein AI vs Vellum

EIEinstein AIvsVEVellum
Einstein AIVellum
42%
58%

Leading: Vellum (58.3%)

Insufficient data
This matchup has 24 decisive cases (minimum 30 required for publication).

Statistics

MetricValue
Einstein AI wins10
Vellum wins14
Abstains (no tool)19
Other tool chosen2558
Decisive cases24
Einstein AI win rate (unweighted)41.7%
95% CI24.5% - 61.2%
Einstein AI win rate (weighted)41.7%

Comments

Einstein AI

No comments yet

Verified critics can leave comments here.

Vellum

No comments yet

Verified critics can leave comments here.

Per-model breakdown

ModelTierEinstein AIVellumNoneOtherA rate
Devstral 2 2512Mid01221230%
Llama 4 ScoutSmall1002124100%
DeepSeek V3.2Mid0101300%
GLM 5 TurboFrontier0121280%
Claude Haiku 4.5Small000140n/a
Claude Opus 4.6Frontier000132n/a
Claude Opus 4.8Frontier0008n/a
Claude Sonnet 4.6Frontier000141n/a
DeepSeek R1 0528Frontier002136n/a
DeepSeek V4 FlashMid0009n/a
DeepSeek V4 ProFrontier0018n/a
Gemini 2.5 FlashSmall000132n/a
Gemini 2.5 ProFrontier002139n/a
Gemini 3.5 FlashSmall0008n/a
GLM 5.2Frontier0009n/a
GPT 5.3 CodexFrontier000141n/a
GPT 5.4Frontier000132n/a
GPT 5.4 MiniMid001140n/a
GPT 5.5Frontier0007n/a
Kimi K2.5Frontier001117n/a
Kimi K2.7 CodeFrontier0009n/a
Llama 4 MaverickFrontier003133n/a
MiMo V2 ProFrontier000130n/a
MiMo V2.5 ProFrontier0009n/a
MiniMax M2.7Frontier000120n/a
MiniMax M3Frontier0009n/a
Mistral Small 4Mid002107n/a
Qwen3 Coder NextMid001137n/a

Per-prompt breakdown

PromptTierEinstein AIVellumNoneOtherA rate
ai-support-agent-platformIntermediate01304210%
ai-revenue-ops-copilotBeginner71039988%
ai-revenue-ops-copilotIntermediate301428100%
ai-agent-applicationIntermediate00512n/a
ai-agent-applicationAdvanced00317n/a
ai-agent-applicationBeginner00515n/a
ai-revenue-ops-copilotAdvanced003419n/a
ai-support-agent-platformBeginner001417n/a
ai-support-agent-platformAdvanced001430n/a