Qwen
Qwen3-235B-A22B
Status: provisional
Summary stats
95% confidence interval: 909.714 to 976.224.
Current status
Why this row is not ranked yet
samples_total<60, samples_holdout<20, distinct_opponents<3, rd_glicko2>110.0, conservative_signal_not_separated_adjacent
Head-to-head results
vs GPT-OSS-120B
4-10-0
14 raw games
BT edge: 0.286 (0.117 to 0.546)
vs GPT-OSS-20B
5-5-0
10 raw games
BT edge: 0.500 (0.237 to 0.763)