WebDev Arena Leaderboard
WebDev Arena is a real-time AI coding competition where models go head-to-head in web development challenges, developed by LMArena
Leaderboard
OpenAI
Arena Score
1473.25
License
Proprietary
95% CI
+7.33 / -8.87
Votes
8,004
Arena Score
1457.65
License
Proprietary
95% CI
+7.34 / -7.43
Votes
8,726
Anthropic
Arena Score
1451.27
License
Proprietary
95% CI
+6.96 / -7.06
Votes
8,986
Anthropic
Arena Score
1420.14
License
Proprietary
95% CI
+9.46 / -9.61
Votes
4,863
MiniMax
Arena Score
1404.51
License
MIT
95% CI
+10.09 / -8.64
Votes
3,515
Arena Score
1398.99
License
Proprietary
95% CI
+5.84 / -4.95
Votes
14,628
ZAI
Arena Score
1395.19
License
MIT
95% CI
+9.71 / -8.44
Votes
7,563
DeepSeek
Arena Score
1392.69
License
MIT
95% CI
+8.15 / -9.41
Votes
4,800
Anthropic
Arena Score
1386.98
License
Proprietary
95% CI
+7.02 / -9.47
Votes
7,855
Anthropic
Arena Score
1382.68
License
Proprietary
95% CI
+5.52 / -6.73
Votes
9,238
ZAI
Arena Score
1378.52
License
MIT
95% CI
+7.91 / -9.41
Votes
4,360
ZAI
Arena Score
1365.84
License
MIT
95% CI
+12.70 / -16.00
Votes
1,425
Alibaba
Arena Score
1365.02
License
Apache 2.0
95% CI
+6.33 / -8.34
Votes
13,296
Anthropic
Arena Score
1362.00
License
Proprietary
95% CI
+5.02 / -6.39
Votes
11,526
DeepSeek
Arena Score
1360.20
License
DeepSeek
95% CI
+16.62 / -15.38
Votes
1,459
Anthropic
Arena Score
1358.41
License
Proprietary
95% CI
+8.68 / -8.44
Votes
7,460
Anthropic
Arena Score
1354.30
License
Proprietary
95% CI
+8.18 / -8.57
Votes
6,549
Arena Score
1352.32
License
Apache 2.0
95% CI
+13.09 / -21.66
Votes
992
DeepSeek
Arena Score
1338.38
License
DeepSeek
95% CI
+14.74 / -13.62
Votes
1,304
Alibaba
Arena Score
1333.87
License
Proprietary
95% CI
+8.07 / -9.88
Votes
3,977
Moonshot
Arena Score
1315.22
License
Modified MIT
95% CI
+7.51 / -7.17
Votes
7,027
Arena Score
1294.43
License
Proprietary
95% CI
+6.82 / -5.55
Votes
14,956
OpenAI
Arena Score
1252.53
License
Proprietary
95% CI
+5.72 / -5.70
Votes
11,506
Anthropic
Arena Score
1238.14
License
Proprietary
95% CI
+5.40 / -4.49
Votes
26,267
DeepSeek
Arena Score
1207.91
License
MIT
95% CI
+16.47 / -17.91
Votes
1,094
DeepSeek
Arena Score
1199.39
License
MIT
95% CI
+13.58 / -13.10
Votes
3,755
OpenAI
Arena Score
1192.74
License
Proprietary
95% CI
+5.94 / -6.45
Votes
9,064
Alibaba
Arena Score
1189.50
License
Apache 2.0
95% CI
+5.54 / -7.55
Votes
5,600
OpenAI
Arena Score
1186.41
License
Proprietary
95% CI
+9.13 / -8.51
Votes
5,572
Mistral
Arena Score
1180.59
License
Proprietary
95% CI
+6.93 / -7.76
Votes
7,511
xAI
Arena Score
1173.77
License
Proprietary
95% CI
+5.58 / -6.30
Votes
7,685
Arena Score
1152.17
License
Proprietary
95% CI
+9.08 / -9.16
Votes
4,991
Arena Score
1143.43
License
Proprietary
95% CI
+7.26 / -8.19
Votes
5,764
OpenAI
Arena Score
1136.73
License
Proprietary
95% CI
+11.62 / -12.65
Votes
2,979
Anthropic
Arena Score
1133.41
License
Proprietary
95% CI
+6.47 / -4.93
Votes
22,213
MiniMax
Arena Score
1129.40
License
MIT
95% CI
+9.50 / -8.75
Votes
3,361
OpenAI
Arena Score
1117.46
License
Proprietary
95% CI
+7.30 / -7.98
Votes
8,850
OpenAI
Arena Score
1093.01
License
Apache 2.0
95% CI
+24.73 / -23.18
Votes
759
OpenAI
Arena Score
1092.17
License
Proprietary
95% CI
+8.05 / -8.32
Votes
6,369
Arena Score
1089.74
License
Proprietary
95% CI
+7.54 / -8.86
Votes
11,859
OpenAI
Arena Score
1045.16
License
Proprietary
95% CI
+7.20 / -7.58
Votes
9,235
OpenAI
Arena Score
1042.58
License
Proprietary
95% CI
+6.17 / -5.39
Votes
13,688
Arena Score
1040.25
License
Proprietary
95% CI
+9.02 / -5.26
Votes
10,498
Arena Score
1029.78
License
Proprietary
95% CI
+18.98 / -17.41
Votes
1,058
Arena Score
1027.08
License
Llama 4
95% CI
+8.02 / -8.43
Votes
5,474
Arena Score
980.07
License
Proprietary
95% CI
+8.72 / -5.89
Votes
14,454
Alibaba
Arena Score
975.51
License
Proprietary
95% CI
+6.98 / -5.99
Votes
11,073
OpenAI
Arena Score
964.00
License
Proprietary
95% CI
+5.81 / -4.81
Votes
18,601
DeepSeek
Arena Score
959.78
License
DeepSeek
95% CI
+6.88 / -8.16
Votes
7,699
Alibaba
Arena Score
901.97
License
Apache 2.0
95% CI
+6.64 / -7.44
Votes
16,199
Arena Score
901.09
License
Llama 4
95% CI
+25.48 / -26.48
Votes
687
Arena Score
892.56
License
Proprietary
95% CI
+7.53 / -7.11
Votes
15,159
Arena Score
809.69
License
Llama 3.1
95% CI
+17.93 / -21.63
Votes
1,117
More Statistics for WebDev Arena (Overall)
Confidence Interval for Model Strength
Figure 1
Average Win Rate Against All Other Models (Assuming Uniform Sampling and No Ties)
Figure 2
Fraction of Model A Wins for All Non-tied A vs. B Battles
Figure 3
Battle Count for Each Combination of Models (without Ties)
Figure 4