WebDev Arena Leaderboard
WebDev Arena is a real-time AI coding competition, developed by LMArena, where models go head-to-head in web development challenges.
Leaderboard
| Organization | Arena Score | License | 95% CI | Votes |
| --- | --- | --- | --- | --- |
| OpenAI | 1472.37 | Proprietary | +8.18 / -6.69 | 8,142 |
|  | 1456.34 | Proprietary | +6.95 / -6.06 | 9,559 |
| Anthropic | 1450.50 | Proprietary | +6.68 / -7.13 | 9,752 |
| Anthropic | 1420.01 | Proprietary | +9.22 / -9.44 | 5,717 |
|  | 1398.72 | Proprietary | +6.26 / -5.61 | 15,501 |
| MiniMax | 1398.35 | MIT | +10.82 / -9.15 | 4,346 |
| ZAI | 1394.39 | MIT | +7.81 / -6.77 | 7,566 |
| DeepSeek | 1392.62 | MIT | +12.47 / -10.29 | 4,800 |
| Anthropic | 1384.60 | Proprietary | +9.49 / -6.96 | 8,681 |
| Anthropic | 1382.60 | Proprietary | +6.45 / -7.88 | 9,238 |
| ZAI | 1378.10 | MIT | +9.59 / -9.01 | 4,360 |
| ZAI | 1365.57 | MIT | +16.63 / -15.52 | 1,425 |
| DeepSeek | 1359.84 | DeepSeek | +17.56 / -14.50 | 1,459 |
| Alibaba | 1365.31 | Apache 2.0 | +6.47 / -6.48 | 14,098 |
| Anthropic | 1361.80 | Proprietary | +7.34 / -4.79 | 11,526 |
| Anthropic | 1358.40 | Proprietary | +10.31 / -8.34 | 7,460 |
| Anthropic | 1353.72 | Proprietary | +9.00 / -7.62 | 7,402 |
|  | 1351.83 | Apache 2.0 | +14.74 / -21.31 | 992 |
| DeepSeek | 1338.02 | DeepSeek | +16.03 / -14.30 | 1,304 |
| Alibaba | 1333.36 | Proprietary | +9.65 / -9.41 | 3,977 |
| Deep Cogito | 1315.32 | MIT | +18.91 / -13.97 | 1,302 |
| Moonshot | 1315.08 | Modified MIT | +8.53 / -6.77 | 7,027 |
|  | 1294.98 | Proprietary | +4.67 / -6.04 | 15,809 |
| OpenAI | 1252.49 | Proprietary | +5.39 / -4.71 | 11,506 |
| Anthropic | 1238.15 | Proprietary | +7.88 / -6.13 | 26,267 |
| DeepSeek | 1207.92 | MIT | +19.41 / -21.74 | 1,094 |
| DeepSeek | 1199.36 | MIT | +9.67 / -12.20 | 3,755 |
| OpenAI | 1192.72 | Proprietary | +7.20 / -5.19 | 9,064 |
| Alibaba | 1189.50 | Apache 2.0 | +7.79 / -7.16 | 5,600 |
| OpenAI | 1186.42 | Proprietary | +6.07 / -8.33 | 5,572 |
| Mistral | 1180.56 | Proprietary | +9.24 / -7.79 | 7,511 |
| xAI | 1173.70 | Proprietary | +7.34 / -9.40 | 7,685 |
|  | 1151.73 | Proprietary | +10.25 / -8.45 | 4,991 |
|  | 1143.43 | Proprietary | +8.79 / -7.86 | 5,764 |
| OpenAI | 1136.74 | Proprietary | +10.17 / -15.73 | 2,979 |
| Anthropic | 1133.42 | Proprietary | +6.53 / -5.14 | 22,213 |
| MiniMax | 1129.41 | MIT | +11.81 / -7.72 | 3,361 |
| OpenAI | 1117.42 | Proprietary | +6.05 / -8.61 | 8,850 |
| OpenAI | 1092.96 | Apache 2.0 | +25.67 / -30.08 | 759 |
| OpenAI | 1092.17 | Proprietary | +10.09 / -9.00 | 6,369 |
|  | 1089.74 | Proprietary | +8.97 / -9.18 | 11,859 |
| OpenAI | 1045.17 | Proprietary | +9.42 / -8.64 | 9,235 |
| OpenAI | 1042.58 | Proprietary | +9.73 / -7.16 | 13,688 |
|  | 1040.25 | Proprietary | +6.75 / -7.52 | 10,498 |
|  | 1029.81 | Proprietary | +19.17 / -18.30 | 1,058 |
|  | 1027.07 | Llama 4 | +7.32 / -8.93 | 5,474 |
|  | 980.08 | Proprietary | +6.80 / -6.08 | 14,454 |
| Alibaba | 975.51 | Proprietary | +6.43 / -7.27 | 11,073 |
| OpenAI | 964.00 | Proprietary | +6.94 / -6.29 | 18,601 |
| DeepSeek | 959.79 | DeepSeek | +10.43 / -8.38 | 7,699 |
| Alibaba | 901.97 | Apache 2.0 | +7.04 / -6.64 | 16,199 |
|  | 901.18 | Llama 4 | +22.13 / -24.77 | 687 |
|  | 892.56 | Proprietary | +7.10 / -6.40 | 15,159 |
|  | 809.55 | Llama 3.1 | +18.14 / -17.09 | 1,117 |
More Statistics for WebDev Arena (Overall)
Figure 1: Confidence Interval for Model Strength
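As a rough illustration of how the 95% CI column can be read, the sketch below turns a score and its "+x / -y" offsets into lower and upper bounds and checks whether two intervals overlap. It uses the first three leaderboard rows; the labels `row_1` to `row_3` are hypothetical placeholders, since the page does not show model names. Treating the offsets as distances from the Arena Score is an assumption, and interval overlap is only a heuristic for "too close to call", not a formal significance test.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    label: str       # hypothetical placeholder, not a real model name
    score: float     # Arena Score
    ci_plus: float   # the "+x" part of the 95% CI column
    ci_minus: float  # the "-y" part of the 95% CI column

    @property
    def lower(self) -> float:
        return self.score - self.ci_minus

    @property
    def upper(self) -> float:
        return self.score + self.ci_plus


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True if the two confidence intervals share any point."""
    return a.lower <= b.upper and b.lower <= a.upper


# Values copied from the first three leaderboard rows.
row_1 = Entry("row_1", 1472.37, 8.18, 6.69)
row_2 = Entry("row_2", 1456.34, 6.95, 6.06)
row_3 = Entry("row_3", 1450.50, 6.68, 7.13)

print(intervals_overlap(row_1, row_2))  # False
print(intervals_overlap(row_2, row_3))  # True
```

Under these assumptions the top row's interval sits entirely above the second row's, while the second and third rows overlap, so their relative order is less certain.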
Figure 2: Average Win Rate Against All Other Models (Assuming Uniform Sampling and No Ties)
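A minimal sketch of how an average win rate of this kind could be approximated from the scores alone. It assumes an Elo-style logistic relationship between score difference and win probability (base 10, scale 400); the page does not state the formula, so that model and the function names `win_probability` and `average_win_rate` are assumptions for illustration. The labels `row_1` to `row_3` are hypothetical placeholders for the first three leaderboard rows.

```python
def win_probability(score_a: float, score_b: float, scale: float = 400.0) -> float:
    """Assumed Elo-style probability that A beats B, ignoring ties."""
    return 1.0 / (1.0 + 10.0 ** ((score_b - score_a) / scale))


def average_win_rate(scores: dict[str, float]) -> dict[str, float]:
    """Average each model's win probability over all other models,
    weighting every opponent equally (uniform sampling)."""
    rates = {}
    for name, score in scores.items():
        opponents = [s for other, s in scores.items() if other != name]
        rates[name] = sum(win_probability(score, s) for s in opponents) / len(opponents)
    return rates


# Arena Scores copied from the first three leaderboard rows.
scores = {"row_1": 1472.37, "row_2": 1456.34, "row_3": 1450.50}

for name, rate in average_win_rate(scores).items():
    print(f"{name}: {rate:.3f}")
```

With only three closely ranked rows every rate stays near 0.5; running it over the full score column would spread the values out in the way the figure title suggests.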
Figure 3: Fraction of Model A Wins for All Non-tied A vs. B Battles
Figure 4: Battle Count for Each Combination of Models (without Ties)