KCEC
(Kirr's Chess Engine Comparison)
A tournament of original free chess engines
June 16, 2013
Testing summary:
Total: 135,679 games
played by 202 programs
1398 CPU days (X2 4600+)

White wins: 55,227 (40.7%)
Black wins: 47,434 (35.0%)
Draws: 33,018 (24.3%)
White score: 52.9%
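
The percentages above can be reproduced from the raw counts. The sketch below assumes the usual chess scoring convention, in which a draw counts as half a point for each side; the page does not state this explicitly, and the variable and function names are illustrative only.

```python
# Game counts taken from the testing summary.
white_wins = 55_227
black_wins = 47_434
draws = 33_018

total = white_wins + black_wins + draws


def pct(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


# Assumption: a draw is worth half a point for White.
white_score = pct(white_wins + 0.5 * draws, total)

print(total)                   # 135679
print(pct(white_wins, total))  # 40.7
print(pct(black_wins, total))  # 35.0
print(pct(draws, total))       # 24.3
print(white_score)             # 52.9
```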

Custom engine selection

Comparing 15 engines!
13 best versions of selected engines played 795 games with each other

KCEC Rating List: Custom engine selection

Ponder off, neutral book (up to 8 moves), 3-4-5 piece EGTB
Time control: Equivalent to 40 moves in 4 minutes on Athlon 64 X2 4600+ (2.4 GHz)
Computed on June 16, 2013 with Bayeselo based on 135,679 games
Note: Please see how to read the list
Rank  Engine               Author (year)               Protocol  Country   Continent  Rating        Av.Op./Perf.  Slope          Av.Df.         Drawness       Games  LOS columns
1     Delfi 5.4            Fabio Cavicchio (2008)      UCI       Ita       Europe     2801 +24/−24  V V           −0.141 ±0.285  o o o          96.3% ±12.0%   704    100
      Delfi 5.2            Fabio Cavicchio (2007)                                     2718 +21/−21                −0.164 ±0.331  o o            98.6% ±12.0%   922    83 100 - 100 132 100
2     Hamsters 0.6         Alessandro Scotti (2007)    UCI       Ita       Europe     2669 +18/−18                −0.461 ±0.387  o o            101.2% ±9.8%   1206   49 100 206 100 123 100
      Hamsters 0.5         Alessandro Scotti (2007)                                   2595 +21/−21                +0.012 ±0.474  o o            97.7% ±11.9%   831    74 100 228 100 179 100
3     Leila 0.53h          Carmelo Calzerano (2002)    ! WB !    Ita       Europe     2490 +17/−17  V V           −0.171 ±0.210  o o o          83.5% ±8.4%    1430   105 100 301 100 227 100
4     Matilde 2008 64-bit  Andrea Lanza (2008)         ! WB !    Ita       Europe     2368 +15/−15  V             +0.021 ±0.191  o o o o        90.2% ±9.2%    1792   122 100 250 98.9 145 100
5     Esc 1.16             Claudio Della Corte (2002)  WB        Ita       Europe     2345 +15/−15  V             −0.172 ±0.111  o o o o o      97.4% ±9.3%    1811   23 100 195 100 73 100
6     CyberPagno 2.1 32MB  Marco Pagnoncelli (2004)    WB        Ita       Europe     2295 +16/−16                −0.036 ±0.120  o o o o o      89.7% ±8.9%    1697   50 100 83 81.7 60 100
7     Madeleine 0.2        Luigi Ripamonti (2002)      ! UCI !   Ita       Europe     2285 +15/−15                +0.005 ±0.121  o o o o o      93.0% ±7.9%    1760   10 100 129 100 79 100
8     Fortress 1.62        Alessandro Damiani (2000)   WB        Ita, Swi  Europe     2216 +15/−15  Λ             +0.002 ±0.132  o o o o o      105.2% ±13.8%  1748   69 100 121 100 111 100
9     Uragano 3D 0.87      Luca Naddei (2006)          ! WB !    Ita       Europe     2174 +16/−16                −0.024 ±0.150  o o o o        92.2% ±9.7%    1696   42 100 249 100 180 100
10    Smash 1.0.3          Maurizio Sambati (2006)     UCI       Ita       Europe     2036 +17/−17  V             −0.243 ±0.156  o o o o        87.1% ±12.2%   1568   138 100 343 100 301 100
11    Soldat 0.25b         Marco Giusfredi (2002)      ! WB !    Ita       Europe     1873 +17/−17  Λ Λ Λ         +0.083 ±0.109  o o o o o o o  117.3% ±14.0%  1711   163 100 380 100 242 100
12    Aldebaran 0.7.0      Mauro Scarpa (2001)         ! WB !    Ita       Europe     1794 +19/−19  Λ Λ           +0.130 ±0.124  o o o o o o o  168.0% ±14.7%  1280   79 100 269 98.6 106
13    Mizar 3.0            Nicola Rizzuti (2006)       ! WB !    Ita       Europe     1767 +20/−20  Λ Λ Λ Λ       −0.003 ±0.117  o o o o o o o  86.9% ±14.4%   1344   27

Score matrix

Custom engine selection (best versions only)
 #  Name                 Elo   Score against opponent # (percent, points/games)
 1  Delfi 5.4            2801
 2  Hamsters 0.6         2669
 3  Leila 0.53h          2490  5: 84% 21/25;  6: 79% 19/24;  7: 76% 17.5/23;  8: 84% 24.5/29
 4  Matilde 2008 64-bit  2368  5: 53% 17/32;  6: 78% 25/32;  7: 58% 18.5/32;  8: 70% 22.5/32;  9: 67% 21.5/32
 5  Esc 1.16             2345  3: 16% 4/25;  4: 47% 15/32;  6: 46% 11/24;  7: 58% 15/26;  8: 79% 20.5/26;  9: 77% 24.5/32
 6  CyberPagno 2.1 32MB  2295  3: 21% 5/24;  4: 22% 7/32;  5: 54% 13/24;  7: 52% 13/25;  8: 69% 16.5/24;  9: 61% 19.5/32
 7  Madeleine 0.2        2285  3: 24% 5.5/23;  4: 42% 13.5/32;  5: 42% 11/26;  6: 48% 12/25;  8: 46% 11.5/25;  9: 70% 22.5/32
 8  Fortress 1.62        2216  3: 16% 4.5/29;  4: 30% 9.5/32;  5: 21% 5.5/26;  6: 31% 7.5/24;  7: 54% 13.5/25;  9: 61% 19.5/32
 9  Uragano 3D 0.87      2174  4: 33% 10.5/32;  5: 23% 7.5/32;  6: 39% 12.5/32;  7: 30% 9.5/32;  8: 39% 12.5/32;  10: 81% 26/32;  11: 80% 25.5/32
10  Smash 1.0.3          2036  9: 19% 6/32;  11: 88% 28/32;  12: 77% 24.5/32;  13: 88% 28/32
11  Soldat 0.25b         1873  9: 20% 6.5/32;  10: 12% 4/32;  12: 69% 22/32;  13: 64% 20.5/32
12  Aldebaran 0.7.0      1794  10: 23% 7.5/32;  11: 31% 10/32;  13: 45% 14.5/32
13  Mizar 3.0            1767  10: 12% 4/32;  11: 36% 11.5/32;  12: 55% 17.5/32
 
(Only pairs with at least 20 games)
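
For a rough comparison with the observed percentages in the score matrix, the classical logistic Elo formula gives an expected score from a rating difference. This is only a sketch: Bayeselo fits its own model, which also accounts for draws and the advantage of playing White, so the value computed here should not be read as the page's own "Expected score" figure. The function name is illustrative.

```python
def elo_expected_score(rating_a, rating_b):
    """Classical logistic Elo expectation for player A against player B."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


# Ratings taken from the rating list.
smash, soldat = 2036, 1873

expected = elo_expected_score(smash, soldat)
observed = 28 / 32  # Smash scored 28/32 against Soldat in the score matrix

print(f"expected {expected:.1%}, observed {observed:.1%}")
```

With only 32 games per pairing, a sizeable gap between the expected and observed values is not surprising.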

LOS matrix

Custom engine selection (best versions only)
Each cell shows likelihood of superiority of one engine over the other one, in percents. These numbers are computed using Bayeselo for the complete game database.
 #  Name                 Elo      1     2     3     4     5     6     7     8     9    10    11    12    13
 1  Delfi 5.4            2801     - 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0
 2  Hamsters 0.6         2669   0.0     - 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0
 3  Leila 0.53h          2490   0.0   0.0     - 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0
 4  Matilde 2008 64-bit  2368   0.0   0.0   0.0     -  98.9 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0
 5  Esc 1.16             2345   0.0   0.0   0.0   1.1     - 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0
 6  CyberPagno 2.1 32MB  2295   0.0   0.0   0.0   0.0   0.0     -  81.7 100.0 100.0 100.0 100.0 100.0 100.0
 7  Madeleine 0.2        2285   0.0   0.0   0.0   0.0   0.0  18.3     - 100.0 100.0 100.0 100.0 100.0 100.0
 8  Fortress 1.62        2216   0.0   0.0   0.0   0.0   0.0   0.0   0.0     - 100.0 100.0 100.0 100.0 100.0
 9  Uragano 3D 0.87      2174   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0     - 100.0 100.0 100.0 100.0
10  Smash 1.0.3          2036   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0     - 100.0 100.0 100.0
11  Soldat 0.25b         1873   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0     - 100.0 100.0
12  Aldebaran 0.7.0      1794   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0     -  98.6
13  Mizar 3.0            1767   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   1.4     -
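
One rough way to get a feel for these numbers is to treat each rating as a normally distributed estimate and ask how likely the difference is to be positive. The sketch below assumes that the ± values in the rating list are 95% intervals and that the two estimates are independent. Neither assumption is stated on the page, and Bayeselo derives LOS from its full model rather than this way, so the result is only roughly comparable with the matrix. Function and parameter names are illustrative.

```python
import math


def approx_los(rating_a, err_a, rating_b, err_b, z95=1.96):
    """Approximate probability (0..1) that A is stronger than B.

    Assumes err_a and err_b are 95% half-widths of independent,
    normally distributed rating estimates (an assumption, not KCEC's method).
    """
    sigma_a = err_a / z95
    sigma_b = err_b / z95
    sigma_diff = math.sqrt(sigma_a ** 2 + sigma_b ** 2)
    z = (rating_a - rating_b) / sigma_diff
    # Standard normal CDF expressed through the error function.
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


# CyberPagno 2.1 32MB (2295 ±16) against Madeleine 0.2 (2285 ±15).
print(f"{100 * approx_los(2295, 16, 2285, 15):.1f}")
```

For reference, the matrix lists 81.7 for this pair.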


Created in 2005-2012 by Kirill Kryukov
Updated on June 16, 2013