KCEC
(Kirr's Chess Engine Comparison)
A tournament of original free chess engines
June 16, 2013
Testing summary:
Total: 135,679 games
played by 202 programs
1398 CPU days (X2 4600+)

White wins: 55,227 (40.7%)
Black wins: 47,434 (35.0%)
Draws: 33,018 (24.3%)
White score: 52.9%
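
The percentages in the summary follow directly from the game counts. A minimal sketch of that arithmetic, assuming the usual chess scoring of 1 point for a win and half a point for a draw (the helper name `pct` is only illustrative):

```python
# Game counts copied from the testing summary above.
white_wins = 55_227
black_wins = 47_434
draws = 33_018

total = white_wins + black_wins + draws


def pct(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


# White's score: a win counts as 1 point, a draw as half a point.
white_score = pct(white_wins + 0.5 * draws, total)

print(total)
print(pct(white_wins, total), pct(black_wins, total), pct(draws, total))
print(white_score)
```

The three result counts add up to the stated total of 135,679 games, and the computed shares agree with the listed figures.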

Custom engine selection

Comparing 10 engines!
The best versions of the 10 selected engines played 352 games against each other

KCEC Rating List: Custom engine selection

Ponder off, neutral book (up to 8 moves), 3-4-5 piece EGTB
Time control: Equivalent to 40 moves in 4 minutes on Athlon 64 X2 4600+ (2.4 GHz)
Computed on June 16, 2013 with Bayeselo based on 135,679 games
Note: Please see how to read the list
Rank | Engine | Author (year) | Protocol | Flag | Continent | Rating | Av.Op./Perf. | Slope | Av.Df. | Drawness | Games | LOS
1 | Matacz 1.3 HT74 5-men-egbb | Maciej Pestka (2007) | WB | Pol | Europe | 2551 +18/−18 | - | +0.034 ±0.235 | o o | 110.0% ±9.6% | 1148 | 100
2 | Gosu 0.16 | Arkadiusz Paterek (2006) | WB | Pol | Europe | 2455 +16/−16 | V | +0.029 ±0.206 | o o | 91.3% ±8.2% | 1536 | 96 100 99.0 122 100
3 | Tytan 9.32 64-bit | Tomasz Michniewski (2007) | ! WB ! | Pol | Europe | 2429 +17/−17 | - | −0.109 ±0.366 | o o | 94.8% ±10.2% | 1312 | 26 100 314 100 218 100
4 | Nesik 0.7.0 alpha | Marek Strejczek (2004) | WB | Pol | Europe | 2237 +16/−16 | - | −0.032 ±0.128 | o o o o o | 101.8% ±9.9% | 1719 | 192 100 357 100 331 100
5 | Armageddon 2.308 | Grzegorz Sidorowicz (2007) | WB | Pol | Europe | 2098 +17/−17 | - | +0.053 ±0.215 | o o o | 112.5% ±11.7% | 1440 | 139 100 376 100 184 100
6 | Matant 5.04 | Antoni Szczepanski (2007) | ! WB ! | Pol | Europe | 2053 +17/−17 | V V | −0.140 ±0.226 | o o o o | 78.1% ±10.8% | 1504 | 45 100 358 100 219 100
7 | Enigma 1.1.4 | Kamil Przybyla (2004) | WB | Pol | Europe | 1879 +21/−21 | V | −0.084 ±0.147 | o o o o o | 102.9% ±13.9% | 1088 | 174 100 264 100 219 100
8 | Robin 0.983 | Piotr Dachtera (2003) | WB | Pol | Europe | 1834 +20/−20 | Λ | +0.003 ±0.118 | o o o o o o | 102.4% ±15.1% | 1248 | 45 100 238 94.5 64 100
9 | Laurifer 1.0 | Robert Lubczynski (2005) | ! WB ! | Pol | Europe | 1815 +18/−18 | Λ Λ Λ Λ Λ | +0.063 ±0.117 | o o o o o o o o | 106.4% ±12.4% | 1696 | 19 100 135 100 90
10 | Belzebub 0.67 | Radoslaw Kamowski (2008) | ! WB ! | Pol | Europe | 1744 +22/−22 | Λ Λ | +0.101 ±0.129 | o o o o o o | 133.5% ±15.0% | 992 | 71
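
To relate a rating gap to an expected result, the standard logistic Elo curve can be used as a rough guide. This is only a sketch: Bayeselo fits its own model, which also accounts for draws and the first-move advantage, so the plain formula below is an approximation and not how the list was computed. The function name `expected_score` is illustrative.

```python
def expected_score(rating_a, rating_b):
    """Expected score of A against B under the plain logistic Elo curve.

    Assumption: the classic 400-point scale, with no draw or
    first-move-advantage terms.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


# Ratings taken from the list above: Gosu 0.16 and Tytan 9.32 64-bit.
gosu, tytan = 2455, 2429

print(f"Gosu vs Tytan: {expected_score(gosu, tytan):.3f}")
print(f"Tytan vs Gosu: {expected_score(tytan, gosu):.3f}")
```

For the 26-point gap between Gosu and Tytan this gives an expected score of roughly 0.54 for the higher-rated engine.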

Score matrix

Custom engine selection (best versions only)
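# | Name | Elo | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10
1 | Matacz 1.3 HT74 5-men-egbb | 2551 | - | - | - | - | - | - | - | - | - | -
2 | Gosu 0.16 | 2455 | - | - | 58% (18.5/32) | - | - | - | - | - | - | -
3 | Tytan 9.32 64-bit | 2429 | - | 42% (13.5/32) | - | - | - | - | - | - | - | -
4 | Nesik 0.7.0 alpha | 2237 | - | - | - | - | 62% (20/32) | - | - | - | - | -
5 | Armageddon 2.308 | 2098 | - | - | - | 38% (12/32) | - | 69% (22/32) | - | - | - | -
6 | Matant 5.04 | 2053 | - | - | - | - | 31% (10/32) | - | - | 73% (23.5/32) | 88% (28/32) | -
7 | Enigma 1.1.4 | 1879 | - | - | - | - | - | - | - | 50% (16/32) | 53% (17/32) | 67% (21.5/32)
8 | Robin 0.983 | 1834 | - | - | - | - | - | 27% (8.5/32) | 50% (16/32) | - | 53% (17/32) | 72% (23/32)
9 | Laurifer 1.0 | 1815 | - | - | - | - | - | 12% (4/32) | 47% (15/32) | 47% (15/32) | - | 69% (22/32)
10 | Belzebub 0.67 | 1744 | - | - | - | - | - | - | 33% (10.5/32) | 28% (9/32) | 31% (10/32) | -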
Score color legend (only pairs with at least 20 games): 0% to 100% in 5% steps
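
The score matrix can be cross-checked against the stated total of 352 games. The sketch below copies the points of each listed pairing, converts them to percentages, and counts the games. Short engine names are used as dictionary keys for brevity. Python's built-in `round` (which rounds halves to the nearest even integer) happens to reproduce the listed percentages; that the site rounds the same way is an assumption.

```python
# Points scored by the first engine against the second, copied from the
# score matrix above. Every listed pairing consists of 32 games.
GAMES_PER_PAIR = 32
points = {
    ("Gosu", "Tytan"): 18.5,
    ("Nesik", "Armageddon"): 20.0,
    ("Armageddon", "Matant"): 22.0,
    ("Matant", "Robin"): 23.5,
    ("Matant", "Laurifer"): 28.0,
    ("Enigma", "Robin"): 16.0,
    ("Enigma", "Laurifer"): 17.0,
    ("Enigma", "Belzebub"): 21.5,
    ("Robin", "Laurifer"): 17.0,
    ("Robin", "Belzebub"): 23.0,
    ("Laurifer", "Belzebub"): 22.0,
}


def score_pct(pts, games=GAMES_PER_PAIR):
    """Score as a whole-number percentage of the points available."""
    return round(100 * pts / games)


for (first, second), pts in points.items():
    mirror = GAMES_PER_PAIR - pts  # the opponent's points in the same pairing
    print(f"{first} vs {second}: {score_pct(pts)}% / {score_pct(mirror)}%")

total_games = GAMES_PER_PAIR * len(points)
print(total_games)
```

Eleven pairings of 32 games each account for all 352 games played among the selected engines.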

LOS matrix

Custom engine selection (best versions only)
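Each cell shows the likelihood of superiority (LOS) of the row engine over the column engine, in percent. These numbers are computed using Bayeselo for the complete game database.
# | Name | Elo | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10
1 | Matacz 1.3 HT74 5-men-egbb | 2551 | - | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0
2 | Gosu 0.16 | 2455 | 0.0 | - | 99.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0
3 | Tytan 9.32 64-bit | 2429 | 0.0 | 1.0 | - | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0
4 | Nesik 0.7.0 alpha | 2237 | 0.0 | 0.0 | 0.0 | - | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0
5 | Armageddon 2.308 | 2098 | 0.0 | 0.0 | 0.0 | 0.0 | - | 100.0 | 100.0 | 100.0 | 100.0 | 100.0
6 | Matant 5.04 | 2053 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | 100.0 | 100.0 | 100.0 | 100.0
7 | Enigma 1.1.4 | 1879 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | 100.0 | 100.0 | 100.0
8 | Robin 0.983 | 1834 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | 94.5 | 100.0
9 | Laurifer 1.0 | 1815 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.5 | - | 100.0
10 | Belzebub 0.67 | 1744 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | -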
LOS color legend: 0 to 100 in steps of 10
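
For intuition about how a rating gap and its error margins translate into a likelihood of superiority, a rough normal approximation can be sketched as follows. This is not Bayeselo's method: Bayeselo derives LOS from its own model of the complete database, so this sketch is not expected to reproduce the matrix. It assumes that each +/- value in the rating list is a symmetric 95% interval and that the two rating estimates are independent; both are assumptions, and the function name `los_normal` is illustrative.

```python
import math


def los_normal(rating_a, err_a, rating_b, err_b, z95=1.96):
    """Approximate likelihood (in percent) that A is stronger than B.

    Assumptions (not taken from the source): err_a and err_b are
    half-widths of 95% intervals of independent normal estimates.
    """
    sigma_a = err_a / z95
    sigma_b = err_b / z95
    sigma_diff = math.hypot(sigma_a, sigma_b)  # std. dev. of the rating gap
    z = (rating_a - rating_b) / sigma_diff
    # Standard normal CDF expressed through the error function.
    return 100.0 * 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


# Robin 0.983 (1834 +/-20) against Laurifer 1.0 (1815 +/-18).
print(round(los_normal(1834, 20, 1815, 18), 1))
```

For Robin against Laurifer the approximation lands somewhat below the 94.5 shown in the matrix, which is consistent with it ignoring the correlation between rating estimates.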

Created in 2005-2012 by Kirill Kryukov
Updated on June 16, 2013