KCEC
(Kirr's Chess Engine Comparison)
A tournament of original free chess engines
June 16, 2013
Testing summary:
Total: 135,679 games
played by 202 programs
1398 CPU days (X2 4600+)

White wins: 55,227 (40.7%)
Black wins: 47,434 (35.0%)
Draws: 33,018 (24.3%)
White score: 52.9%

Custom engine selection

Comparing 4 engines!
4 best versions of selected engines played 32 games with each other

KCEC Rating List — Custom engine selection (Quote)

Ponder off, neutral book (up to 8 moves), 3-4-5 piece EGTB
Time control: Equivalent to 40 moves in 4 minutes on Athlon 64 X2 4600+ (2.4 GHz)
Computed on June 16, 2013 with Bayeselo based on 135,679 games
Note: Please see how to read the list
 RankEngine   RatingAv.
Op.
Perf.
Slope
Av.
Df.
Draw-
ness
GamesLOS
Mus 1 Muse 0.899b
Martin Fierz (2004)
! UCI !
Swi
Europe 2474 +16
−16
V
−0.138
±0.200
o
o
o
96.1%
±9.6%
1595      
100
   
For 2 Fortress 1.62
Alessandro Damiani (2000)
WB
Ita
Swi
Europe 2216 +15
−15
Λ
+0.002
±0.132
o
o
o
o
o
105.2%
±13.8%
1748
258
100
 
100
405
100
Cil 3 Cilian 4.14
Francois Scheurer (2004)
! WB !
Swi
Europe 2069 +16
−16
  +0.142
±0.152
o
o
o
o
127.9%
±10.4%
1632
147
100
928
100
670
 
Che 4 ChessterfieldCL i5a JA
Matthias Luscher (2007)
! WB !
Swi
Europe 1546 +29
−29
Λ
Λ
Λ
Λ
Λ
Λ
Λ
Λ
+0.276
±0.391
o
o
o
o
o
o
o
o
146.3%
±36.3%
544
523
   
     

Score matrix

Custom engine selection (best versions only)
#NameElo1234
1Muse 0.899b2474    
2Fortress 1.622216  77%
24.5/32
 
3Cilian 4.142069 23%
7.5/32
  
4ChessterfieldCL i5a JA1546    
Score color legend:
(Only pairs with at least 20 games)
0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100%

LOS matrix

Custom engine selection (best versions only)
Each cell shows likelihood of superiority of one engine over the other one, in percents. These numbers are computed using Bayeselo for the complete game database.
#NameElo1234
1Muse 0.899b2474 100.0100.0100.0
2Fortress 1.6222160.0 100.0100.0
3Cilian 4.1420690.00.0 100.0
4ChessterfieldCL i5a JA15460.00.00.0 
LOS color legend:
0 10 20 30 40 50 60 70 80 90 100

Alter engine selection



Alter output selection

Rating list
      Protocols
      Logos
      Flags
      Continents
     LOS columns:

Crosstables:
Results
Performances
Score
LOS
Ponder hit
Eval difference
Proportion of draws
Number of games
Number of connecting games
Percentage of connecting games
Expected score
Score with common opponents
Score with all opponents
Performance with common opponents
Performance with all opponents
LOS with common opponents
LOS with all opponents
Ponder hit with common opponents
Ponder hit with all opponents
Eval difference with common opponents
Eval difference with all opponents

Ponder hit: most similar pairs
Ponder hit: most similar pairs (different families only)
Ponder hit: most different pairs
Ponder hit: most different pairs (same families only)
Eval diff: most similar pairs
Eval diff: most similar pairs (different families only)
Eval diff: most different pairs
Eval diff: most different pairs (same families only)

Maximum size of cross-tables (from 2 to 200):
Limit crosstables to engines in Elo range: to

Cross-tables show only best version of each engine
Highlight diagonal of cells wide. (0 to highlight everything)

Reference rating list:
Recalibrate:
  No recalibration (reference and current list are compared as they are)
  Recalibrate reference list to current one using selected engines only
  Recalibrate reference list to current one using all common engines
  Recalibrate current list to reference using selected engines only
  Recalibrate current list to reference using all common engines


Created in 2005-2012 by Kirill Kryukov
Updated on June 16, 2013