KCEC
(Kirr's Chess Engine Comparison)
A tournament of original free chess engines
June 16, 2013
Testing summary:
Total: 135,679 games
played by 202 programs
1398 CPU days (X2 4600+)

White wins: 55,227 (40.7%)
Black wins: 47,434 (35.0%)
Draws: 33,018 (24.3%)
White score: 52.9%

Custom engine selection

Comparing 1 engines!
1 best versions of selected engines played 0 games with each other

KCEC Rating List — Custom engine selection (Quote)

Ponder off, neutral book (up to 8 moves), 3-4-5 piece EGTB
Time control: Equivalent to 40 moves in 4 minutes on Athlon 64 X2 4600+ (2.4 GHz)
Computed on June 16, 2013 with Bayeselo based on 135,679 games
Note: Please see how to read the list
 RankEngine   RatingAv.
Op.
Perf.
Slope
Av.
Df.
Draw-
ness
GamesLOS
Elf 1 Elf 1.3.0
Erdi Ata Bleda (2005)
! WB !
Tur
Asia 1820 +18
−18
Λ
Λ
Λ
Λ
−0.198
±0.168
o
o
o
o
o
o
o
o
86.8%
±12.6%
1628

Score matrix

Custom engine selection (best versions only)
#NameElo1
1Elf 1.3.01820 
Score color legend:
(Only pairs with at least 20 games)
0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100%

LOS matrix

Custom engine selection (best versions only)
Each cell shows likelihood of superiority of one engine over the other one, in percents. These numbers are computed using Bayeselo for the complete game database.
#NameElo1
1Elf 1.3.01820 
LOS color legend:
0 10 20 30 40 50 60 70 80 90 100

Alter engine selection



Alter output selection

Rating list
      Protocols
      Logos
      Flags
      Continents
     LOS columns:

Crosstables:
Results
Performances
Score
LOS
Ponder hit
Eval difference
Proportion of draws
Number of games
Number of connecting games
Percentage of connecting games
Expected score
Score with common opponents
Score with all opponents
Performance with common opponents
Performance with all opponents
LOS with common opponents
LOS with all opponents
Ponder hit with common opponents
Ponder hit with all opponents
Eval difference with common opponents
Eval difference with all opponents

Ponder hit: most similar pairs
Ponder hit: most similar pairs (different families only)
Ponder hit: most different pairs
Ponder hit: most different pairs (same families only)
Eval diff: most similar pairs
Eval diff: most similar pairs (different families only)
Eval diff: most different pairs
Eval diff: most different pairs (same families only)

Maximum size of cross-tables (from 2 to 200):
Limit crosstables to engines in Elo range: to

Cross-tables show only best version of each engine
Highlight diagonal of cells wide. (0 to highlight everything)

Reference rating list:
Recalibrate:
  No recalibration (reference and current list are compared as they are)
  Recalibrate reference list to current one using selected engines only
  Recalibrate reference list to current one using all common engines
  Recalibrate current list to reference using selected engines only
  Recalibrate current list to reference using all common engines


Created in 2005-2012 by Kirill Kryukov
Updated on June 16, 2013