CCRL 40/15 Testing Conditions (previously 40/40)
Re: CCRL 40/15 Testing Conditions (previously 40/40)
Thanks for the specific link. I can see Stockfish 14 in this selection. But I am unsure what it means. Does it mean that Stockfish 14 is not better than its predecessor Stockfish 13 and therefore is hidden in the main page http://ccrl.chessdom.com/ccrl/4040/ ? Thanks
- Graham Banks
- Posts: 27536
- Joined: Sun Dec 18, 2005 5:47 pm
- Sign-up code: 0
- Location: Auckland, NZ
Re: CCRL 40/15 Testing Conditions (previously 40/40)
That is the situation at present, but could well change.ThatsMe wrote: ↑Sat Aug 14, 2021 11:53 pm Thanks for the specific link. I can see Stockfish 14 in this selection. But I am unsure what it means. Does it mean that Stockfish 14 is not better than its predecessor Stockfish 13 and therefore is hidden in the main page http://ccrl.chessdom.com/ccrl/4040/ ? Thanks
4CPU testing is in progress also:
https://www.computerchess.org.uk/ccrl/4 ... ons_only=1
Re: CCRL 40/15 Testing Conditions (previously 40/40)
Thanks for the clarification. What I take away from this is that the CCRL differences are in any case not so huge that you can clearly see them.
Funny that the Stockfish community's own measurements showed an increase of about 41 ELO points between Stockfish 13 and Stockfish 14.
For example, their list https://nextchessmove.com/dev-builds says:
v14 (+415.16 ELO increase over Stockfish 7).
v13 (+373.78 ELO increase compared to Stockfish 7)
This means for me (of course I can be totally wrong) that between Stockfish 13 and Stockfish 14 there must have been an ELO increase of about 41 points.
I wonder why are all these ELO measurements so contradictory? And who's right..?
Funny that the Stockfish community's own measurements showed an increase of about 41 ELO points between Stockfish 13 and Stockfish 14.
For example, their list https://nextchessmove.com/dev-builds says:
v14 (+415.16 ELO increase over Stockfish 7).
v13 (+373.78 ELO increase compared to Stockfish 7)
This means for me (of course I can be totally wrong) that between Stockfish 13 and Stockfish 14 there must have been an ELO increase of about 41 points.
I wonder why are all these ELO measurements so contradictory? And who's right..?
- Graham Banks
- Posts: 27536
- Joined: Sun Dec 18, 2005 5:47 pm
- Sign-up code: 0
- Location: Auckland, NZ
Re: CCRL 40/15 Testing Conditions (previously 40/40)
Never rely on just one rating list.
Check the other well-known ones and you'll get a pretty good overall picture.
Check the other well-known ones and you'll get a pretty good overall picture.
Re: CCRL 40/15 Testing Conditions (previously 40/40)
That's what I thought. Maybe the Stockfish team overestimated themselves in their own assessments. Thanks.
Re: CCRL 40/15 Testing Conditions (previously 40/40)
The Stockfish results are all over the place in the various lists. A couple of them do in fact agree with the fishtest results. There are so many variables - time control, threads and book choice for example. In our case I believe different hardware can also impact the results especially today with NNUE engines performing better on AVX2 machines etc. I don't think that the hardware made a significant difference a few years ago, with adapted time control of course, but now the situation has probably changed.
In terms of the list lagging behind, that is simply down to resources of course.
In terms of the list lagging behind, that is simply down to resources of course.
-
- Posts: 11
- Joined: Fri Aug 20, 2021 1:36 am
- Sign-up code: 10159
Re: CCRL 40/15 Testing Conditions (previously 40/40)
CCRL is the only ratings list that.I know of that doesn't have any visible elo difference between Stockfish 13 and Stockfish 14. Other ratings lists like FGRL and CEGT largely agree with Fishtest estimates.
- Graham Banks
- Posts: 27536
- Joined: Sun Dec 18, 2005 5:47 pm
- Sign-up code: 0
- Location: Auckland, NZ
Re: CCRL 40/15 Testing Conditions (previously 40/40)
That's why you should look at many rating lists in order to get the overall picture.Madeleine Birchfield wrote: ↑Fri Aug 20, 2021 9:13 pmCCRL is the only ratings list that.I know of that doesn't have any visible elo difference between Stockfish 13 and Stockfish 14. Other ratings lists like FGRL and CEGT largely agree with Fishtest estimates.
Re: CCRL 40/15 Testing Conditions (previously 40/40)
Hie
1) I would like to know which version of program are used for ccrl 40/15 rating list :
64 bit win ?
avx 2 ?
bmi 2 ?
sse 42 ?
2) The xmchess programm has won aginst stockfish_14_1 ( 20 seconds for each moves) for the first time yesterday (he has lost too against this great program !!) . I was very happy.
this new proramm will be avalaible soon
xmchess
1) I would like to know which version of program are used for ccrl 40/15 rating list :
64 bit win ?
avx 2 ?
bmi 2 ?
sse 42 ?
2) The xmchess programm has won aginst stockfish_14_1 ( 20 seconds for each moves) for the first time yesterday (he has lost too against this great program !!) . I was very happy.
this new proramm will be avalaible soon
xmchess
- Graham Banks
- Posts: 27536
- Joined: Sun Dec 18, 2005 5:47 pm
- Sign-up code: 0
- Location: Auckland, NZ
Re: CCRL 40/15 Testing Conditions (previously 40/40)
I would imagine that we're all using either bmi2 or avx2, but an sse4.2 compile is probably a good idea too.xmchess wrote: ↑Sat Jan 15, 2022 1:14 pm Hie
1) I would like to know which version of program are used for ccrl 40/15 rating list :
64 bit win ?
avx 2 ?
bmi 2 ?
sse 42 ?
2) The xmchess programm has won aginst stockfish_14_1 ( 20 seconds for each moves) for the first time yesterday (he has lost too against this great program !!) . I was very happy.
this new proramm will be avalaible soon
xmchess
Re: CCRL 40/15 Testing Conditions (previously 40/40)
hie thanxs
the experimental version 0.1 of xmchess will be avalaible soon.
this version is about 3300-3400 elo in 40/15
Is it possible to be in the ccrl list 40/15 ?
is it free ?
how to get ranked
thanks a lot
the experimental version 0.1 of xmchess will be avalaible soon.
this version is about 3300-3400 elo in 40/15
Is it possible to be in the ccrl list 40/15 ?
is it free ?
how to get ranked
thanks a lot
- Gabor Szots
- Posts: 13198
- Joined: Sat Dec 09, 2006 6:30 am
- Sign-up code: 10159
- Location: Szentendre, Hungary
Re: CCRL 40/15 Testing Conditions (previously 40/40)
Hie
No it will not be open source
sorry ...
I am looking for investors to value the new ia algorithms used in xmchess.
No it will not be open source
sorry ...
I am looking for investors to value the new ia algorithms used in xmchess.
-
- Posts: 2195
- Joined: Thu Aug 05, 2021 2:35 pm
- Sign-up code: 10159
- Location: Cavite, Philippines
- Contact:
Re: CCRL 40/15 Testing Conditions (previously 40/40)
How long u work for your engine?
CCRL Testing Group
Re: CCRL 40/15 Testing Conditions (previously 40/40)
i im working for xmchess since 2 years !!!
the final version will have a new neural network with 300 000 parameters (a non nnue neural network). It will use two new lerning algorithms and a genetic algorithm.
the final version will have a new neural network with 300 000 parameters (a non nnue neural network). It will use two new lerning algorithms and a genetic algorithm.
Re: CCRL 40/15 Testing Conditions (previously 40/40)
Hie
the experimental version 0.1 of xmchess will be avalaible soon.
this version is about 3300-3400 elo in 40/15
Is it possible to be in the ccrl list 40/15 ?
how to get ranked ?
Thx a lot
the experimental version 0.1 of xmchess will be avalaible soon.
this version is about 3300-3400 elo in 40/15
Is it possible to be in the ccrl list 40/15 ?
how to get ranked ?
Thx a lot
- Gabor Szots
- Posts: 13198
- Joined: Sat Dec 09, 2006 6:30 am
- Sign-up code: 10159
- Location: Szentendre, Hungary
Re: CCRL 40/15 Testing Conditions (previously 40/40)
There is no rule for that. Just release it, announce it and hope that someone might pick it up.
I recommend you reveal your real name and country but that is not obligatory if you want to remain anonym. However, some raw background information about your engine (origins, history, etc.) will be required. And most of all: exes running under various Windows platforms.
PS. Please open a new thread for your engine. This one is not for announcing and discussing engines.
Re: CCRL 40/15 Testing Conditions (previously 40/40)
thxs a lot
xavier merrheim researcher in ai University of Lyon, France i will open a new topic
xavier merrheim researcher in ai University of Lyon, France i will open a new topic
-
- Posts: 3
- Joined: Thu Oct 27, 2022 1:16 am
- Sign-up code: 10159
- Location: Canada
Re: CCRL 40/15 Testing Conditions (previously 40/40)
I think this is the only place I've been able to find this information, maybe it's documented privately I'm guessing from some of the other discussion here, can anyone confirm this? From this post I'm assuming that 2054 is the time produced on real hardware for the "standard" machine running the Stockfish 10 bench ... can anyone confirm which binary was used to obtain this value (bmi2, x64, etc.)?
Re: CCRL 40/15 Testing Conditions (previously 40/40)
Yes, Graham has an i7-4770k so it is an actual result on an actual machine. Another tester has similar Intel hardware and got the same results.Paul Galbraith wrote: ↑Fri Oct 28, 2022 2:07 amFrom this post I'm assuming that 2054 is the time produced on real hardware for the "standard" machine running the Stockfish 10 bench ... can anyone confirm which binary was used to obtain this value (bmi2, x64, etc.)?In Windows you just double-click the Stockfish 10 exe and type "bench", Adapted time control for 40/15 would be/:
T minutes for 40 moves where T = Your time / 2054 x 15 (for 40/15)
-
- Posts: 3
- Joined: Thu Oct 27, 2022 1:16 am
- Sign-up code: 10159
- Location: Canada
Re: CCRL 40/15 Testing Conditions (previously 40/40)
Thanks, Ray. Do you happen to know which binary was used by Graham?Ray wrote: ↑Fri Oct 28, 2022 11:38 pmYes, Graham has an i7-4770k so it is an actual result on an actual machine. Another tester has similar Intel hardware and got the same results.Paul Galbraith wrote: ↑Fri Oct 28, 2022 2:07 am From this post I'm assuming that 2054 is the time produced on real hardware for the "standard" machine running the Stockfish 10 bench ... can anyone confirm which binary was used to obtain this value (bmi2, x64, etc.)?
- Graham Banks
- Posts: 27536
- Joined: Sun Dec 18, 2005 5:47 pm
- Sign-up code: 0
- Location: Auckland, NZ
Re: CCRL 40/15 Testing Conditions (previously 40/40)
The machine supports both bm12 and avx2, so would have been the bm12 one, as the modern one is just popcount as far as I'm aware.Paul Galbraith wrote: ↑Sat Oct 29, 2022 12:54 amThanks, Ray. Do you happen to know which binary was used by Graham?Ray wrote: ↑Fri Oct 28, 2022 11:38 pmYes, Graham has an i7-4770k so it is an actual result on an actual machine. Another tester has similar Intel hardware and got the same results.Paul Galbraith wrote: ↑Fri Oct 28, 2022 2:07 am From this post I'm assuming that 2054 is the time produced on real hardware for the "standard" machine running the Stockfish 10 bench ... can anyone confirm which binary was used to obtain this value (bmi2, x64, etc.)?
-
- Posts: 3
- Joined: Thu Oct 27, 2022 1:16 am
- Sign-up code: 10159
- Location: Canada
Re: CCRL 40/15 Testing Conditions (previously 40/40)
Thanks, Graham!Graham Banks wrote: ↑Sat Oct 29, 2022 1:25 am The machine supports both bm12 and avx2, so would have been the bm12 one, as the modern one is just popcount as far as I'm aware.
Re: CCRL 40/15 Testing Conditions (previously 40/40)
I have a question in regards to relating the benchmark calibration to running conditions. The answer is perhaps obvious, but better to ask since I have not seen it addressed in this forum thread.
i) How many simultaneous games do CCRL testers typically run?
ii) If it is more than one, is benchmark calibration performed when the machine running a CPU load in line with what is expected during the tournament?
In Kirill's post he suggests rebooting to get a clean slate before benchmarking. But that was back in the years of single-CPU computers.
To be specific to my situation I have a 48-CPU test computer, and use cutechess-cli. I set "-concurrency 48" when running engines single-threaded, or "-concurrency 12" with engines configured to 4 threads. The benchmark result under heavy load is about double the time when run while the machine is idle. That makes a big difference to time control settings if I want a good match CCRL testing conditions.
i) How many simultaneous games do CCRL testers typically run?
ii) If it is more than one, is benchmark calibration performed when the machine running a CPU load in line with what is expected during the tournament?
In Kirill's post he suggests rebooting to get a clean slate before benchmarking. But that was back in the years of single-CPU computers.
To be specific to my situation I have a 48-CPU test computer, and use cutechess-cli. I set "-concurrency 48" when running engines single-threaded, or "-concurrency 12" with engines configured to 4 threads. The benchmark result under heavy load is about double the time when run while the machine is idle. That makes a big difference to time control settings if I want a good match CCRL testing conditions.
- Gabor Szots
- Posts: 13198
- Joined: Sat Dec 09, 2006 6:30 am
- Sign-up code: 10159
- Location: Szentendre, Hungary
Re: CCRL 40/15 Testing Conditions (previously 40/40)
To begin with, I would not let the engines use all threads simultaneously. I'd reserve a thread or two for system things. Just to be on the safe side. I may be wrong though.jkominek wrote: ↑Wed Dec 21, 2022 3:35 pm I have a question in regards to relating the benchmark calibration to running conditions. The answer is perhaps obvious, but better to ask since I have not seen it addressed in this forum thread.
i) How many simultaneous games do CCRL testers typically run?
ii) If it is more than one, is benchmark calibration performed when the machine running a CPU load in line with what is expected during the tournament?
In Kirill's post he suggests rebooting to get a clean slate before benchmarking. But that was back in the years of single-CPU computers.
To be specific to my situation I have a 48-CPU test computer, and use cutechess-cli. I set "-concurrency 48" when running engines single-threaded, or "-concurrency 12" with engines configured to 4 threads. The benchmark result under heavy load is about double the time when run while the machine is idle. That makes a big difference to time control settings if I want a good match CCRL testing conditions.
The machine I use has only 4 cores. They are all run always at the same speed. I have set this in BIOS and check the benchmarks every now and then. Load has no impact. I never run more than 3 tournaments at a time.