Ratings of chess engines are well established on the known rating lists. Will anything change when we feed engines with only (playable) gambit positions? We carefully created 100 gambit opening positions to find out.
1. Only active engines, those who are in development.
2. Time control CCRL style, 40/2. Hash table size 128Mb.
3. Cutechess-cli concurrency compatible.
1. Inactive engines, those engines who haven't been updated
for a long time.
2. Known derivatives.
Gambit Rating List
Gambit Rating List : 2021-10-18
Time Control : 40 moves in 2 minutes repeating
Software : cutechess-cli 1.1.0.f.1.0
Elo calculation : Ordo 1.2.6
Games : 180.053
Head to Head
Gambit Rating List
on 20 cores and 4Gb Hash
Whenever a new engine manages to enter the top-5 of the Gambit Rating List it wil be included in the 20 core rating list. The newest entry is Koivisto 6.16 The latest update update is Komodo-Dragon 2.5 which gained 63 elo and is now only 13 elo points away from Stockfish 14.
With the start of the Gambit Rating List we modeled the ratings to be compatible with CCRL. Now (day of writing July 9, 2021) with over 90,000 games played we can make a comparison and conclude it does. Consider:
Our first conclusion is that the top engines profit the most, see the full list. A few observations:
Observation 1 - Looking at the top-3
On CCRL the difference between Stockfish and Komodo is 37 elo, on GRL 86.
On CCRL the difference between Stockfish and SlowChess is 127 elo, on GRL 246.
Stockfish and Komodo massively profited from the gambit openings.
Observation 2 - if we look further from rank 4-20 (with the exception of Nemorini) we see that these engines still profit from the gambit openings but less and less. Engines after rank 20 even start to lose considerable elo. The biggest exceptions are : Tucano 9.0 +45 | Benjamin +41 and Supernova 2.3 +35. While we are not familiar with the playing style of Tucano and Supernova we are with Benjamin because it's the gambit version of ProDeo.
Observation 3 - Benjamin -- because of its playing style -- greatly profits (+87 elo) when we compare the ratings with ProDeo 3.0 and ProDeo 3.1 while 3.0 and 3.1 definitely are stronger, not only in self-play but also when we compare the ratings of CCRL and GRL.
On CCRL ProDeo 3.0 scores 77 elo more than Benjamin while on the GRL Benjamin rates 10 points more than ProDeo 3.0, a difference of 87 elo!
1. The GRL makes sense.
2. Making a special gambit version makes even more sense.
Anchore Engines are inactive rock solid engines with well established elo ratings that are needed to stabilize the ORDO elo calculation. Anchore Engines will disappear in time when more games are played, they are only needed at the start of a rating list. However games played by Anchore Engines are included in the PGN download.