Hello. A mini-tournament is underway to test the new book, Tria 1.5a Varied by Saldom, and another book, ZBook. Now, my new way of testing books is to bring any new books being tested, and add the top 5 books I have into the "tournament" to test the book's ability to go against lines of other books. This "top 5" excludes books under the same name, and usually the same author, to ensure diversity in openings.
Time control: Five minutes
Engine Score
1: Tria 1.4L Houdini 61.0/93 <-- New contender
2: Tria 1.5L Varied Houdini 56.0/92
3: Aaricia 11.11 Houdini 2 48.5/92
4: Perfect 2012 Houdini 38.5/92
5: ZBook Houdini 2 38.0/92 <-- New contender
6: Morphius Houdini Wizzie 35.0/93
Important note:
Unfortunately, I accidently changed the book settings for the Arena Book. I made Tria 1.5 Varied the Arena book, and this gave Tria 1.5L Varied an advantage over other books. I have fixed this halfway in the tournament, and since then, Tria 1.5L Varied has fallen into second place. I WILL update this score sheet later on, as I feel 92 games isn't enough to give a clear, defined picture of ratings considering my mistake with the book settings.
I have decided not to test Lange and MrQ unless somebody requests I do so. Looking at the size of the books, I don't think it would have any possiblity of being a top book. Saldom's tests at http://www.tpp89.org/p-abk-2.htm for the two books prove my guess to be correct.
Time control: Five minutes
Engine Score
1: Tria 1.4L Houdini 61.0/93 <-- New contender
2: Tria 1.5L Varied Houdini 56.0/92
3: Aaricia 11.11 Houdini 2 48.5/92
4: Perfect 2012 Houdini 38.5/92
5: ZBook Houdini 2 38.0/92 <-- New contender
6: Morphius Houdini Wizzie 35.0/93
Important note:
Unfortunately, I accidently changed the book settings for the Arena Book. I made Tria 1.5 Varied the Arena book, and this gave Tria 1.5L Varied an advantage over other books. I have fixed this halfway in the tournament, and since then, Tria 1.5L Varied has fallen into second place. I WILL update this score sheet later on, as I feel 92 games isn't enough to give a clear, defined picture of ratings considering my mistake with the book settings.
I have decided not to test Lange and MrQ unless somebody requests I do so. Looking at the size of the books, I don't think it would have any possiblity of being a top book. Saldom's tests at http://www.tpp89.org/p-abk-2.htm for the two books prove my guess to be correct.
In my humble opinion, if possible, it is necessary to test all the new books. Meanwhile, more books by authors who produce not the first book.
And focus on my tests are not worth it. Different computers, different time control.
And focus on my tests are not worth it. Different computers, different time control.
And the strength of the game on the size of the book can not be judged. Very often great books are much worse than the carefully crafted book.
Do not forget about the depth of the book.
Lange_2011.abk
Filesize: 39mb
Book depth: 20 halfmoves
TriA + JK 1.4L +. Abk
Filesize: 246mb
Book depth: 200 halfmoves
... If Evgeny Timoshchuk make his book a depth of 200 halfmoves his book will become much more of my books.
Do not forget about the depth of the book.
Lange_2011.abk
Filesize: 39mb
Book depth: 20 halfmoves
TriA + JK 1.4L +. Abk
Filesize: 246mb
Book depth: 200 halfmoves
... If Evgeny Timoshchuk make his book a depth of 200 halfmoves his book will become much more of my books.
Then the question is, how is a book with that few moves going to get anywhere close to the top?
Alrite, if you want me to test Lango and MrQ, I will test them...
Results will take a few days, unfortunately. I need to redo and change some stuff up to ensure my results are accurate.
Quick question: What exactly is a "halfmove"?
Alrite, if you want me to test Lango and MrQ, I will test them...
Results will take a few days, unfortunately. I need to redo and change some stuff up to ensure my results are accurate.
Quick question: What exactly is a "halfmove"?
Dear Dark_wizzie,
You should decide what books are and how to test, I only expressed my opinion.
Best regards,
Aleksandr
You should decide what books are and how to test, I only expressed my opinion.
Best regards,
Aleksandr
halfmove - deep plies in the book. 200 halfmove - means that the book has information for 100 of the black and white.
Engine Score
1: Tria 1.4L Houdini 60.5/98
2: Tria 1.5L Varied Houdini 56.0/99
3: Aaricia 11.11 Houdini 51.0/98
4: Perfect 2012 Houdini 43.5/99
5: ZBook Houdini 38.5/98
6: Quinox 4 Houdini 31.0/56
7: MrQ1 Houdini 29.0/56
8: Lange_2011 Houdini 20.5/56
Standings after ZBook and Lange are eliminated for being the last and second to last place:
Engine Score
1: Tria 1.4L Houdini 42.0/72
2: Tria 1.5L Varied Houdini 40.0/72
3: Aaricia 11.11 Houdini 35.5/72
4: Perfect 2012 Houdini 26.5/73
5: Quinox 4 Houdini 21.5/40
6: MrQ1 Houdini 19.5/41
Now adding TriA+JK 1.6.1L+.abk although I was hoping for TriA+JK 1.6.1A40.abk...
1: Tria 1.4L Houdini 60.5/98
2: Tria 1.5L Varied Houdini 56.0/99
3: Aaricia 11.11 Houdini 51.0/98
4: Perfect 2012 Houdini 43.5/99
5: ZBook Houdini 38.5/98
6: Quinox 4 Houdini 31.0/56
7: MrQ1 Houdini 29.0/56
8: Lange_2011 Houdini 20.5/56
Standings after ZBook and Lange are eliminated for being the last and second to last place:
Engine Score
1: Tria 1.4L Houdini 42.0/72
2: Tria 1.5L Varied Houdini 40.0/72
3: Aaricia 11.11 Houdini 35.5/72
4: Perfect 2012 Houdini 26.5/73
5: Quinox 4 Houdini 21.5/40
6: MrQ1 Houdini 19.5/41
Now adding TriA+JK 1.6.1L+.abk although I was hoping for TriA+JK 1.6.1A40.abk...
Hello, it a little strange result, because in Saldom site is a little different :
1 Aaricia 11.11.ABK 18.0/23 78.2 ··· 2-1-2 2-0-3 5-0-3 5-0-0 142,25
2 TriA+JK 1.6.1A40.abk 8.5/18 47.2 1-2-2 ··· 1-1-2 2-1-2 0-1-3 87,50
3 Lange 2011.abk 8.5/19 44.7 0-2-3 1-1-2 ··· 0-0-5 0-0-5 81,50
4 MrQ1.abk 8.0/22 36.3 0-5-3 1-2-2 0-0-5 ··· 2-2-0 79,25
5 TriA+JK 1.6.1 Varied.abk 7.0/18 38.8 0-5-0 1-0-3 0-0-5 2-2-0 ··· 58,50
http://www.tpp89.org/p-abk-2.htm
S.
1 Aaricia 11.11.ABK 18.0/23 78.2 ··· 2-1-2 2-0-3 5-0-3 5-0-0 142,25
2 TriA+JK 1.6.1A40.abk 8.5/18 47.2 1-2-2 ··· 1-1-2 2-1-2 0-1-3 87,50
3 Lange 2011.abk 8.5/19 44.7 0-2-3 1-1-2 ··· 0-0-5 0-0-5 81,50
4 MrQ1.abk 8.0/22 36.3 0-5-3 1-2-2 0-0-5 ··· 2-2-0 79,25
5 TriA+JK 1.6.1 Varied.abk 7.0/18 38.8 0-5-0 1-0-3 0-0-5 2-2-0 ··· 58,50
http://www.tpp89.org/p-abk-2.htm
S.
A difference in place in a testing scene like that isn't surprising at all. Don't forget that Saldom and I probrably use different time controls and hardware. We also have different setting for Arena Mainbook weights. I've also done more games of chess total than in Saldom with Aaricia/Tria, etc. (Looks like Saldom only did 23 games for testing Aaricia, while I've done 100+ for this testing alone, not forgetting the other 100+ games per engine for ABK Book Bash.) Also, there is also the factor of pure luck. A little of this is up to chance. Saldom's earlier tests (also with Aairicia, I think) show Tria leading, from what I remember. Finally, we have different books up for testing, that might affect the results a little bit. Some books are better against others than some. By the way, Aaricia is closer to Tria 1.4L as ever, as it's only 4.5 points off.
Don't worry, more tests are on the way, just give me a day or two and I'll post the results.
For now, here are the results I have currently.
Engine Score
1: Tria 1.4L Houdini 48.0/85
2: Tria 1.5L Varied Houdini 47.0/86
3: Aaricia 11.11 Houdini 2 43.5/86
4: Perfect 2012 Houdini 33.5/87
5: Quinox 4 Houdini 31.5/60
6: Tria 1.6.1L+ Houdini 2 31.0/60
7: MrQ1 Houdini 2.0 27.5/60
Don't worry, more tests are on the way, just give me a day or two and I'll post the results.
For now, here are the results I have currently.
Engine Score
1: Tria 1.4L Houdini 48.0/85
2: Tria 1.5L Varied Houdini 47.0/86
3: Aaricia 11.11 Houdini 2 43.5/86
4: Perfect 2012 Houdini 33.5/87
5: Quinox 4 Houdini 31.5/60
6: Tria 1.6.1L+ Houdini 2 31.0/60
7: MrQ1 Houdini 2.0 27.5/60
Here you have the perfect proof that the little book may be stronger than large.
My experiment has failed.
The fact that increased Polyglot book - weakened the book for the Arena. I'm sorry.
My experiment has failed.
The fact that increased Polyglot book - weakened the book for the Arena. I'm sorry.
Not really, Saldom. You are too quick to put your own books down. Look at this:
Engine Score
1: Tria 1.4L Houdini 55.5/101 .549 2nd place -Same author as other Tria books;all tria books count as one book; Book contestant 1
2: Aaricia 11.11 Houdini 2 52.0/101 .514 3rd place - Book contestant 2
3: Tria 1.5L Varied Houdini 51.0/102 .5 5th place -Same author as other Tria books
4: Tria 1.6.1L+ Houdini 2 50.5/91 .554 1st place -Same author as other Tria books
5: Quinox 4 Houdini 47.0/92 .51 4th place - Book contestant 3
6: MrQ1 Houdini 2.0 42.5/92 .461 6th place -Book contestant 4
7: Perfect 2012 Houdini 42.5/103 .412 7th place -Book contestant 5
If you want to find generally who would be in first place if all books had the same amount of games, divide won games by games played. Tria 1.6.1 is BEATING 1.4L if it keeps up this ratio of won games.
Now:
1) Very curious thing happening: I saw how Perfect 2012 is clearly the losing book here. So if I remove it from the list, this happens:
Engine Score
1: Tria 1.6.1L+ Houdini 2 44.0/77 .571
2: Tria 1.4L Houdini 41.0/83 .493
3: Tria 1.5L Varied Houdini 39.5/83
4: Aaricia 11.11 Houdini 2 39.5/82
5: Quinox 4 Houdini 39.0/77
6: MrQ1 Houdini 2.0 36.0/76
Aaricia drops from third place to fourth place! Also, Tria 1.6.1 gains a huge lead! It seems like Tria 1.6.1 is unable to effectively beat Perfect 2012, while Aaricia was able to. However, according to by rules, the books being tested all count as one book in my "tournament of five", so Perfect 2012 belongs on that list since it is the strongest book not counting the other books in this test. (I have quite a few other ABK books that I do not use to test Tria since they are too weak.)
I will continue to test. As of now, Aaricia is still in 3rd place.
Also, Saldom:
I would appreciate immensely if you focus a lot on your 20 minute time control tournaments, since I'm focusing on the 5 minute time controls.
Engine Score
1: Tria 1.4L Houdini 55.5/101 .549 2nd place -Same author as other Tria books;all tria books count as one book; Book contestant 1
2: Aaricia 11.11 Houdini 2 52.0/101 .514 3rd place - Book contestant 2
3: Tria 1.5L Varied Houdini 51.0/102 .5 5th place -Same author as other Tria books
4: Tria 1.6.1L+ Houdini 2 50.5/91 .554 1st place -Same author as other Tria books
5: Quinox 4 Houdini 47.0/92 .51 4th place - Book contestant 3
6: MrQ1 Houdini 2.0 42.5/92 .461 6th place -Book contestant 4
7: Perfect 2012 Houdini 42.5/103 .412 7th place -Book contestant 5
If you want to find generally who would be in first place if all books had the same amount of games, divide won games by games played. Tria 1.6.1 is BEATING 1.4L if it keeps up this ratio of won games.
Now:
1) Very curious thing happening: I saw how Perfect 2012 is clearly the losing book here. So if I remove it from the list, this happens:
Engine Score
1: Tria 1.6.1L+ Houdini 2 44.0/77 .571
2: Tria 1.4L Houdini 41.0/83 .493
3: Tria 1.5L Varied Houdini 39.5/83
4: Aaricia 11.11 Houdini 2 39.5/82
5: Quinox 4 Houdini 39.0/77
6: MrQ1 Houdini 2.0 36.0/76
Aaricia drops from third place to fourth place! Also, Tria 1.6.1 gains a huge lead! It seems like Tria 1.6.1 is unable to effectively beat Perfect 2012, while Aaricia was able to. However, according to by rules, the books being tested all count as one book in my "tournament of five", so Perfect 2012 belongs on that list since it is the strongest book not counting the other books in this test. (I have quite a few other ABK books that I do not use to test Tria since they are too weak.)
I will continue to test. As of now, Aaricia is still in 3rd place.
Also, Saldom:
I would appreciate immensely if you focus a lot on your 20 minute time control tournaments, since I'm focusing on the 5 minute time controls.
Engine Score
1: Tria 1.6.1L+ Houdini 2 76.0/133 .571
2: Tria 1.4L Houdini 75.0/133 .563
3: Aaricia 11.11 Houdini 2 69.5/132 .526
4: Quinox 4 Houdini 64.5/132 .488
5: Tria 1.5L Varied Houdini 63.0/133 .379
6: MrQ1 Houdini 2.0 59.0/133 .443
7: Perfect 2012 Houdini 57.0/132 .431
Results fit my exact predictions.
...But, the testing isn't over. Tria 1.6.1L+ and 1.4l+ are too close to count.
I will continue to run more tests.
1: Tria 1.6.1L+ Houdini 2 76.0/133 .571
2: Tria 1.4L Houdini 75.0/133 .563
3: Aaricia 11.11 Houdini 2 69.5/132 .526
4: Quinox 4 Houdini 64.5/132 .488
5: Tria 1.5L Varied Houdini 63.0/133 .379
6: MrQ1 Houdini 2.0 59.0/133 .443
7: Perfect 2012 Houdini 57.0/132 .431
Results fit my exact predictions.
...But, the testing isn't over. Tria 1.6.1L+ and 1.4l+ are too close to count.
I will continue to run more tests.
HI Saldom.
If I adjust the Arena Mainbook setting for Testo book, will it affect the way another book will use their moves? Or does each book have their own weights for moves played, priority, etc?
What time controls are you using for your website to test Aaricia and Tria 1.6?
Thanks
If I adjust the Arena Mainbook setting for Testo book, will it affect the way another book will use their moves? Or does each book have their own weights for moves played, priority, etc?
What time controls are you using for your website to test Aaricia and Tria 1.6?
Thanks
If the engine configuration is not set "main book isprolzovat Arena" - that will not be affected.
I use the control 20 minutes per batch + 10 seconds per move
I use the control 20 minutes per batch + 10 seconds per move
Hmm, then what weights for "priority", "won games", and "games played" will be set for those books not set as Arena Mainbook? Default? Or same as Mainbook?
To test the books were accurate and honest, you must:
- Do not modify the author of the book.
- Do not change the settings of the books.
- Do not include the engine configuration option "Use the basic book of the Arena"
- Do not modify the author of the book.
- Do not change the settings of the books.
- Do not include the engine configuration option "Use the basic book of the Arena"
Ok Saldom. I want to set up an accurate, honest tournament. I'd appreciate if you would elaborate on what you mean.
Here is my understanding, correct me if I'm wrong, please.
All books have settings that apply whether they are mainbook or not.
All books have settings that may only be changed if they are the mainbook.
The book system is confusing for Arena. I'd really like to provide accurate testing, and testing all the games in the world means nothing if my settings are whacko.
I have double checked everything, and the only book that got modified in Arena settings is Tria 1.4L+. I have fixed the error.
Thank you.
Here is my understanding, correct me if I'm wrong, please.
All books have settings that apply whether they are mainbook or not.
All books have settings that may only be changed if they are the mainbook.
The book system is confusing for Arena. I'd really like to provide accurate testing, and testing all the games in the world means nothing if my settings are whacko.
I have double checked everything, and the only book that got modified in Arena settings is Tria 1.4L+. I have fixed the error.
Thank you.
Dark_wizzie,
To test books must meet the rules of my previous posts.
Sorry for my English.
Best regards
To test books must meet the rules of my previous posts.
Sorry for my English.
Best regards
Hello Dark_wizzie, can You test my new book, please,
Later please share results in PGN,
Sekos
Later please share results in PGN,
Sekos
Powered by mwForum 2.27.4 © 1999-2012 Markus Wichitill
