Overview
Run # |
Reference |
Summary |
Currently Active |
Net Numbers |
Best nets |
NA |
Old Main |
Original 192x15 “main” run |
No |
1 to 601 |
ID595 |
test10 |
[[Lc0 Transition]] |
Original 256x20 test run |
No |
10'000 to 11'262 |
11250 11248 |
test20 |
Training run reset |
Many changes, see blog. |
No |
20'001 to 22'201 |
22018 |
test30 |
TB rescoring |
Experiment with network initialization strategy, trying to solve spike issues. Experiment with Tablebase rescoring |
No |
30'001 to 33'005 |
32930 |
LR Drop
Training Run |
1st LR drop |
Elo |
2nd LR drop |
Elo |
3rd LR drop |
Elo |
Best Net |
Elo |
Current best |
Old Main |
|
|
|
|
|
|
ID 595 |
3148 |
|
Test 10 |
ID 10077 |
|
ID 10320 |
|
ID 11013 |
|
ID 11248 |
3282 |
* |
Test 20 |
ID 20247 |
2318 |
ID 20493 |
|
ID 21281 |
|
ID 22018 |
3118 |
|
Test 30 |
ID 30854 |
|
|
|
|
|
|
|
|
ID for test 20 to be checked
Sampling ratio
Most data from this sheet
- Alpha Zero reference paper
Use best guess for games length and assuming resign cuts game length by 30%
- Old Main
Initially new networks generated based on fixed timing rather than on games
Item |
A0 with resign |
A0 w/out resign |
Main up to ID xxx |
Main from ID xxx |
Main from IDyyy to ID598 |
Test 10 |
Test 20 |
Positions per training game |
95 |
135 |
135 |
135 |
135 |
135 |
———– |
New networks per day |
———– |
|
6 |
6 |
|
|
|
Training Games per day |
———– |
|
160,000 |
160,000 |
|
|
|
Training Games per network |
———– |
|
26,700 |
26,700 |
40,000 |
40,000 |
|
Total training games |
44,000,000 |
44,000,000 |
|
|
25,000,000 |
|
|
Positions generated per day |
———– |
————- |
21,600,000 |
21,600,000 |
|
|
|
Positions generated per network |
———– |
————- |
3,600,000 |
3,600,000 |
5,400,000 |
5,400,000 |
|
Total positions generated |
4.158 B |
5.940 B |
|
|
|
|
|
Batch size |
4,096 |
4,096 |
1,024 |
256 |
256 |
2,048 |
|
Training steps per day |
———– |
————- |
300,000 |
300,000 |
|
|
|
Training steps per network |
———– |
————- |
50,000 |
50,000 |
10,000 |
2,500 |
|
Total training steps |
700,000 |
700,000 |
|
|
|
|
|
Positions trained per day |
———– |
————- |
307,200,000 |
76,800,000 |
|
|
|
Positions trained per network |
———– |
————- |
51,200,000 |
12,800,000 |
2,560,000 |
5,120,000 |
|
Total position trained |
2.867 B |
2.867 B |
|
|
|
|
|
Sampling ratio |
0.69 |
0.48 |
14.22 |
3.55 |
0.47 |
0.95 |
0.89 |