Overview
Run # | Reference | Summary | Currently Active | Net Numbers | Best nets |
---|
NA | Old Main | Original 192x15 “main” run | No | 1 to 601 | ID595 |
test10 | [[Lc0 Transition]] | Original 256x20 test run | No | 10'000 to 11'262 | 11250 11248 |
test20 | Training run reset | Many changes, see blog. | No | 20'001 to 22'201 | 22018 |
test30 | TB rescoring | Experiment with network initialization strategy, trying to solve spike issues. Experiment with Tablebase rescoring | No | 30'001 to 33'005 | 32930 |
LR Drop
Training Run | 1st LR drop | Elo | 2nd LR drop | Elo | 3rd LR drop | Elo | Best Net | Elo | Current best |
---|
Old Main | | | | | | | ID 595 | 3148 | |
Test 10 | ID 10077 | | ID 10320 | | ID 11013 | | ID 11248 | 3282 | * |
Test 20 | ID 20247 | 2318 | ID 20493 | | ID 21281 | | ID 22018 | 3118 | |
Test 30 | ID 30854 | | | | | | | | |
ID for test 20 to be checked
Sampling ratio
Most data from this sheet
- Alpha Zero reference paper
Use best guess for games length and assuming resign cuts game length by 30% - Old Main
Initially new networks generated based on fixed timing rather than on games
Item | A0 with resign | A0 w/out resign | Main up to ID xxx | Main from ID xxx | Main from IDyyy to ID598 | Test 10 | Test 20 |
---|
Positions per training game | 95 | 135 | 135 | 135 | 135 | 135 | ———– |
New networks per day | ———– | | 6 | 6 | | | |
Training Games per day | ———– | | 160,000 | 160,000 | | | |
Training Games per network | ———– | | 26,700 | 26,700 | 40,000 | 40,000 | |
Total training games | 44,000,000 | 44,000,000 | | | 25,000,000 | | |
Positions generated per day | ———– | ————- | 21,600,000 | 21,600,000 | | | |
Positions generated per network | ———– | ————- | 3,600,000 | 3,600,000 | 5,400,000 | 5,400,000 | |
Total positions generated | 4.158 B | 5.940 B | | | | | |
Batch size | 4,096 | 4,096 | 1,024 | 256 | 256 | 2,048 | |
Training steps per day | ———– | ————- | 300,000 | 300,000 | | | |
Training steps per network | ———– | ————- | 50,000 | 50,000 | 10,000 | 2,500 | |
Total training steps | 700,000 | 700,000 | | | | | |
Positions trained per day | ———– | ————- | 307,200,000 | 76,800,000 | | | |
Positions trained per network | ———– | ————- | 51,200,000 | 12,800,000 | 2,560,000 | 5,120,000 | |
Total position trained | 2.867 B | 2.867 B | | | | | |
Sampling ratio | 0.69 | 0.48 | 14.22 | 3.55 | 0.47 | 0.95 | 0.89 |