
Training runs

Overview

| Run # | Reference | Summary | Currently Active | Net Numbers | Best nets |
|---|---|---|---|---|---|
| NA | Old Main | Original 192x15 “main” run | No | 1 to 601 | ID 595 |
| test10 | [[Lc0 Transition]] | Original 256x20 test run | No | 10'000 to 11'262 | 11250, 11248 |
| test20 | Training run reset | Many changes, see blog. | No | 20'001 to 22'201 | 22018 |
| test30 | TB rescoring | Experiment with network initialization strategy, trying to solve spike issues. Experiment with Tablebase rescoring. | No | 30'001 to 33'005 | 32930 |

LR Drop

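| Training Run | 1st LR drop | Elo | 2nd LR drop | Elo | 3rd LR drop | Elo | Best Net | Elo | Current best |
|---|---|---|---|---|---|---|---|---|---|
| Old Main | | | | | | | ID 595 | 3148 | |
| Test 10 | ID 10077 | | ID 10320 | | ID 11013 | | ID 11248 | 3282 | * |
| Test 20 | ID 20247 | 2318 | ID 20493 | | ID 21281 | | ID 22018 | 3118 | |
| Test 30 | ID 30854 | | | | | | | | |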

The IDs for Test 20 still need to be checked.

Sampling ratio

Most data from this sheet

  • Alpha Zero reference paper
    Uses a best guess for game length and assumes that resignation cuts game length by 30%.
  • Old Main
    Initially, new networks were generated on a fixed time schedule rather than after a fixed number of games.
| Item | A0 with resign | A0 w/out resign | Main up to ID xxx | Main from ID xxx | Main from ID yyy to ID 598 | Test 10 | Test 20 |
|---|---|---|---|---|---|---|---|
| Positions per training game | 95 | 135 | 135 | 135 | 135 | 135 | - |
| New networks per day | - | - | 6 | 6 | | | |
| Training games per day | - | - | 160,000 | 160,000 | | | |
| Training games per network | - | - | 26,700 | 26,700 | 40,000 | 40,000 | |
| Total training games | 44,000,000 | 44,000,000 | 25,000,000 | | | | |
| Positions generated per day | - | - | 21,600,000 | 21,600,000 | | | |
| Positions generated per network | - | - | 3,600,000 | 3,600,000 | 5,400,000 | 5,400,000 | |
| Total positions generated | 4.158 B | 5.940 B | | | | | |
| Batch size | 4,096 | 4,096 | 1,024 | 256 | 256 | 2,048 | |
| Training steps per day | - | - | 300,000 | 300,000 | | | |
| Training steps per network | - | - | 50,000 | 50,000 | 10,000 | 2,500 | |
| Total training steps | 700,000 | 700,000 | | | | | |
| Positions trained per day | - | - | 307,200,000 | 76,800,000 | | | |
| Positions trained per network | - | - | 51,200,000 | 12,800,000 | 2,560,000 | 5,120,000 | |
| Total positions trained | 2.867 B | 2.867 B | | | | | |
| Sampling ratio | 0.69 | 0.48 | 14.22 | 3.55 | 0.47 | 0.95 | 0.89 |
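
The sampling ratios in the table appear to follow from the other rows as positions trained divided by positions generated, where positions trained is training steps times batch size. The page does not state this formula, so the sketch below is an assumption inferred from the figures. It also assumes a particular mapping of batch sizes and step counts to columns (A0, the three Main phases, Test 10), and the function name `sampling_ratio` is made up for illustration. Test 20 is left out because the table gives no step or position figures for it.

```python
def sampling_ratio(steps: int, batch_size: int, positions_generated: float) -> float:
    """Average number of times each generated position is used in training.

    Assumed formula (not stated on the page): steps * batch_size / positions.
    """
    return steps * batch_size / positions_generated


# A0 figures from the table; the 30% resign cut is the page's stated guess.
A0_GAMES = 44_000_000
POSITIONS_PER_GAME = 135
RESIGN_CUT = 0.30

a0_generated_no_resign = A0_GAMES * POSITIONS_PER_GAME
a0_generated_resign = a0_generated_no_resign * (1 - RESIGN_CUT)

# (training steps, batch size, positions generated)
# A0 rows use run totals; the Lc0 rows use the per-network figures.
rows = {
    "A0 with resign": (700_000, 4_096, a0_generated_resign),
    "A0 w/out resign": (700_000, 4_096, a0_generated_no_resign),
    "Main up to ID xxx": (50_000, 1_024, 3_600_000),
    "Main from ID xxx": (50_000, 256, 3_600_000),
    "Main from ID yyy to ID 598": (10_000, 256, 5_400_000),
    "Test 10": (2_500, 2_048, 5_400_000),
}

ratios = {name: sampling_ratio(*row) for name, row in rows.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}")
```

Under these assumptions the computed values agree with the table to within 0.01. The one visible difference is "Main from ID xxx", where 12,800,000 / 3,600,000 is about 3.556 and the table lists 3.55.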
Last Updated: 2023-08-18