Pull Request: Enhanced Training and Analysis Features for KataGo #1072

anonym-g · 2025-06-10T08:38:56Z

This pull request introduces several new features to improve the training and analysis capabilities of the KataGo project. The additions include scripts for noise injection, model merging, Elo estimation, and training statistics visualization, each designed to streamline workflows and enhance model performance.

New Features

Noise Injection Scripts
- A Python script (python/noise.py) and a Bash wrapper (python/selfplay/noise.sh) that add noise to model weights after training.
- These scripts rely on statistics that the modified train.py logs to stdout.txt.
- Purpose: Fine-tune pre-trained models efficiently by injecting controlled noise.

Model Merging Scripts
- A Python script (python/merge.py) and a Bash wrapper (python/selfplay/merge.sh) that merge multiple checkpoint files into a single .bin.gz model.
- Purpose: Consolidate trained models; this may be similar to the technique behind the experimental network released on April 28, 2025.

Elo Estimation Script
- A Python script (python/elo_estimate.py) that estimates model Elo ratings.
- Purpose: Provide a user-friendly way to evaluate model strength relative to official releases.

Training Statistics Visualization
- Tools to visualize the key statistics collected during training, which are the same statistics used for noise generation.
- Note: These tools currently live in my training directory for convenience, so paths may need adjusting for other users' setups.

Statistical Noise Injection (SNI) Results

The noise injection scripts enable efficient fine-tuning of pre-trained models. Key findings from testing include:

- With approximately 12,000 training rows and 1,000 noise iterations (via noise.sh), fine-tuned models reached performance comparable to official updates trained on 150,000–200,000 games (6.9M–9.25M rows).
- Across three randomly selected pairs of consecutive official releases, the fine-tuned models averaged a win rate of about 51% against the newer models, suggesting gains comparable to one official update.
- A paper documenting these results is in progress, and I welcome community testing to validate the approach.
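
For readers who want a mental model of what "adding noise to model weights" can look like, here is a minimal sketch. It is not the code in python/noise.py. It assumes Gaussian noise whose standard deviation is a chosen scale times each parameter's own spread, whereas the PR derives its statistics from train.py logs whose exact form is not described here. Plain Python lists stand in for tensors, the parameter names are hypothetical, and the sketch shows a single perturbation only, since what one "iteration" means in noise.sh is not spelled out in this description.

```python
import random
import statistics


def add_scaled_noise(weights, scale=1.0, seed=None):
    """Return a noisy copy of `weights` (dict: name -> list of floats).

    Assumption for illustration: each value receives Gaussian noise with
    standard deviation scale * (population std of that parameter's values).
    """
    rng = random.Random(seed)
    noisy = {}
    for name, values in weights.items():
        sigma = scale * statistics.pstdev(values)
        # The input is left untouched; a new list is built per parameter.
        noisy[name] = [v + rng.gauss(0.0, sigma) for v in values]
    return noisy


# Hypothetical parameter names and values, for illustration only.
weights = {"conv1.weight": [0.5, -0.2, 0.1, 0.3], "conv1.bias": [0.0, 0.0]}
noisy = add_scaled_noise(weights, scale=0.01, seed=0)
print(noisy["conv1.bias"])  # a constant parameter has zero spread, so it is unchanged
```

Scaling the noise by a per-parameter statistic keeps the perturbation proportionate across layers with very different weight magnitudes; a single global standard deviation would be simpler but treats all layers alike.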

Example from Pair 3:

- Older model: kata1-b28c512nbt-s8003120896-d4541551568 (2024-11-23, Elo: 13935.7 ± 16.3, 3113 games)
- Newer model: kata1-b28c512nbt-s8032072448-d4548958859 (2024-11-28, Elo: 13950.1 ± 16.4, 3372 games)
- Fine-tuned models: kata1-b28c512nbt-s8003240928-d120263 (baseline, no noise) and noisy-1.0-1000iters-s8003240928-d120263 (with noise)
- Results against the newer model:
  - Baseline (no noise): Win rate 49.47% (139/281), Elo boost -3.71 ± 20.73
  - With noise (1,000 iterations): Win rate 51.37% (150/292), Elo boost +9.52 ± 20.34
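
The Elo boost figures above are consistent with the standard logistic Elo model applied to the raw win/loss counts, with a delta-method standard error. The sketch below illustrates that calculation. It is not taken from python/elo_estimate.py, the function name is hypothetical, and draws are ignored.

```python
import math


def elo_diff_from_record(wins: int, games: int) -> tuple[float, float]:
    """Return (elo_difference, standard_error) for a head-to-head record.

    Logistic Elo model: p = 1 / (1 + 10 ** (-d / 400)).
    """
    if not 0 < wins < games:
        raise ValueError("need 0 < wins < games for a finite estimate")
    p = wins / games
    elo = 400.0 * math.log10(p / (1.0 - p))
    # Delta method: d(elo)/dp = 400 / (ln 10 * p * (1 - p)), combined with
    # the binomial standard error of p, sqrt(p * (1 - p) / games).
    se = 400.0 / (math.log(10.0) * math.sqrt(games * p * (1.0 - p)))
    return elo, se


for label, wins, games in [("baseline", 139, 281), ("with noise", 150, 292)]:
    elo, se = elo_diff_from_record(wins, games)
    print(f"{label}: {100 * wins / games:.2f}% -> {elo:+.2f} ± {se:.2f} Elo")
# baseline: 49.47% -> -3.71 ± 20.73 Elo
# with noise: 51.37% -> +9.52 ± 20.34 Elo
```

With roughly 300 games per match, the standard error of about 20 Elo is larger than either measured difference, so more games are needed to separate the two fine-tuned variants from each other or from the newer model.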

Model Merging

The merging scripts consolidate multiple checkpoint files into a unified .bin.gz model, simplifying deployment and analysis. This approach may resemble the methodology behind the experimental network released on April 28, 2025.
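
This description does not say how python/merge.py combines checkpoints. One common technique is an element-wise weighted average of parameters, and the sketch below shows that technique only as an assumption about what "merging" could mean. Plain dicts of float lists stand in for checkpoint state, and the function and parameter names are hypothetical.

```python
def average_checkpoints(state_dicts, coeffs=None):
    """Element-wise weighted average of parameter dicts (name -> list of floats).

    Assumes every checkpoint has the same parameter names and shapes.
    """
    if not state_dicts:
        raise ValueError("need at least one checkpoint")
    n = len(state_dicts)
    coeffs = coeffs or [1.0 / n] * n  # default: uniform average
    if len(coeffs) != n or abs(sum(coeffs) - 1.0) > 1e-9:
        raise ValueError("coefficients must match checkpoints and sum to 1")
    keys = state_dicts[0].keys()
    if any(sd.keys() != keys for sd in state_dicts):
        raise ValueError("checkpoints must have identical parameter names")
    merged = {}
    for name in keys:
        # Group the i-th value of this parameter across all checkpoints.
        columns = zip(*(sd[name] for sd in state_dicts))
        merged[name] = [sum(c * v for c, v in zip(coeffs, col)) for col in columns]
    return merged


# Hypothetical two-checkpoint example.
ckpt_a = {"layer.weight": [1.0, 2.0]}
ckpt_b = {"layer.weight": [3.0, 4.0]}
print(average_checkpoints([ckpt_a, ckpt_b]))  # {'layer.weight': [2.0, 3.0]}
```

Non-uniform coefficients, for example `coeffs=[0.25, 0.75]`, weight later checkpoints more heavily.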

Elo Estimation

The elo_estimate.py script provides a convenient way to estimate the Elo rating difference between two models.

Notes and Next Steps

- The visualization tools are currently tailored to my directory structure and may require path adjustments for broader compatibility.
- I invite feedback on the scripts’ usability and performance, particularly for the noise injection approach, which shows promising results.
- Community validation of the fine-tuning results would be valuable, and I’m happy to collaborate on further testing or integration.

anonym-g and others added 30 commits March 9, 2025 10:39

Added the create_symlink.py file

60315e3

Update shuffle.sh

c05fe25

Merge branch 'lightvector:master' into master

3dce474

Add files via upload

204fbc7

Script for adding random noises to model weights

Add files via upload

8f744bc

Update README.md

f54f90c

Create parameters.md

03d5a8a

b28c512nbt models' parameter list & structure overview

Create TestModels.md

0954b7f

Update noise.py

629e428

Update noise.sh

447a698

update some scripts

f6e8a9e

Update some scripts

eaa6102

Adjust the script calling order

35bfb35

Fix log error

52d0089

Update some scripts

e38e8bb

Update noise.py

026d4e8

Merge branch 'lightvector:master' into master

0512a87

Merge branch 'lightvector:master' into master

7320cba

Update some scripts

a11d9a4

Update some scripts

c9dd4d8

Ratio Data & Visualization

2dadef7

Update some scripts

79c09b8

Update visualization scripts

d69cecd

Update noise.py

ba0eeda

Update selfplay1.cfg

cd862d2

visualization data update

b673ec9

Update noise.py

3bae0be

Update train.py

aea3653

Update noise.py

6c708ac

Update train.py

37e72fd

anonym-g and others added 22 commits May 23, 2025 18:04

Update shuffle.sh

aba2625

Update to match the master

eb13ff0

Update train.sh

2873d76

Merge branch 'lightvector:master' into master

c4fc0d0

Delete create_symlink.py

1c543d2

modified python calling

0a89559

Update train.sh

9380991

Fetch the update of master

ffcf71d

Delete NoiseModels-Test/Classified Noise directory

0a85776

Update files related to visualization

cfec429

Merge branch 'master' of https://github.com/anonym-g/KataGo-Noise

ffcd70c

Update merge.py

2435848

Update selfplay1.cfg

a212569

Update gatekeeper1.cfg

eed9775

Update README.md

766cc11

Update train.sh

8281b0e

Update shuffle_loop.sh

68f2e3e

Update shuffle_and_export_loop.sh

381a0aa

Update shuffle.sh

234ab9a

Update export_model_for_selfplay.sh

2de2f2c

Update synchronous_loop.sh

63702dd

Merge branch 'lightvector:master' into master

71acd52
