Skip to content

Commit d7c31fa

Browse files
authored
Merge branch 'main' into data-load-refactor
2 parents ce70f27 + 6778856 commit d7c31fa

File tree

24 files changed

+1899
-1554
lines changed

24 files changed

+1899
-1554
lines changed

_quarto.yml

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,9 @@ quartodoc:
1717
- convert
1818
- prompt_tokenizers
1919
- logging_config
20-
- core.trainer_builder
20+
- core.builders.base
21+
- core.builders.causal
22+
- core.builders.rl
2123
- core.training_args
2224
- core.chat.messages
2325
- core.chat.format.chatml

docs/config.qmd

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -514,6 +514,7 @@ output_dir: ./completed-model
514514
# setting to `auto` will enable torch compile when torch>=2.5.1
515515
torch_compile: # Optional[Union[Literal["auto"], bool]]
516516
torch_compile_backend: # Optional[str]
517+
torch_compile_mode: # 'default' | 'reduce-overhead' | 'max-autotune'
517518

518519
# Training hyperparameters
519520

@@ -560,7 +561,7 @@ profiler_steps: # enable the pytorch profiler to capture the first N steps of tr
560561
loss_watchdog_threshold: # High loss value, indicating the learning has broken down (a good estimate is ~2 times the loss at the start of training)
561562
loss_watchdog_patience: # Number of high-loss steps in a row before the trainer aborts (default: 3)
562563

563-
# Save model as safetensors (require safetensors package)
564+
# Save model as safetensors (require safetensors package). Default True
564565
save_safetensors:
565566

566567
# Whether to mask out or include the human's prompt from the training labels

src/axolotl/core/builders/__init__.py

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
"""Trainer builder classes"""
2+
3+
from .causal import HFCausalTrainerBuilder
4+
from .rl import HFRLTrainerBuilder
5+
6+
__all__ = ["HFCausalTrainerBuilder", "HFRLTrainerBuilder"]

0 commit comments

Comments
 (0)