Skip to content

Commit ec4ebfd

Browse files
authored
Add a few items to faq (#2734)
* Add a few items to faq * formatting * chore: lint
1 parent bde8b5b commit ec4ebfd

File tree

2 files changed

+14
-12
lines changed

2 files changed

+14
-12
lines changed

docs/faq.qmd

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -110,3 +110,17 @@ description: Frequently asked questions
110110
> A: If `eot_tokens: ` is not provided, the default behavior is the same as before. EOS tokens used to delimit turns are masked/unmasked depending on whether the turn is trainable.
111111
112112
> Internally, `eot_tokens: tokenizer.eos_token` and `train_on_eot: train_on_eos` (which defaults to `turn`). This transition helps clarify the naming and behavior of EOT/EOS tokens.
113+
114+
**Q: `Data processing error: CAS service error`**
115+
116+
> A: Try disabling XET with `export HF_HUB_DISABLE_XET=1`
117+
118+
**Q: `torch._inductor.exc.LoweringException: NoValidChoicesError: No choices to select, please consider adding ATEN into max_autotune_gemm_backends config (defined in torch/_inductor/config.py) to allow at least one choice. `**
119+
120+
> A: Depending on the version of torch, you may need to include this in your YAML:
121+
122+
> ```yaml
123+
> flex_attn_compile_kwargs:
124+
> dynamic: false
125+
> mode: max-autotune-no-cudagraphs
126+
> ```

src/axolotl/utils/schemas/config.py

Lines changed: 0 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1175,18 +1175,6 @@ def check_kto_config(cls, data):
11751175

11761176
return data
11771177

1178-
# @model_validator(mode="before")
1179-
# @classmethod
1180-
# def check_grpo_peft_liger(cls, data):
1181-
# if (
1182-
# data.get("rl") == "grpo"
1183-
# and data.get("trl", {})
1184-
# and data.get("trl").get("use_liger_loss")
1185-
# and data.get("adapter")
1186-
# ):
1187-
# raise ValueError("PEFT + GRPO + Liger is not yet supported")
1188-
# return data
1189-
#
11901178
@model_validator(mode="before")
11911179
@classmethod
11921180
def check_grpo_liger_sequence_parallel(cls, data):

0 commit comments

Comments
 (0)