axolotl-ai-cloud / axolotl Public

Notifications You must be signed in to change notification settings
Fork 1.1k
Star 9.8k

Code
Issues 170
Pull requests 85
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: axolotl-ai-cloud/axolotl

Labels 22 Milestones 1

New pull request New

85 Open 1,777 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix: do not call preprocess in multimodal or pretraining case

#2861 opened Jul 3, 2025 by NanoCode012

Loading…

fix: set add_generation_prompt to False when apply chat template for multimodal

#2859 opened Jul 3, 2025 by NanoCode012

Loading…

Add venv to shell prompt in dockerfiles

#2857 opened Jul 2, 2025 by SalmanMohammadi

Loading…

Feat: add gemma3n support

#2852 opened Jul 1, 2025 by NanoCode012 • Draft

1 task

fix nightlies to use correct cache

#2848 opened Jun 30, 2025 by winglian

Loading…

update transformers to 4.53.0

#2844 opened Jun 30, 2025 by winglian

Loading…

fix: remove unnecessary movement of eval logits to cpu

#2824 opened Jun 23, 2025 by NanoCode012

Loading…

feat: add sageattention

#2823 opened Jun 23, 2025 by NanoCode012 • Draft

Enable Memory Efficient Loading when using Deepspeed 3 for Mistral

#2804 opened Jun 18, 2025 by benHeid

Loading…

feat: add phi_35_vl support

#2798 opened Jun 17, 2025 by NanoCode012 • Draft

manage jinja templates as nicely formatted files

#2795 opened Jun 16, 2025 by winglian

Loading…

[Draft] Token-weighted datasets: Control up/down-sampling of multiple datasets

#2794 opened Jun 16, 2025 by casper-hansen • Draft

feat(mm_chat): enhance multimodal chat collator for audio/text suppor… hold

don't merge this yet

#2765 opened Jun 5, 2025 by voidful

Loading…

6 of 9 tasks

Add StableMax integration to enable grokking and prevent Softmax Collapse

#2761 opened Jun 5, 2025 by ehartford

Loading…

FSDP1 -> FSDP2

#2760 opened Jun 5, 2025 by SalmanMohammadi • Draft

5 of 6 tasks

Make De-duplication Multi-threaded and Happen Only During Pre-processing

#2747 opened Jun 1, 2025 by xzuyn

Loading…

don't use zero first context for loading datasets hold

don't merge this yet

#2713 opened May 23, 2025 by winglian • Draft

add support for Select Activation Checkpointing

#2711 opened May 23, 2025 by winglian • Draft

Create base docker images for CUDA 12.8 with custom FlashAttention 3 installed

#2685 opened May 16, 2025 by winglian

Loading…

(WIP) Feat: Add wizard CLI to create yaml config

#2669 opened May 13, 2025 by NanoCode012 • Draft

User-agent on CI snapshot download hold

don't merge this yet

#2665 opened May 12, 2025 by winglian

Loading…

run codecov action at end of CI; only_pulls: true

#2664 opened May 12, 2025 by djsaunde

Loading…

Implement configurable handling of excess tokens in datasets

#2662 opened May 12, 2025 by mhenrichsen

Loading…

setup defaults for dataloader to ensure GPU is kept busy

#2632 opened May 5, 2025 by winglian

Loading…

[WIP] feat: add heartbeat endpoint

#2618 opened May 2, 2025 by AlpinDale • Draft

Previous 1 2 3 4 Next

Previous Next

ProTip! Filter pull requests by the default branch with base:main.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!