Awesome Papers for Sparse Auto-Encoder (SAE)

This list focuses on sparse auto-encoder (SAE) techniques in mechanistic interpretability. Another list covers understanding the internal mechanisms of LLMs.

To recommend a paper, preprint, or blog post, please open an issue or contact me.

Papers

2025

2024

2023

2022