Sparse Autoencoders Can Capture Language-Specific Concepts Across Diverse Languages
Published in , 2025
This paper is about utilizing sparse autoencoder to find language-specific concepts that could be important to multilingual capability. We also introduced new interpretability method called SAE-LAPE.
Download here