The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain • Paper • 2509.26507 • Published Sep 30, 2025 • 547
🔍 Interpretability & Analysis of LMs • Collection • Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 118
Towards Best Practices of Activation Patching in Language Models: Metrics and Methods • Paper • 2309.16042 • Published Sep 27, 2023 • 4