Suggested readme update #14

allenschmaltz · 2025-05-17T04:25:49Z

I believe these 2 papers and 2 blog posts will be of interest to LLM interpretability researchers, as they cover a rather different set of approaches for interpretability over the models with non-identifiable parameters (e.g., LLMs), and work well in practice. These can be characterized at a high-level as "uncertainty-aware interpretability-by-exemplar" methods.

Feel free to include or not include at your discretion. (I also fixed a minor typo.)

allenschmaltz added 2 commits May 16, 2025 23:07

Suggested update to README.md

584a782

Suggested update to README.md

0e1052b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Suggested readme update #14

Suggested readme update #14

Uh oh!

allenschmaltz commented May 17, 2025

Uh oh!

Uh oh!

Suggested readme update #14

Are you sure you want to change the base?

Suggested readme update #14

Uh oh!

Conversation

allenschmaltz commented May 17, 2025

Uh oh!

Uh oh!