Skip to content
View HoagyC's full-sized avatar

Block or report HoagyC

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Sparse Autoencoder for Mechanistic Interpretability

Python 173 38 Updated Jul 20, 2024

Work on sparse coding, replicating and extending the sparse coding approach to taking transformer features out of superposition.

Jupyter Notebook 1 Updated Oct 19, 2023

Sparse probing paper full code.

Jupyter Notebook 47 10 Updated Dec 17, 2023
Jupyter Notebook 7 5 Updated Feb 15, 2024

Keeping language models honest by directly eliciting knowledge encoded in their activations.

Python 182 33 Updated Sep 23, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,433 278 Updated Sep 23, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,942 1,550 Updated Sep 24, 2024

Multiversal tree writing interface for human-AI collaboration

Python 1,024 75 Updated Jun 28, 2024
Jupyter Notebook 72 9 Updated Jun 28, 2024
Python 8 2 Updated Jul 10, 2024
Python 7 3 Updated Sep 30, 2021