HoagyC

HoagyC

24 followers · 0 following

Achievements

Stars

ai-safety-foundation / sparse_autoencoder

Sparse Autoencoder for Mechanistic Interpretability

Python 173 38 Updated Jul 20, 2024

Baidicoot / sparse_coding

Forked from HoagyC/sparse_coding

Work on sparse coding, replicating and extending the sparse coding approach to taking transformer features out of superposition.

Jupyter Notebook 1 Updated Oct 19, 2023

wesg52 / sparse-probing-paper

Sparse probing paper full code.

Jupyter Notebook 47 10 Updated Dec 17, 2023

loganriggs / sparse_coding

Forked from HoagyC/sparse_coding

Jupyter Notebook 7 5 Updated Feb 15, 2024

EleutherAI / elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.

Python 182 33 Updated Sep 23, 2024

TransformerLensOrg / TransformerLens

A library for mechanistic interpretability of GPT-style language models

Python 1,433 278 Updated Sep 23, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,942 1,550 Updated Sep 24, 2024

socketteer / loom

Multiversal tree writing interface for human-AI collaboration

Python 1,024 75 Updated Jun 28, 2024

jessicarumbelow / Backwards

Jupyter Notebook 72 9 Updated Jun 28, 2024

collin-burns / discovering_latent_knowledge

Python 246 36 Updated Mar 2, 2024

mishajw / dotfiles

Python 8 2 Updated Jul 10, 2024

mishajw / vaxtldr.uk

Python 7 3 Updated Sep 30, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HoagyC

Achievements

Achievements

Block or report HoagyC

Stars

ai-safety-foundation / sparse_autoencoder

Baidicoot / sparse_coding

wesg52 / sparse-probing-paper

loganriggs / sparse_coding

EleutherAI / elk

TransformerLensOrg / TransformerLens

huggingface / peft

socketteer / loom

jessicarumbelow / Backwards

collin-burns / discovering_latent_knowledge

mishajw / dotfiles

mishajw / vaxtldr.uk