Skip to content
View obalcells's full-sized avatar
  • Berkeley, USA

Highlights

  • Pro

Block or report obalcells

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".

Python 79 16 Updated Aug 27, 2024

Structured state space sequence models

Jupyter Notebook 2,375 285 Updated Jul 17, 2024

Mamba SSM architecture

Python 12,658 1,061 Updated Aug 15, 2024

Fast, collaborative live terminal sharing over the web

Rust 5,758 171 Updated Sep 24, 2024

Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.

Python 460 16 Updated Aug 30, 2024

Proving the missing direction of Abel-Ruffini's Theorem

Lean 2 Updated Dec 7, 2023

LLMs as Copilots for Theorem Proving in Lean

C++ 954 83 Updated Sep 2, 2024

Tool for data extraction and interacting with Lean programmatically.

Python 544 83 Updated Sep 19, 2024

LLM verified with Monte Carlo Tree Search

Python 8 Updated Nov 15, 2023
Lean 82 8 Updated Nov 12, 2023

Experiments in Mechanistic Interpretability and AI Safety in general

Jupyter Notebook 2 1 Updated Nov 29, 2023

Semantic search for competitive programming problems

Python 175 10 Updated Aug 9, 2024
Python 32 2 Updated May 21, 2024
Jupyter Notebook 69 8 Updated Jan 30, 2024

Interactively grep source code. Source for http://livegrep.com/

C++ 2,011 180 Updated Jun 21, 2024

Lean 4 programming language and theorem prover

Lean 4,530 400 Updated Sep 24, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,669 447 Updated May 3, 2024

Steering Llama 2 with Contrastive Activation Addition

Jupyter Notebook 83 27 Updated May 23, 2024

Library for algorithmic trading

C++ 62 7 Updated Oct 3, 2023

Using sparse coding to find distributed representations used by neural networks.

Jupyter Notebook 165 28 Updated Nov 10, 2023

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

HTML 185 77 Updated Feb 7, 2024
Jupyter Notebook 174 36 Updated May 8, 2024
Jupyter Notebook 4 Updated Jan 19, 2023

Language model alignment-focused deep learning curriculum

1,221 102 Updated Aug 19, 2024
Rust 2 Updated Apr 23, 2024
Python 11 4 Updated Dec 20, 2019

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,840 995 Updated Sep 24, 2024

Embedded Firmware for the CATS Flight Computers

C 30 6 Updated Sep 22, 2024

Code for the best subteam ever

C 2 3 Updated Feb 2, 2021
Next