Skip to content
View snoop2head's full-sized avatar

Highlights

  • Pro

Organizations

@PoolC @QuoQA-NLP @AttentionX

Block or report snoop2head

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

🏫 Basics

47 repositories

🧪 bio

17 repositories

🕶 Braille Recognition

28 repositories

🧲 Contrastive Learning

17 repositories

🧗‍♀️ DeepClimb

8 repositories

💬 Interesting Writings

Works that persuasively resolves long-lasting questions of mine
82 repositories

⚡️ JAX/FLAX

98 repositories

👄 Lip Reading

Visual-Speech Recognition
232 repositories
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"

Python 39 3 Updated Jul 27, 2024

The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"

Python 15 Updated Jul 5, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,917 121 Updated May 15, 2024
Jupyter Notebook 202 30 Updated Dec 22, 2023

This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.

Python 386 24 Updated Feb 12, 2024

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 366 27 Updated Sep 17, 2024

Scikit-learn friendly library to interpret, and prompt-engineer text datasets using large language models.

Python 152 25 Updated Jul 6, 2024
Jupyter Notebook 174 36 Updated May 8, 2024

Transformer implementation for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 53 4 Updated Aug 15, 2024

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 545 20 Updated Sep 17, 2024

Pre-train LLMs faster with Early Weight Averaging.

Python 14 1 Updated Jan 26, 2024

Switch EMA: A Free Lunch for Better Flatness and Sharpness

Python 19 2 Updated Feb 16, 2024

Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).

Python 29 2 Updated Aug 6, 2024

Pytorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference

Python 21 3 Updated Jun 19, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,460 151 Updated Aug 17, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,426 274 Updated Sep 20, 2024
Python 93 4 Updated Aug 2, 2024

Minimalistic large language model 3D-parallelism training

Python 1,121 105 Updated Sep 20, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,943 137 Updated Sep 11, 2024

PAIR.withgoogle.com and friend's work on interpretability methods

JavaScript 139 29 Updated Sep 11, 2024

A user-friendly library for reproducible video moment retrieval and highlight detection.

Python 70 7 Updated Sep 19, 2024
Python 768 77 Updated Jan 27, 2024
Python 45 8 Updated Jul 17, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,847 889 Updated Aug 21, 2024

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,538 352 Updated Aug 10, 2024

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 681 108 Updated Jul 30, 2024

Official repository for EXAONE built by LG AI Research

162 11 Updated Aug 8, 2024

Official repository for KoMT-Bench built by LG AI Research

Python 44 Updated Aug 8, 2024
Next