xwinxu

💭

alignment and llms.

Winnie Xu xwinxu

💭

alignment and llms.

General Generative Models. UofT CS/Stats/Math '22. Ex- @google-research @cohere-ai @facebookresearch @VectorInstitute

270 followers · 127 following

Achievements

x2 x2

Achievements

x2 x2

Highlights

Organizations

Lists (2)

Sort

🔮 Future ideas

3 repositories

pytorch

useful repos for working in torch

1 repository

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

EurekaLabsAI / ngram

The n-gram Language Model

C 1,316 93 Updated Aug 5, 2024

google-deepmind / dangerous-capability-evaluations

Python 42 2 Updated Sep 26, 2024

callummcdougall / ARENA_3.0

HTML 288 175 Updated Oct 4, 2024

Psycoy / MixEval

The official evaluation suite and dynamic data release for MixEval.

Python 209 31 Updated Sep 29, 2024

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 375 47 Updated Oct 4, 2024

srush / LLM-Training-Puzzles

What would you do with 1000 H100s...

Jupyter Notebook 881 52 Updated Jan 10, 2024

srush / Triton-Puzzles

Puzzles for learning Triton

Jupyter Notebook 1,015 65 Updated Sep 25, 2024

mitmath / 1806

18.06 course at MIT

Jupyter Notebook 2,499 679 Updated Sep 14, 2024

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 4,540 393 Updated Sep 23, 2024

facebookresearch / dpr-scale

Scalable training for dense retrieval models.

Python 268 25 Updated May 27, 2023

Data-Provenance-Initiative / Data-Provenance-Collection

Jupyter Notebook 187 40 Updated Oct 2, 2024

Lightning-AI / forked-pdb

Python pdb for multiple processes

Python 30 6 Updated Nov 5, 2022

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 11,104 666 Updated Oct 5, 2024

meta-llama / llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 12,015 1,857 Updated Oct 4, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 9,608 1,207 Updated Oct 5, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,536 5,753 Updated Aug 19, 2024

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,711 454 Updated May 3, 2024

allenai / natural-instructions

Expanding natural instructions

Python 950 189 Updated Dec 11, 2023

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 10,175 2,289 Updated Oct 5, 2024

NVIDIA / NeMo-Framework-Launcher

Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.

Python 453 134 Updated Oct 4, 2024

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,062 167 Updated Aug 11, 2024

explodinggradients / ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python 6,786 669 Updated Oct 5, 2024

meta-llama / llama

Inference code for Llama models

Python 55,851 9,512 Updated Aug 18, 2024

princeton-nlp / MeZO

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Python 1,029 62 Updated Jan 11, 2024

gpakosz / .tmux

🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️

Shell 21,895 3,359 Updated Oct 5, 2024

Sroy20 / machine-learning-interview-questions

This repository is to prepare for Machine Learning interviews.

1,475 391 Updated May 19, 2019

tloen / alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,570 2,213 Updated Jul 29, 2024

srush / raspy

An interactive exploration of Transformer programming.

Jupyter Notebook 244 20 Updated Nov 15, 2023

q-hwang / ai_for_research

AI tools for research

Python 11 Updated Apr 27, 2023

HazyResearch / H3

Language Modeling with the H3 State Space Model

Assembly 511 53 Updated Sep 29, 2023