💭
alignment and llms.

Organizations

@for-ai @VectorInstitute @UTMIST

The n-gram Language Model

C 1,316 93 Updated Aug 5, 2024

The official evaluation suite and dynamic data release for MixEval.

Python 209 31 Updated Sep 29, 2024

RewardBench: the first evaluation tool for reward models.

Python 375 47 Updated Oct 4, 2024

What would you do with 1000 H100s...

Jupyter Notebook 881 52 Updated Jan 10, 2024

Puzzles for learning Triton

Jupyter Notebook 1,015 65 Updated Sep 25, 2024

18.06 course at MIT

Jupyter Notebook 2,499 679 Updated Sep 14, 2024

Robust recipes to align language models with human and AI preferences

Python 4,540 393 Updated Sep 23, 2024

Scalable training for dense retrieval models.

Python 268 25 Updated May 27, 2023

Python pdb for multiple processes

Python 30 6 Updated Nov 5, 2022

Machine Learning Engineering Open Book

Python 11,104 666 Updated Oct 5, 2024

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 12,015 1,857 Updated Oct 4, 2024

Train transformer language models with reinforcement learning.

Python 9,608 1,207 Updated Oct 5, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,536 5,753 Updated Aug 19, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,711 454 Updated May 3, 2024

Expanding natural instructions

Python 950 189 Updated Dec 11, 2023

Ongoing research training transformer models at scale

Python 10,175 2,289 Updated Oct 5, 2024

Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.

Python 453 134 Updated Oct 4, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,062 167 Updated Aug 11, 2024

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python 6,786 669 Updated Oct 5, 2024

Inference code for Llama models

Python 55,851 9,512 Updated Aug 18, 2024

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Python 1,029 62 Updated Jan 11, 2024

🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️

Shell 21,895 3,359 Updated Oct 5, 2024

This repository is to prepare for Machine Learning interviews.

1,475 391 Updated May 19, 2019

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,570 2,213 Updated Jul 29, 2024

An interactive exploration of Transformer programming.

Jupyter Notebook 244 20 Updated Nov 15, 2023

AI tools for research

Python 11 Updated Apr 27, 2023

Language Modeling with the H3 State Space Model

Assembly 511 53 Updated Sep 29, 2023