Skip to content
View louaaron's full-sized avatar

Highlights

  • Pro

Organizations

@CUAI

Block or report louaaron

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Triton Kernels for LLM Training

Python 2,991 153 Updated Sep 22, 2024

Ring attention implementation with flash attention

Python 537 41 Updated Sep 20, 2024

A repository for research on medium sized language models.

Python 469 70 Updated Aug 20, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,395 2,131 Updated Aug 12, 2024

Scalable Diffusion Models with State Space Backbone

Python 146 7 Updated Mar 7, 2024

Mamba SSM architecture

Python 12,632 1,061 Updated Aug 15, 2024

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Jupyter Notebook 893 58 Updated Mar 25, 2023

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,652 2,100 Updated Jul 18, 2024

[NeurIPS 2023] Riemannian Residual Neural Networks (https://arxiv.org/abs/2006.10254)

Python 14 1 Updated Feb 4, 2024

🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨

Python 358 50 Updated Sep 21, 2024

Implementation of MagViT2 Tokenizer in Pytorch

Python 538 35 Updated Jul 23, 2024

Consistency Distilled Diff VAE

Python 2,125 75 Updated Nov 7, 2023

Official implementation of VQ-Diffusion

Python 879 62 Updated Apr 17, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,420 844 Updated Sep 4, 2024

Dataset of GPT-2 outputs for research in detection, biases, and more

Python 1,933 550 Updated Dec 13, 2023

Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.

Python 271 24 Updated Jul 12, 2024
Jupyter Notebook 291 25 Updated Sep 20, 2022

Cramming the training of a (BERT-type) language model into limited compute.

Python 1,286 100 Updated Jun 13, 2024

Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.

Python 709 79 Updated Dec 8, 2022
Python 1,470 127 Updated Apr 27, 2023

Fast and memory-efficient exact attention

Python 13,478 1,234 Updated Sep 21, 2024

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

Python 966 227 Updated Jul 8, 2019

CUDA Library Samples

Cuda 1,537 321 Updated Sep 10, 2024

Transformers with Arbitrarily Large Context

Python 617 48 Updated Aug 12, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,031 535 Updated May 31, 2024

Reparameterized Discrete Diffusion Models for Text Generation

Python 90 3 Updated Feb 14, 2023
Python 64 5 Updated May 29, 2023

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,670 306 Updated Jul 14, 2024

Application of the L2HMC algorithm to simulations in lattice QCD.

Jupyter Notebook 66 8 Updated Feb 2, 2024
Next