Highlights
- Pro
Stars
This repo contains a curative list of robot learning (mainly for manipulation) resources.
A collection of awesome video generation studies.
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning"
Awesome speech/audio LLMs, representation learning, and codec models
A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold N…
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language model alignment-focused deep learning curriculum
Collaborative Training of Large Language Models in an Efficient Way
Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
A statically typed programming language for scientific computations with first class support for physical dimensions and units
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Analyzing LLM Alignment via Token distribution shift
Curated list of open source tooling for data-centric AI on unstructured data.
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
[CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models"
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Latex preprocessor — apply macro definitions, remove comments, and more
In this project, we will parse arxiv latex file, restruct it(recomplie \newcommand and \input and so on) and extract figure/table or other information tag for training.
Simple LaTeX parser providing latex-to-unicode and unicode-to-latex conversion