Stars
Dead simple FLUX LoRA training UI with LOW VRAM support
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Lumina-T2X is a unified framework for Text to Any Modality Generation
AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
A throughput-oriented high-performance serving framework for LLMs
Official PyTorch implementation of "Authentic Hand Avatar from a Phone Scan via Universal Hand Model", CVPR 2024.
📖 This is a repository for organizing papers, code, and other resources related to unified multimodal models.
Efficient Triton Kernels for LLM Training
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive semi-automatic image annotation tool.
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
SGLang is a fast serving framework for large language models and vision language models.
CVPR 2024 Papers Autonomous Driving
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
An automated pipeline for evaluating LLMs for role-playing.
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Explore LLM model deployment based on AXera's AI chips
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
📖 A curated list of Awesome LLM Inference Papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
Projects based on SigLIP (Zhai et al., 2023) and Hugging Face transformers integration 🤗
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"