Skip to content
View FrankYoungchen's full-sized avatar

Block or report FrankYoungchen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Dead simple FLUX LoRA training UI with LOW VRAM support

Python 748 49 Updated Sep 21, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,181 87 Updated Aug 22, 2024

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 7,522 696 Updated Sep 22, 2024

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python 7,995 2,569 Updated Aug 13, 2024
Python 454 25 Updated Nov 29, 2023

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Python 572 20 Updated Sep 20, 2024

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 689 22 Updated Sep 20, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,030 85 Updated Aug 6, 2024

OMG-LLaVA and OMG-Seg codebase

Python 1,231 48 Updated Aug 16, 2024

AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Python 60 9 Updated Sep 4, 2024

A throughput-oriented high-performance serving framework for LLMs

Cuda 487 17 Updated Sep 21, 2024

Official PyTorch implementation of "Authentic Hand Avatar from a Phone Scan via Universal Hand Model", CVPR 2024.

Python 65 1 Updated Jul 10, 2024

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

134 1 Updated Sep 9, 2024

Efficient Triton Kernels for LLM Training

Python 2,987 153 Updated Sep 20, 2024

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Python 1,496 100 Updated Jul 22, 2024

Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具

Python 1,211 129 Updated Sep 14, 2024

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

Python 769 38 Updated Sep 20, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,228 381 Updated Sep 20, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,206 369 Updated Sep 22, 2024

CVPR 2024 Papers Autonomous Driving

173 14 Updated Aug 12, 2024

[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"

Python 174 9 Updated Aug 15, 2024

An automated pipeline for evaluating LLMs for role-playing.

Python 120 3 Updated Sep 14, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,885 404 Updated Sep 6, 2024

Explore LLM model deployment based on AXera's AI chips

C++ 48 4 Updated Sep 4, 2024

Multi-view Diffusion for 3D Generation

Python 770 56 Updated Oct 7, 2023

LLM101n: Let's build a Storyteller

28,756 1,575 Updated Aug 1, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,092 65 Updated Aug 13, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,522 170 Updated Sep 19, 2024

Projects based on SigLIP (Zhai et. al, 2023) and Hugging Face transformers integration 🤗

Jupyter Notebook 122 10 Updated Jan 10, 2024

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 904 41 Updated Aug 12, 2024
Next