View weixi-feng's full-sized avatar
💭
slow to respond

Highlights

  • Pro


text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 7,848 731 Updated Oct 3, 2024
Python 53 3 Updated Aug 16, 2024

Official inference repo for FLUX.1 models

Python 14,412 1,035 Updated Oct 3, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,299 966 Updated Oct 3, 2024

Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything

Python 969 62 Updated Sep 18, 2024

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Python 1,234 128 Updated Aug 1, 2024

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Jupyter Notebook 896 58 Updated Mar 25, 2023

Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation

Python 9 1 Updated Sep 19, 2024

Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation

Python 34 3 Updated Jun 1, 2024

Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)

Python 61 4 Updated Jun 26, 2024

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 2,802 338 Updated Apr 25, 2024

(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision

Jupyter Notebook 109 3 Updated Jun 25, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,704 112 Updated Sep 19, 2024
Python 16 Updated Jun 22, 2024

Official repo of the paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"

Python 20 1 Updated Sep 21, 2024

[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling

Python 526 62 Updated Jan 9, 2024

[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation

Python 199 6 Updated Sep 27, 2024

Code repository for T2V-Turbo

Python 172 14 Updated Jun 25, 2024

PIPs++

Python 285 34 Updated Jul 8, 2024
Python 388 22 Updated Dec 8, 2023

[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper

Python 121 7 Updated May 7, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,166 89 Updated Oct 3, 2024

Reward Guided Latent Consistency Distillation

Python 14 Updated May 28, 2024

[ICLR 2024 spotlight] Official implementation of "InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior".

Python 81 11 Updated Jul 11, 2024
Python 31 3 Updated Sep 16, 2024

Dromedary: towards helpful, ethical and reliable LLMs.

Python 1,115 86 Updated Oct 26, 2023

Official repo for LayoutGPT

Python 285 20 Updated Apr 10, 2024

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Python 484 25 Updated Jul 16, 2024
Python 3,841 253 Updated Mar 15, 2024

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Python 1,092 62 Updated Oct 30, 2023