- Santa Barbara, CA
- weixi-feng.github.io
- @weixi_feng
Stars
Text-and-image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official inference repo for FLUX.1 models
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Official implementation of the CVPR 2024 highlight paper: Matching Anything by Segmenting Anything
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation
Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation
Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for segmentation.
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Official repo of the paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation of the LVD paper
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
[ICLR 2024 spotlight] Official implementation of "InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior".
Dromedary: towards helpful, ethical and reliable LLMs.
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation