The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,328 970 Updated Oct 5, 2024

opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具，支持PDF/网页/多格式电子书提取。

Python 12,371 919 Updated Sep 30, 2024

tencent-ailab / V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

Python 2,202 277 Updated Jun 29, 2024

megvii-research / megactor

Python 737 100 Updated Aug 29, 2024

BadToBest / EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 2,560 303 Updated Aug 15, 2024

OpenDriveLab / Vista

[NeurIPS 2024] A Generalizable World Model for Autonomous Driving

Python 502 33 Updated Oct 1, 2024

ucd-dare / CarDreamer

World Model based Autonomous Driving Platform in CARLA 🚗

Python 131 25 Updated Sep 26, 2024

lkwq007 / stablediffusion-infinity

Outpainting with Stable Diffusion on an infinite canvas

Python 3,845 303 Updated May 16, 2023

Uminosachi / sd-webui-inpaint-anything

Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

Python 1,108 101 Updated Aug 9, 2024

lllyasviel / Fooocus

Focus on prompting and generating

Python 40,509 5,649 Updated Aug 21, 2024

nicyyyy / FPGA-CLAHE

Constrast limited adaptive histogram equlization based on Verilog

Verilog 21 4 Updated Jul 21, 2023

changyanchuan / TrajCL

Contrastive Trajectory Similarity Learning with Dual-Feature Attention (TrajCL) - ICDE 2023

Python 42 10 Updated Aug 1, 2023

fudan-generative-vision / hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 9,259 1,275 Updated Sep 14, 2024

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,233 1,062 Updated May 23, 2024

Wangt-CN / DisCo

[CVPR2024] DisCo: Referring Human Dance Generation in Real World

Python 1,056 114 Updated Jul 22, 2024

Boese0601 / MagicDance

[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Python 685 61 Updated Jul 3, 2024

magic-research / magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python 10,399 1,066 Updated Jun 21, 2024

ID-Animator / ID-Animator

Python 343 26 Updated Jun 6, 2024

HVision-NKU / StoryDiffusion

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 5,834 582 Updated Sep 26, 2024

TIGER-AI-Lab / ConsistI2V

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)

Python 202 14 Updated Jul 1, 2024

Zejun-Yang / AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,546 571 Updated Jul 2, 2024

johndpope / Emote-hack

Emote Portrait Alive - using ai to reverse engineer code from white paper. (abandoned)

Python 168 7 Updated Sep 20, 2024

TadasBaltrusaitis / OpenFace

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

MATLAB 6,874 1,842 Updated Jun 1, 2024

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 10,365 849 Updated Jul 31, 2024

harryzhangOG / Deep-RL-Notes

A collection of comprehensive notes on Deep Reinforcement Learning, customized for UC Berkeley's CS 285 (prev. CS 294-112)

TeX 1,168 188 Updated Apr 2, 2023

ZikangZhou / QCNet

[CVPR 2023] Query-Centric Trajectory Prediction

Python 481 76 Updated Oct 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kmzy youngzhou1999

Achievements