Stars
Industry leading face manipulation platform
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
real time face swap and one-click video deepfake with only a single image
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
World Model based Autonomous Driving Platform in CARLA 🚗
Outpainting with Stable Diffusion on an infinite canvas
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Constrast limited adaptive histogram equlization based on Verilog
Contrastive Trajectory Similarity Learning with Dual-Feature Attention (TrajCL) - ICDE 2023
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
llama3 implementation one matrix multiplication at a time
[CVPR2024] DisCo: Referring Human Dance Generation in Real World
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Emote Portrait Alive - using ai to reverse engineer code from white paper. (abandoned)
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
A collection of comprehensive notes on Deep Reinforcement Learning, customized for UC Berkeley's CS 285 (prev. CS 294-112)
[CVPR 2023] Query-Centric Trajectory Prediction