Stars
Character Animation (AnimateAnyone, Face Reenactment)
推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
[ECCV 2024] Embodied Understanding of Driving Scenarios
A curated list of papers and open-source resources focused on 3D AIGC.
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Open-Sora: Democratizing Efficient Video Production for All
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)
Paper reading notes on Deep Learning and Machine Learning
Emu Series: Generative Multimodal Models from BAAI
Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"
[ICCV 2023] Consistent Image Synthesis and Editing
[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration
LangChain 的中文入门教程
🔥 StableIdentity: Inserting Anybody into Anywhere at First Sight
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
[CVPR2024] DisCo: Referring Human Dance Generation in Real World
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"
Python framework that facilitates the quick development of complex video analysis applications and other series-processing based applications in a multiprocessing environment.
SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization
Unofficial Implementation of Animate Anyone