Stars
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration
Code for the paper Breaking reCAPTCHAv2 accepted at COMPSAC 2024
StoryMaker: Towards consistent characters in text-to-image generation
MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
[CSUR] A Survey on Video Diffusion Models
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
High-resolution models for human tasks.
Scratch extension for deep learning education
Command-line program to download videos from YouTube.com and other video sites
🎥 Python and OpenCV-based scene cut/transition detection program & library.
哔哩下载姬downkyi,哔哩哔哩网站视频下载工具,支持批量下载,支持8K、HDR、杜比视界,提供工具箱(音视频提取、去水印等)。
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
A repo for parsing m3u8 link and downloading non-DRM protected movies from iQIYI (爱奇艺).
Download videos from websites like YouTube and many others (based on yt-dlp)
This is the official reproduction of FancyVideo.
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
Code for the paper "Restoring Degraded Old Films with Recursive Recurrent Transformer Networks"
Denoising Diffusion Probabilistic Models
Concept Sliders for Precise Control of Diffusion Models
Official Implementation of weights2weights