Starred repositories
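AIGC-interview/CV-interview/LLMs-interview: a collection of interview questions and answers, also covering new ideas, new problems, new resources, and new projects from work and research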
Video-P2P: Video Editing with Cross-attention Control
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Stable-Hair: Real-World Hair Transfer via Diffusion Model
ViViD: Video Virtual Try-on using Diffusion Models
Official inference repo for FLUX.1 models
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
SEED-Story: Multimodal Long Story Generation with Large Language Model
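Hello Algo (《Hello 算法》): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is ongoing
Free downloads of English-language magazines, including The Economist (with audio), The New Yorker, The Guardian, Wired, and The Atlantic, in epub, mobi, and pdf formats, updated weekly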
VideoTetris: Towards Compositional Text-To-Video Generation
[ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals and generates natural, harmonious results from them.
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model with performance approaching GPT-4o
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Enjoy the magic of Diffusion models!
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
RS5M: a large-scale vision language dataset for remote sensing
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
A Survey on Vision-Language Geo-Foundation Models (VLGFMs)
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text