-
BUPT-PRIV
- Beijing
Stars
Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
Implementation of "TrackFormer: Multi-Object Tracking with Transformers”. [Conference on Computer Vision and Pattern Recognition (CVPR), 2022]
Official inference repo for FLUX.1 models
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Awesome work on object 6 DoF pose estimation
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭…
rafaelperez / ViTMatte-for-Nuke
Forked from hustvl/ViTMatte[Information Fusion] Boosting Image Matting with Pretrained Plain Vision Transformers
[Information Fusion (Vol.103, Mar. '24)] Boosting Image Matting with Pretrained Plain Vision Transformers
A programming framework for agentic AI 🤖
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
A PyTorch Library for Multi-Task Learning
[ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745
official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation
Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation
ECCV 2024 论文和开源项目合集,同时欢迎各位大佬提交issue,分享ECCV 2024论文和开源项目
[AAAI2024]Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
[ECCV2022] Global Spectral Filter Memory Network for Video Object Segmentation
Official code for CVPR2023 Boosting Video Object Segmentation via Space-time Correspondence Learning