Skip to content
View ziqipang's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report ziqipang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

A paper list of some recent works about Token Compress for Vit and VLM

45 Updated Sep 21, 2024

Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding

Python 25 4 Updated Sep 6, 2024

✨✨ MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 65 5 Updated Sep 20, 2024
1 Updated Aug 13, 2024

Textureless Underwater Real Time Localization and Mapping

22 1 Updated Aug 10, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,505 4,496 Updated Sep 20, 2024

[CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning

Python 16 1 Updated Jun 26, 2024

LLM101n: Let's build a Storyteller

28,744 1,575 Updated Aug 1, 2024

We propose CRKD to bridge the performance gap between LC and CR detectors with a novel cross-modality knowledge distillation (KD) framework.

Python 22 3 Updated Jul 3, 2024

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 608 21 Updated Sep 12, 2024

official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation

Python 29 2 Updated Aug 22, 2024

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 5,708 1,138 Updated Jul 30, 2024

Your image is almost there!

Python 7,228 418 Updated Jul 26, 2024

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 622 23 Updated Sep 16, 2024

[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects

Python 68 3 Updated Jan 26, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,360 105 Updated Sep 22, 2024

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Python 114 2 Updated Aug 23, 2024

Code for PhysDreamer

Python 464 23 Updated Sep 15, 2024

Reaching LLaMA2 Performance with 0.1M Dollars

Python 958 79 Updated Jul 23, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,812 763 Updated Sep 19, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 5,047 549 Updated Sep 20, 2024

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,159 128 Updated Aug 29, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 4,015 301 Updated Jul 16, 2024

EfficientViT is a new family of vision models for efficient high-resolution vision.

Python 1,770 162 Updated Aug 9, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 13,326 1,306 Updated Sep 16, 2024

Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)

Python 142 8 Updated Jul 5, 2024

[ECCV 2024] Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve performance of numerous zero-shot vision language tasks.

Python 304 31 Updated Mar 28, 2024
16 Updated Jul 10, 2024

[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Python 715 32 Updated Aug 27, 2024

When do we not need larger vision models?

Python 316 9 Updated Aug 19, 2024
Next