ziqipang

Follow

🎯

Focusing

Ziqi Pang ziqipang

🎯

Focusing

Follow

137 followers · 107 following

UIUC & Peking University & TuSimple
Urbana, USA
https://ziqipang.github.io

Achievements

Achievements

Lists (7)

Sort

Datasets

Foundation Models

Motion Forecasting

Nice Resources

Paper List

Simulator

Single/Multi-view Perception

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

daixiangzi / Awesome-Token-Compress

A paper list of some recent works about Token Compress for Vit and VLM

45 Updated Sep 21, 2024

YunzeMan / Lexicon3D

Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding

Python 25 4 Updated Sep 6, 2024

yfzhang114 / MME-RealWorld

✨✨ MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 65 5 Updated Sep 20, 2024

Lyken17 / sample-video

1 Updated Aug 13, 2024

umfieldrobotics / TURTLMap

Textureless Underwater Real Time Localization and Mapping

22 1 Updated Aug 10, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,505 4,496 Updated Sep 20, 2024

YunzeMan / Situation3D

[CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning

Python 16 1 Updated Jun 26, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

28,744 1,575 Updated Aug 1, 2024

Song-Jingyu / CRKD

We propose CRKD to bridge the performance gap between LC and CR detectors with a novel cross-modality knowledge distillation (KD) framework.

Python 22 3 Updated Jul 3, 2024

TencentARC / Open-MAGVIT2

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 608 21 Updated Sep 12, 2024

Restricted-Memory / RMem

official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation

Python 29 2 Updated Aug 22, 2024

CompVis / taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 5,708 1,138 Updated Jul 30, 2024

lllyasviel / Omost

Your image is almost there!

Python 7,228 418 Updated Jul 26, 2024

NVlabs / RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 622 23 Updated Sep 16, 2024

3dlg-hcvc / M3DRef-CLIP

[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects

Python 68 3 Updated Jan 26, 2024

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,360 105 Updated Sep 22, 2024

zzxslp / SoM-LLaVA

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Python 114 2 Updated Aug 23, 2024

a1600012888 / PhysDreamer

Code for PhysDreamer

Python 464 23 Updated Sep 15, 2024

myshell-ai / JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Python 958 79 Updated Jul 23, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

11,812 763 Updated Sep 19, 2024

naver / dust3r

DUSt3R: Geometric 3D Vision Made Easy

Python 5,047 549 Updated Sep 20, 2024

IDEA-Research / T-Rex

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,159 128 Updated Aug 29, 2024

FoundationVision / VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 4,015 301 Updated Jul 16, 2024

mit-han-lab / efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

Python 1,770 162 Updated Aug 9, 2024

princeton-nlp / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 13,326 1,306 Updated Sep 16, 2024

snap-research / MyVLM

Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)

Python 142 8 Updated Jul 5, 2024

ByungKwanLee / MoAI

[ECCV 2024] Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve performance of numerous zero-shot vision language tasks.

Python 304 31 Updated Mar 28, 2024

MengLcool / SEGIC

16 Updated Jul 10, 2024

fuxiao0719 / GeoWizard

[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Python 715 32 Updated Aug 27, 2024

bfshi / scaling_on_scales

When do we not need larger vision models?

Python 316 9 Updated Aug 19, 2024