Skip to content
View DingchenYang99's full-sized avatar

Block or report DingchenYang99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 11,979 842 Updated Sep 13, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Python 200 6 Updated Sep 11, 2024

Train transformer language models with reinforcement learning.

Python 9,323 1,170 Updated Sep 21, 2024

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 90,038 7,068 Updated Sep 21, 2024

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Python 54 4 Updated Aug 7, 2024
Python 1,413 108 Updated May 12, 2023

Efficient Multi-modal Models via Stage-wise Visual Context Compression

Python 34 2 Updated Aug 5, 2024

The official Github page for "Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models"

Python 6 Updated Jul 24, 2024

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Python 1,018 41 Updated May 31, 2024

[EMNLP 2024] Multi-modal code generation problems.

Python 15 Updated Sep 6, 2024

The official Meta Llama 3 GitHub site

Python 26,205 2,951 Updated Aug 12, 2024
Jupyter Notebook 1,122 545 Updated May 13, 2024

💫 [CVPR 2024] LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis

Python 142 13 Updated Jun 18, 2024

The official repo of our work "Pensieve: Retrospect-then-Compare mitigates Visual Hallucination"

Python 14 Updated May 4, 2024

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 408 48 Updated Apr 24, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 8,831 768 Updated Aug 7, 2024

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python 761 41 Updated Jul 21, 2024
Python 278 7 Updated Jan 27, 2024

[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Python 256 22 Updated Aug 24, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,809 761 Updated Sep 19, 2024

[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension

Python 27 4 Updated Apr 8, 2024
Python 25 2 Updated May 9, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,676 949 Updated Aug 23, 2024

huggingface mirror download

Python 547 55 Updated May 22, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,378 2,128 Updated Aug 12, 2024

[CVPR 2024 Highlight] Visual Point Cloud Forecasting

Python 260 17 Updated Jun 24, 2024

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 179 8 Updated Jul 16, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,305 2,906 Updated Sep 2, 2024

Official repository for "IntentQA: Context-aware Video Intent Reasoning" from ICCV 2023.

10 1 Updated Dec 19, 2023
Next