Skip to content
View ltp1995's full-sized avatar
  • Tongji University
  • Shanghai

Organizations

@Tongji-MIC-Lab

Block or report ltp1995

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Collection of Open Source Projects Related to GPT,GPT相关开源项目合集🚀、精选🔥🔥

Python 5,484 515 Updated Aug 21, 2024

自动驾驶散修一枚 -> 记录自己对自动驾驶的学习过程和相关学习链接

Makefile 328 44 Updated Sep 10, 2024

A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-to-end driving

Python 64 4 Updated Jul 15, 2024
Python 196 7 Updated Jul 28, 2024

CVPR 2024 Papers Autonomous Driving

170 14 Updated Aug 12, 2024

PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"

Python 406 38 Updated Jun 12, 2024

Learning to Drive with GPT

Python 224 12 Updated Feb 1, 2024

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

900 44 Updated Aug 13, 2024

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 690 44 Updated Jul 29, 2024

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,856 206 Updated Jul 27, 2024
Python 52 Updated Apr 24, 2024

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python 759 41 Updated Jul 21, 2024
Python 11 2 Updated Sep 15, 2023

Demo code of the paper "Deep Image Registration With Depth-Aware Homography Estimation"

Python 8 1 Updated Feb 18, 2023

Obtain bird's eye view of a scene from a single input image

Python 99 21 Updated Jun 30, 2021

[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.

Jupyter Notebook 104 5 Updated Jul 16, 2024

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023

Python 144 5 Updated Sep 9, 2024
3 3 Updated Aug 28, 2023

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,552 242 Updated Mar 5, 2024

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

Python 497 33 Updated Jul 21, 2023

[CVPR 2024] 🎬💭 chat with over 10K frames of video!

Python 490 39 Updated Sep 6, 2024
Python 703 76 Updated Sep 14, 2023

[TPAMI'2023]Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling

Python 1 Updated Jan 3, 2023

[ICIP'23]Structure-aware Generative Adversarial Network for Text-to-image Generation

Jupyter Notebook 5 Updated Jul 11, 2023

Codes From Top Teams in 2023 AIC challenge

75 5 Updated Jun 5, 2023

VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automatic pipeline starting from the Conceptual Captions Image-Cap…

76 3 Updated Dec 5, 2022

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Python 254 15 Updated May 28, 2024

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 2,981 245 Updated Sep 5, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,298 2,907 Updated Sep 2, 2024
Next