Skip to content
View zd11024's full-sized avatar

Highlights

  • Pro

Block or report zd11024

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Awesome Quadrupedal Robots

Python 511 61 Updated Sep 18, 2024

Mamba SSM architecture

Python 12,702 1,063 Updated Sep 26, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,315 69 Updated Aug 21, 2024

The Paper List on Data Contamination for Large Language Models Evaluation.

50 1 Updated Sep 27, 2024

[ICLR 2023] SQA3D for embodied scene understanding and reasoning

Python 117 3 Updated Oct 13, 2023

A Collection of LiDAR-Camera-Calibration Papers, Toolboxes and Notes

924 139 Updated Aug 20, 2024

😎 Awesome LIDAR list. The list includes LIDAR manufacturers, datasets, point cloud-processing algorithms, point cloud frameworks and simulators.

914 110 Updated Jul 17, 2024

[CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos

Python 51 9 Updated Jan 29, 2024

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 366 55 Updated Sep 1, 2024

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 561 30 Updated Sep 13, 2024

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

893 48 Updated Sep 23, 2024

[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'

Python 104 6 Updated Jun 18, 2024
JavaScript 2,379 843 Updated Jun 21, 2024

[ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"

Python 52 1 Updated Sep 21, 2023

[EMNLP 2023 Demo] CLEVA: Chinese Language Models EVAluation Platform

Shell 55 2 Updated Dec 13, 2023

✨✨Latest Advances on Multimodal Large Language Models

11,954 769 Updated Sep 25, 2024

Multimodal-GPT

Python 1,467 123 Updated Jun 4, 2023

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,587 245 Updated Dec 12, 2023

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 6,819 772 Updated Aug 24, 2023

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

HTML 255 59 Updated Aug 18, 2022

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,179 1,180 Updated May 28, 2023

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Python 1,197 59 Updated Oct 18, 2022
Python 4 Updated Jul 5, 2023

EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization

Python 33 2 Updated Jan 13, 2024

Vision-Language Pre-training for Image Captioning and Question Answering

Python 411 62 Updated Jan 18, 2022

MAttNet: Modular Attention Network for Referring Expression Comprehension

Jupyter Notebook 292 74 Updated Nov 29, 2022

re-implementation of speaker-listener-reinforcer

Jupyter Notebook 7 2 Updated Mar 16, 2019

Generating Easy-to-Understand Referring Expressions for Target Identifications

Jupyter Notebook 14 1 Updated Aug 30, 2019
Next