Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].

Python 11 2 Updated Sep 28, 2024

[ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation

Python 45 1 Updated Jul 29, 2024

A curated list of audio-visual learning methods and datasets.

221 17 Updated Sep 11, 2024

LLM101n: Let's build a Storyteller

28,982 1,586 Updated Aug 1, 2024

The Multilayer Perceptron Language Model

Python 507 45 Updated Aug 9, 2024

[ECCV 2024] PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

17 Updated Jul 2, 2024

The n-gram Language Model

C 1,308 93 Updated Aug 5, 2024

The Autograd Engine

HTML 497 45 Updated Sep 11, 2024

The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024

Python 22 1 Updated Jul 27, 2024

[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model

Python 110 1 Updated Aug 5, 2024

This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).

784 50 Updated Sep 27, 2024

Mathematical Visual Instruction Tuning for Multi-modal Large Language Models

92 1 Updated Aug 5, 2024

Understand Human Behavior to Align True Needs

Python 3,310 291 Updated Jul 20, 2024

[CVPR2023] Referring Multi-Object Tracking

Python 116 12 Updated Jul 2, 2024

Official PyTorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”

Python 384 40 Updated Jul 5, 2024

VisionLLM Series

Python 857 21 Updated Sep 13, 2024

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

352 19 Updated May 2, 2024

Jupyter Notebook 794 54 Updated Aug 21, 2024

[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'

Python 104 6 Updated Jun 18, 2024

[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Python 460 34 Updated Sep 12, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,197 1,057 Updated May 23, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 6,109 610 Updated Sep 28, 2024

A 4-hour coding workshop to understand how LLMs are implemented and used

Jupyter Notebook 640 155 Updated Sep 20, 2024

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 4,668 146 Updated Sep 11, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,615 3,891 Updated Sep 29, 2024

[ECCV 2024] VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement

Python 24 1 Updated Jul 29, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,070 838 Updated Jul 1, 2024

[CVPR 2024] iKUN: Speak to Trackers without Retraining

Python 103 2 Updated Jun 19, 2024

Python 27 2 Updated Jun 19, 2024

Multi-Granularity Language-Guided Multi-Object Tracking

Python 13 1 Updated Jun 14, 2024