Skip to content
View yukke42's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report yukke42

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,399 181 Updated Sep 20, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,899 902 Updated Aug 21, 2024

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 4,636 146 Updated Sep 11, 2024

A unified framework for 3D content generation.

Python 6,145 471 Updated Aug 9, 2024

Generative Models by Stability AI

Python 24,135 2,685 Updated Sep 4, 2024

RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)

Python 280 23 Updated Aug 31, 2024

An open source implementation of CLIP.

Python 9,844 954 Updated Aug 19, 2024

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, sparsity, distillation, etc. It compresses deep learning models for downstream …

Python 444 27 Updated Sep 18, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,544 434 Updated Sep 19, 2024

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python 848 121 Updated Apr 12, 2024

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 1,826 163 Updated Sep 20, 2024

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Python 3,365 646 Updated Aug 23, 2024

The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multi…

Python 240 31 Updated Aug 9, 2024

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 2,007 621 Updated Aug 9, 2023

Python class for calculating confusion matrix for object detection task

Python 85 18 Updated Apr 21, 2022

A toolbox of ocr models and algorithms based on MindSpore

Python 203 50 Updated Aug 19, 2024

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 23,794 3,117 Updated Aug 14, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 42,780 7,686 Updated Sep 21, 2024

Run ruff, isort, pyupgrade, mypy, pylint, flake8, and more on Jupyter Notebooks

Python 1,024 39 Updated Sep 3, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 6,794 521 Updated Jul 17, 2024

A natural language interface for computers

Python 52,274 4,616 Updated Sep 18, 2024

A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D object detection models.

Python 566 141 Updated Aug 7, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,220 162 Updated Aug 1, 2024

(TPAMI 2024) A Survey on Open Vocabulary Learning

797 45 Updated Aug 24, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,680 950 Updated Aug 23, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,234 913 Updated Sep 18, 2024

AMBER: Automated annotation and Multimodal Bag Extraction for Robotics

Python 29 2 Updated Sep 22, 2024

Grounded Language-Image Pre-training

Python 2,162 191 Updated Jan 24, 2024

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python 1,977 206 Updated Aug 15, 2024
Next