yukke42

🏠

Working from home

Yusuke Muramatsu yukke42

🏠

Working from home

24 followers · 17 following

Japan
https://www.kaggle.com/yukke42

Achievements

x3 x2

Achievements

x3 x2

Lists (6)

Sort

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,399 181 Updated Sep 20, 2024

facebookresearch / segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,899 902 Updated Aug 21, 2024

XuehaiPan / nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 4,636 146 Updated Sep 11, 2024

threestudio-project / threestudio

A unified framework for 3D content generation.

Python 6,145 471 Updated Aug 9, 2024

VAST-AI-Research / TripoSR

Python 4,332 498 Updated Aug 16, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 24,135 2,685 Updated Sep 4, 2024

robustsam / RobustSAM

RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)

Python 280 23 Updated Aug 31, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 9,844 954 Updated Aug 19, 2024

NVIDIA / TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, sparsity, distillation, etc. It compresses deep learning models for downstream …

Python 444 27 Updated Sep 18, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,544 434 Updated Sep 19, 2024

ArrowLuo / CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python 848 121 Updated Apr 12, 2024

chaofengc / IQA-PyTorch

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 1,826 163 Updated Sep 20, 2024

aim-uofa / AdelaiDet

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Python 3,365 646 Updated Aug 23, 2024

ViTAE-Transformer / DeepSolo

The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multi…

Python 240 31 Updated Aug 9, 2024

ankush-me / SynthText

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 2,007 621 Updated Aug 9, 2023

kaanakan / object_detection_confusion_matrix

Python class for calculating confusion matrix for object detection task

Python 85 18 Updated Apr 21, 2022

mindspore-lab / mindocr

A toolbox of ocr models and algorithms based on MindSpore

Python 203 50 Updated Aug 19, 2024

JaidedAI / EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 23,794 3,117 Updated Aug 14, 2024

PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 42,780 7,686 Updated Sep 21, 2024

nbQA-dev / nbQA

Run ruff, isort, pyupgrade, mypy, pylint, flake8, and more on Jupyter Notebooks

Python 1,024 39 Updated Sep 3, 2024

LiheYoung / Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 6,794 521 Updated Jul 17, 2024

OpenInterpreter / open-interpreter

A natural language interface for computers

Python 52,274 4,616 Updated Sep 18, 2024

PaddlePaddle / Paddle3D

A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D object detection models.

Python 566 141 Updated Aug 7, 2024

baaivision / EVA

EVA Series: Visual Representation Fantasies from BAAI

Python 2,220 162 Updated Aug 1, 2024

jianzongwu / Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

797 45 Updated Aug 24, 2024

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,680 950 Updated Aug 23, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,234 913 Updated Sep 18, 2024

rosbag-sharing-community / amber

AMBER: Automated annotation and Multimodal Bag Extraction for Robotics

Python 29 2 Updated Sep 22, 2024

microsoft / GLIP

Grounded Language-Image Pre-training

Python 2,162 191 Updated Jan 24, 2024

IDEA-Research / detrex

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python 1,977 206 Updated Aug 15, 2024

Yusuke Muramatsu yukke42

Lists (6)

3D Object Detection

3D Occupancy Prediction

CUDA/TensorRT

Dataset

OCR

Simulator / Synthetic Data

Stars