Stars
SSD: Single Shot MultiBox Detector | a PyTorch Tutorial to Object Detection
Official PyTorch implementation of PyramidTabNet: Transformer-based Table Recognition in Image-based Documents
A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.
Tesseract Open Source OCR Engine (main repository)
UB-Mannheim / tesseract
Forked from tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Fast and memory-efficient exact attention
A simple tool for labeling object bounding boxes in images.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Checkbox Detection Model for Scanned Documents
A modular graph-based Retrieval-Augmented Generation (RAG) system
Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"
Autoware - the world's leading open-source software project for autonomous driving
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system in 275+ supported cars.
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark
Master programming by recreating your favorite technologies from scratch.
Official implementation of "Perturbed-Attention Guidance"
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Instant voice cloning by MIT and MyShell.
Time series forecasting with PyTorch
ML has an impact on the climate. But not all models are born equal. Compute your model's emissions with our calculator and add the results to your paper with our generated LaTeX template.