Stars
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
Diffusion Illusions: Hiding Images in Plain Sight
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
An extremely fast Python package and project manager, written in Rust.
High performance self-hosted photo and video management solution.
🎨 The exhaustive Pattern Matching library for TypeScript, with smart type inference.
We write your reusable computer vision tools. 💜
[IROS24] Offical Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Inegrated into Nerfstudio
real time face swap and one-click video deepfake with only a single image
Misc; latest version of waifu2x; 2D video to stereo 3D video conversion
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Fork du code de LMSYS (FastChat) pour l'arène de comparaison de LLM francophones LANGU:IA
The code used to train and run inference with the ColPali architecture.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Using a pre-commit hook, Talisman validates the outgoing changeset for things that look suspicious — such as tokens, passwords, and private keys.
🎬 An opensource LTI Learning Content Management System (LCMS)
🔀 Deployement of LLM at a large scale using VLLM server for inference
etalab-ia / doctr
Forked from mindee/doctrDocument Text Recognition (DocTR) made seamless, high-performing & accessible to anyone using Deep Learning for OCR-related tasks.
OCR, layout analysis, reading order, line detection in 90+ languages
Docmost is an open-source collaborative wiki and documentation software. It is an open-source alternative to Confluence and Notion.
FFmpeg for browser, powered by WebAssembly
Display PDFs in your React app as easily as if they were images.
Implementation of Nougat Neural Optical Understanding for Academic Documents
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Creating beautiful plots of data maps
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
🇫🇷 The French Government Design system React toolkit