Stars
The all-in-one solution for RAG. Build, scale, and deploy state-of-the-art Retrieval-Augmented Generation applications
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
AirLLM 70B inference with a single 4GB GPU
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Model components of the Llama Stack APIs
High accuracy RAG for answering questions from scientific documents with citations
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
A Python library to chunk/group your texts based on semantic similarity.
#1 Locally hosted web application that allows you to perform various operations on PDF files
Math OCR model that outputs LaTeX and markdown
A Chrome extension for easy virtual try-on of clothing from any e-commerce store.
Better user experience plugin for ComfyUI
An open-source RAG-based tool for chatting with your documents.
Build AI Assistants with memory, knowledge and tools.
A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
One minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Stable Diffusion web UI
Real-time face swap and one-click video deepfake with only a single image
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to set up.
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
Official inference repo for FLUX.1 models
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone