Stars
Inference and training library for high-quality TTS models.
Extract clean markdown from PDFs, URLs, Word docs, slides, videos, and more, ready for any LLM. โก
Just some stuff for Interview questions, books, annotated paper, notes, cheat sheets etc etc related to ML,AI, Deep Learning and Data Science
OpenAI compatible API for TensorRT LLM triton backend
๐ง ๐ฌ Articles I wrote about machine learning, archived from MachineCurve.com.
AI powered one-click comprehensive docs from transcripts and text.
Machine Learning Engineering Open Book
This repository contains tutorials and examples for Triton Inference Server
A high-throughput and memory-efficient inference and serving engine for LLMs
๐ฆ ๐๐ฒ๐ฎ๐ฟ๐ป about ๐๐๐ ๐, ๐๐๐ ๐ข๐ฝ๐, and ๐๐ฒ๐ฐ๐๐ผ๐ฟ ๐๐๐ for free by designing, training, and deploying a real-time financial advisor LLM system ~ ๐ด๐ฐ๐ถ๐ณ๐ค๐ฆ ๐ค๐ฐ๐ฅ๐ฆ + ๐ท๐ช๐ฅ๐ฆ๐ฐ & ๐ณ๐ฆ๐ข๐ฅ๐ช๐ฏ๐จ ๐ฎ๐ข๐ต๐ฆ๐ณ๐ช๐ข๐ญ๐ด
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding