Skip to content
View rbgo404's full-sized avatar
  • INFERLESS
  • BANGALORE

Block or report rbgo404

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results
7 Updated Mar 19, 2024

Inference and training library for high-quality TTS models.

Python 4,263 428 Updated Sep 23, 2024

Extract clean markdown from PDFs, URLs, Word docs, slides, videos, and more, ready for any LLM. โšก

Python 1,096 70 Updated Sep 13, 2024

Just some stuff for Interview questions, books, annotated paper, notes, cheat sheets etc etc related to ML,AI, Deep Learning and Data Science

107 15 Updated Jul 12, 2024

OpenAI compatible API for TensorRT LLM triton backend

Rust 152 25 Updated Aug 1, 2024

๐Ÿง ๐Ÿ’ฌ Articles I wrote about machine learning, archived from MachineCurve.com.

3,385 721 Updated Jun 28, 2024

AI powered one-click comprehensive docs from transcripts and text.

TypeScript 1,529 97 Updated Aug 30, 2024

Machine Learning Engineering Open Book

Python 11,074 663 Updated Sep 19, 2024

This repository contains tutorials and examples for Triton Inference Server

Python 534 91 Updated Sep 26, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,574 4,057 Updated Sep 29, 2024

๐Ÿฆ– ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป about ๐—Ÿ๐—Ÿ๐— ๐˜€, ๐—Ÿ๐—Ÿ๐— ๐—ข๐—ฝ๐˜€, and ๐˜ƒ๐—ฒ๐—ฐ๐˜๐—ผ๐—ฟ ๐——๐—•๐˜€ for free by designing, training, and deploying a real-time financial advisor LLM system ~ ๐˜ด๐˜ฐ๐˜ถ๐˜ณ๐˜ค๐˜ฆ ๐˜ค๐˜ฐ๐˜ฅ๐˜ฆ + ๐˜ท๐˜ช๐˜ฅ๐˜ฆ๐˜ฐ & ๐˜ณ๐˜ฆ๐˜ข๐˜ฅ๐˜ช๐˜ฏ๐˜จ ๐˜ฎ๐˜ข๐˜ต๐˜ฆ๐˜ณ๐˜ช๐˜ข๐˜ญ๐˜ด

Jupyter Notebook 2,988 462 Updated Apr 7, 2024

A bagel, with everything.

Python 307 31 Updated Apr 11, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,364 466 Updated Sep 28, 2024

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,110 64 Updated Feb 14, 2024