Skip to content
View nav13n's full-sized avatar
🐍
🐍

Highlights

  • Pro

Block or report nav13n

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Kickstart your MLOps initiative with a flexible, robust, and productive Python package.

Jupyter Notebook 639 96 Updated Jul 28, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,725 1,055 Updated Sep 10, 2024
Python 251 33 Updated Aug 20, 2024

Social Distancing Detector using deep learning and capable to run on edge AI devices such as NVIDIA Jetson, Google Coral, and more.

Python 140 38 Updated May 23, 2023

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

Jupyter Notebook 289 29 Updated Aug 27, 2024

Guide for fine-tuning Llama/Mistral/CodeLlama models and more

Python 524 79 Updated Aug 28, 2024
Jupyter Notebook 158 37 Updated Jun 3, 2024

A collection of postmortems. Sorry for the delay in merging PRs!

11,272 437 Updated Jul 24, 2024

Inspect: A framework for large language model evaluations

Python 574 100 Updated Oct 7, 2024

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 968 49 Updated Sep 11, 2024

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,081 52 Updated Nov 4, 2023

The Art of Debugging

C 800 31 Updated Aug 3, 2024

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 2,803 254 Updated May 3, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,062 167 Updated Aug 11, 2024

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,316 115 Updated Apr 17, 2024

Mamba SSM architecture

Python 12,760 1,076 Updated Oct 7, 2024

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,721 91 Updated Jan 21, 2024

The goal of this project is to enable users to create cool web demos using the newly released OpenAI GPT-3 API with just a few lines of Python.

JavaScript 2,900 883 Updated Oct 4, 2023

LLM Frontend for Power Users.

JavaScript 7,736 2,190 Updated Oct 7, 2024

📋 A list of open LLMs available for commercial use.

11,005 703 Updated Jul 5, 2024

Machine Learning Engineering Open Book

Python 11,235 677 Updated Oct 5, 2024

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

C++ 23,760 1,817 Updated Oct 7, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 92,336 7,270 Updated Oct 7, 2024

Making large AI models cheaper, faster and more accessible

Python 38,697 4,336 Updated Sep 30, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,344 938 Updated Oct 1, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,155 543 Updated Sep 27, 2024

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 9,488 737 Updated Sep 30, 2024

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 7,152 503 Updated Sep 18, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,235 153 Updated Jun 25, 2024

A CLI that writes your git commit messages for you with AI

TypeScript 7,787 368 Updated Aug 15, 2024
Next