Skip to content
View Pushkinue's full-sized avatar

Block or report Pushkinue

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 17,336 1,653 Updated Sep 20, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 6,411 614 Updated Sep 21, 2024

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Python 674 75 Updated Sep 18, 2024

Supercharge Your Model Training

Python 5,120 414 Updated Sep 21, 2024

A playbook for systematically maximizing the performance of deep learning models.

26,528 2,204 Updated Jun 18, 2024

All-in-one text de-duplication

Python 588 69 Updated May 21, 2024

Python package for lexicon; Trie and DAWG implementation.

Python 55 7 Updated Jun 10, 2024
Jupyter Notebook 1,173 237 Updated Sep 18, 2024

A Collection of BM25 Algorithms in Python

Python 990 83 Updated May 28, 2024

RAG AutoML Tool - Find optimal RAG pipeline for your own data.

Python 1,393 121 Updated Sep 21, 2024

Chat with your own data - LLM+RAG workshop

Jupyter Notebook 166 59 Updated Sep 21, 2024

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

C++ 1,169 272 Updated Jan 27, 2022

Evaluate your LLM's response with Prometheus and GPT4 πŸ’―

Python 756 47 Updated Sep 9, 2024

Fine-Tuning Embedding for RAG with Synthetic Data

Jupyter Notebook 456 65 Updated Sep 11, 2023

Model interpretability and understanding for PyTorch

Python 4,820 489 Updated Sep 20, 2024

Universal and Transferable Attacks on Aligned Language Models

Python 3,291 462 Updated Aug 2, 2024

Lazy Predict help build a lot of basic models without much code and helps understand which models works better without any parameter tuning

Python 2,869 332 Updated Jun 2, 2024

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 2,889 374 Updated Sep 4, 2024

System design patterns for machine learning

2,253 238 Updated Oct 7, 2021

An open collection of methodologies to help with successful training of large language models.

Python 443 32 Updated Feb 15, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 666 33 Updated Aug 19, 2024

A guidance language for controlling large language models.

Jupyter Notebook 18,725 1,032 Updated Sep 18, 2024

πŸ€– Chat with your SQL database πŸ“Š. Accurate Text-to-SQL Generation via LLMs using RAG πŸ”„.

Python 10,861 843 Updated Sep 17, 2024

Come join the best place on the internet to learn AI skills. Use code "chatbotui" for an extra 20% off.

TypeScript 28,370 7,889 Updated Aug 3, 2024

All the resources you need to get to Senior Engineer and beyond

12,904 1,187 Updated Sep 21, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 270,191 45,619 Updated Aug 7, 2024

A curated list of Large Language Model (LLM) Interpretability resources.

1,073 87 Updated Jul 31, 2024

Probabilistic properties of language

Jupyter Notebook 1 Updated Dec 23, 2023

The tiniest sentence encoder for Russian language

Python 175 10 Updated Jul 25, 2024

A blazing fast inference solution for text embeddings models

Rust 2,615 163 Updated Sep 19, 2024
Next