Stars
Machine Learning Engineering Open Book
Acceptance rates for the major AI conferences
Automatic Generation of Visualizations and Infographics using Large Language Models
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Latency and Memory Analysis of Transformer Models for Training and Inference
Performs benchmarking on two Korean datasets with minimal time and effort.
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
A beautiful, simple, clean, and responsive Jekyll theme for academics
A high-throughput and memory-efficient inference and serving engine for LLMs
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
An open source implementation of CLIP.
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Roadmap to becoming a data engineer in 2021
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
Build AI Assistants with memory, knowledge and tools.
Demo about realtime analytics of user behavior using elk stack/apache spark streaming+mllib/redis/slamdata
The official PyTorch implementation of Google's Gemma models
An LLM-powered advanced RAG pipeline built from scratch
A comprehensive guide to building RAG-based LLM applications for production.
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
A proof-of-concept of retrieval-augmented generation, using Google's PaLM API.