Starred repositories

Zero Bubble Pipeline Parallelism

Python 257 13 Updated Sep 4, 2024

A lecture note for understanding deep learning

Jupyter Notebook 169 19 Updated Jul 24, 2024

Shanghai Jiao Tong University course-registration grabbing script (2020 temporary version update)

Python 162 41 Updated Jun 4, 2024

unlocks the 60 fps cap

C# 2,695 209 Updated Aug 28, 2024
C 169 18 Updated Sep 16, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 25,216 5,216 Updated Sep 20, 2024

Distributed Training Over-The-Internet

587 21 Updated Aug 27, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1,820 303 Updated Sep 19, 2024

FlagScale is a large model toolkit based on open-sourced projects.

Python 132 40 Updated Sep 19, 2024

A library to analyze PyTorch traces.

Python 272 37 Updated Sep 7, 2024

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 692 164 Updated Sep 20, 2024

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 778 38 Updated Nov 4, 2023

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Python 712 88 Updated Sep 13, 2024

A fast communication-overlapping library for tensor parallelism on GPUs.

C++ 189 13 Updated Sep 18, 2024

Implementation of a parallel least squares support vector machine using multiple backends for different GPU vendors.

C++ 35 10 Updated Sep 19, 2024

An OAI compatible exllamav2 API that's both lightweight and fast

Python 463 65 Updated Sep 20, 2024

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,499 266 Updated Sep 20, 2024

Assistant for Gakuen Idolmaster (学園アイドルマスター/学マス)

Python 169 8 Updated Aug 13, 2024

AI for MAA

Jupyter Notebook 153 9 Updated Jul 27, 2023
C++ 1 Updated Dec 2, 2022

ThunderSVM: A Fast SVM Library on GPUs and CPUs

C++ 1,562 217 Updated Apr 1, 2024

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Svelte 39,635 4,631 Updated Sep 20, 2024

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 662 31 Updated Sep 19, 2024
5,612 669 Updated Aug 14, 2024

Downloader for iwara using Selenium

Python 1 Updated Jun 14, 2023

Process-aware, eBPF-based tcpdump

C 446 36 Updated Sep 17, 2024

Passive ping network monitoring utility (C++)

C++ 84 15 Updated Apr 20, 2024

Making eBPF programming easier via build env and examples

C 420 84 Updated Jan 31, 2024

A curated list of awesome projects related to eBPF.

4,188 358 Updated Aug 18, 2024

Mitsuba 3: A Retargetable Forward and Inverse Renderer

C++ 2,022 232 Updated Sep 20, 2024