Skip to content
View markurtz's full-sized avatar

Highlights

  • Pro

Organizations

@neuralmagic

Block or report markurtz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,253 3,997 Updated Sep 24, 2024

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 134 9 Updated Sep 16, 2024

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 431 35 Updated Sep 23, 2024

An open-source NLP research library, built on PyTorch.

Python 11,735 2,244 Updated Nov 22, 2022

The open-source tool for building high-quality datasets and computer vision models

Python 8,127 542 Updated Sep 24, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 270,424 45,650 Updated Aug 7, 2024
Jupyter Notebook 5,623 940 Updated Sep 22, 2024

Top-level directory for documentation and general content

MDX 120 7 Updated Jun 23, 2024

Sparsity-aware deep learning inference runtime for CPUs

Python 2,980 172 Updated Jul 19, 2024

ML model optimization product to accelerate inference.

Python 318 28 Updated Apr 10, 2024

Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

Python 364 24 Updated Jul 19, 2024

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Python 2,043 143 Updated Aug 1, 2024