TechxGenus
🎯 Focusing
  • USTC

Starred repositories

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 2,848 235 Updated Sep 19, 2024

A native PyTorch Library for large model training

Python 2,016 156 Updated Sep 19, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,295 175 Updated Sep 20, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 339 26 Updated Jun 29, 2024

EvolKit is a framework for automatically increasing the complexity of instructions used to fine-tune Large Language Models (LLMs).

Jupyter Notebook 121 15 Updated Sep 11, 2024

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 285 10 Updated Sep 18, 2024

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

Python 433 39 Updated Sep 6, 2024

🌟 Yi-Coder is a series of open-source code language models that delivers state-of-the-art coding performance with fewer than 10 billion parameters.

HTML 311 21 Updated Sep 18, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,380 907 Updated Sep 19, 2024

A throughput-oriented high-performance serving framework for LLMs

Cuda 482 17 Updated Sep 20, 2024

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 314 20 Updated Sep 19, 2024

Official Implementation of EAGLE-1 and EAGLE-2

Python 754 74 Updated Aug 28, 2024

LinkedIn_AIHawk is a tool that automates the job application process on LinkedIn. Using artificial intelligence, it enables users to apply for multiple job offers in an automated and personalized way.

Python 12,116 1,897 Updated Sep 17, 2024

LongRoPE is a method that extends the context window of pre-trained LLMs to 2048k tokens.

Python 83 6 Updated Aug 23, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,213 150 Updated Jun 25, 2024

Efficient Triton Kernels for LLM Training

Python 2,966 152 Updated Sep 19, 2024

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Jupyter Notebook 318 44 Updated Aug 25, 2024

Dafny is a verification-aware programming language

C# 2,880 256 Updated Sep 20, 2024

Lean 4 programming language and theorem prover

Lean 4,520 397 Updated Sep 20, 2024

🔨 Useful research tools for AI work

2,302 344 Updated Jun 10, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,534 1,005 Updated Sep 10, 2024

Microsoft Automatic Mixed Precision Library

Python 509 42 Updated Sep 18, 2024

Low-bit LLM inference on CPU with lookup table

C++ 443 32 Updated Sep 14, 2024

Helpful tools and examples for working with flex-attention

Python 349 14 Updated Aug 17, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization.

Python 1,820 303 Updated Sep 19, 2024