Skip to content
View MoonRide303's full-sized avatar

Block or report MoonRide303

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models (LLMs).

Jupyter Notebook 119 15 Updated Sep 11, 2024

High accuracy RAG for answering questions from scientific documents with citations

Python 5,508 522 Updated Sep 19, 2024

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,403 93 Updated Aug 7, 2024

An extremely fast Python linter and code formatter, written in Rust.

Rust 31,119 1,032 Updated Sep 19, 2024

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

Jupyter Notebook 40 1 Updated Aug 30, 2024

interactive visualization of 5 popular gradient descent methods with step-by-step illustration and hyperparameter tuning UI

C++ 1,181 119 Updated Aug 4, 2024

Official inference repo for FLUX.1 models

Python 13,814 976 Updated Sep 13, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,833 885 Updated Aug 21, 2024

Test Software for the Characterization of AI Technologies

Python 214 31 Updated Sep 19, 2024

Make it easy to automatically and uniformly measure the behavior of many AI Systems.

Python 25 7 Updated Sep 14, 2024

Code and example data for the paper: Rule Based Rewards for Language Model Safety

Jupyter Notebook 131 13 Updated Jul 19, 2024

Agentic components of the Llama Stack APIs

Python 3,239 322 Updated Sep 19, 2024

Utilities intended for use with Llama models.

Python 3,854 698 Updated Sep 18, 2024

DataComp for Language Models

HTML 1,112 97 Updated Sep 5, 2024

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 196 15 Updated Aug 19, 2024

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…

Jupyter Notebook 2,253 225 Updated Sep 17, 2024

WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining …

Jupyter Notebook 72 21 Updated Apr 27, 2024

Improving Alignment and Robustness with Circuit Breakers

Jupyter Notebook 124 16 Updated Jul 12, 2024

Generative AI extensions for onnxruntime

C++ 425 99 Updated Sep 19, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 14,135 2,851 Updated Sep 19, 2024

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 933 47 Updated Sep 3, 2024

A curated list of awesome leaderboard-oriented resources for foundation models

183 18 Updated Sep 15, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,876 403 Updated Sep 6, 2024

A Python toolbox for performing gradient-free optimization

Python 3,932 352 Updated Sep 19, 2024

FAIR Sequence Modeling Toolkit 2

Python 674 78 Updated Sep 19, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 1,981 129 Updated Sep 3, 2024

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 506 20 Updated Sep 16, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,916 500 Updated Sep 19, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,242 501 Updated Jul 31, 2024
Next