🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…

TypeScript 41,878 9,442 Updated Sep 29, 2024

Building a quick conversation-based search demo with Lepton AI.

TypeScript 7,740 984 Updated Sep 18, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,241 148 Updated Aug 23, 2024

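Streamer-Sales (Sales Champion): a sales-livestreamer LLM 🛒🎁 that presents products based on their given features, from the angle of stimulating users' desire to buy. 🚀⭐ Includes a detailed data-generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a front end built on the Vue ecosystem 🍍, FastAPI …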

Python 2,392 347 Updated Sep 27, 2024

Notes on knowledge and methods related to large language models

Jupyter Notebook 85 16 Updated Sep 8, 2024

Convert PDF to markdown quickly with high accuracy

Python 16,697 947 Updated Sep 7, 2024

Mixture of Agents using Groq

Python 910 157 Updated Aug 9, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,068 838 Updated Jul 1, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,193 1,057 Updated May 23, 2024

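Pytorch-NLU, a Chinese text classification and sequence labeling toolkit. It supports multi-class and multi-label classification of long and short Chinese texts, and sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, word segmentation, and extractive text summarization.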

Python 323 50 Updated Jul 18, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 17,620 1,684 Updated Sep 27, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 46,739 6,606 Updated Sep 28, 2024

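Welcome to LLM-Dojo, an open-source place to learn about large models. It uses concise, easy-to-read code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓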

Python 258 21 Updated Sep 20, 2024

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 859 45 Updated Jun 25, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 418 43 Updated Sep 20, 2024

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,119 209 Updated Sep 26, 2024

Letta (fka MemGPT) is a framework for creating stateful LLM services.

Python 11,836 1,285 Updated Sep 28, 2024

FastGPT is a knowledge-based platform built on LLMs. It offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 17,166 4,596 Updated Sep 29, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 91,260 7,174 Updated Sep 28, 2024

Tianyancha and Qichacha crawlers: scrape company information for specified keywords

Python 628 168 Updated Feb 16, 2023

LLM training code for Databricks foundation models

Python 3,982 525 Updated Sep 27, 2024

The Triton TensorRT-LLM Backend

Python 663 96 Updated Sep 24, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,499 236 Updated May 1, 2024

Firefly: a large model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python 5,685 517 Updated Sep 19, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,552 4,053 Updated Sep 29, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,451 445 Updated Sep 28, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,291 925 Updated Sep 27, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 342 27 Updated Jun 29, 2024

Retrieval and Retrieval-augmented LLMs

Python 6,979 510 Updated Sep 26, 2024

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 9,436 729 Updated Sep 27, 2024