Lists (2)
Sort Name ascending (A-Z)
Starred repositories
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
DSPy: The framework for programming—not prompting—foundation models
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
A blazingly fast, local, Ethereum block explorer built on top of Erigon
Blockchain explorer for Ethereum based network and a tool for inspecting and analyzing EVM based blockchains.
A Gradio web UI for Large Language Models.
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Train transformer language models with reinforcement learning.
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Your Automatic Prompt Engineering Assistant for GenAI Applications
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Question and Answer based on Anything.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
The official Android library for the Google Gemini API
Large language model code completion for Emacs
Foundational Models for State-of-the-Art Speech and Text Translation
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
High-Resolution Image Synthesis with Latent Diffusion Models
🦜🔗 Build context-aware reasoning applications