Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 47,264 6,720 Updated Oct 8, 2024

chatchat-space / Langchain-Chatchat

Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 31,471 5,489 Updated Sep 30, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,351 939 Updated Oct 1, 2024

NVIDIA / GenerativeAIExamples

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Python 2,213 441 Updated Oct 1, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,731 2,449 Updated Oct 8, 2024

xai-org / grok-1

Grok open release

Python 49,464 8,323 Updated Aug 30, 2024

ollama / ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 92,381 7,277 Updated Oct 8, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,966 4,129 Updated Oct 8, 2024

netease-youdao / QAnything

Question and Answer based on Anything.

Python 11,557 1,118 Updated Sep 27, 2024

QwenLM / qwen.cpp

C++ implementation of Qwen-LM

C++ 539 49 Updated Dec 25, 2023

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,447 669 Updated Aug 14, 2024

google-gemini / generative-ai-android

The official Android library for the Google Gemini API

Kotlin 717 154 Updated Sep 16, 2024

jart / emacs-copilot

Large language model code completion for Emacs

Emacs Lisp 707 19 Updated Jan 7, 2024

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,823 1,055 Updated Aug 15, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,196 659 Updated Sep 30, 2024

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,617 1,108 Updated Sep 24, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,713 2,110 Updated Jul 18, 2024

Stability-AI / stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Python 38,686 4,991 Updated Sep 20, 2024

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 93,151 14,980 Updated Oct 8, 2024