Stars
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Official Implementation for "Only a Matter of Style: Age Transformation Using a Style-Based Regression Model" (SIGGRAPH 2021) https://arxiv.org/abs/2102.02754
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Image to prompt with BLIP and CLIP
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Desktop app for prototyping and debugging LangGraph applications locally.
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
video.js plugin for recording audio/video/image files
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码
LangChain 的中文入门教程
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview
CapsWriter 的离线版,一个好用的 PC 端的语音输入工具
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Multilingual Voice Understanding Model
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark.