Skip to content
View zhanghengjing2's full-sized avatar

Block or report zhanghengjing2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 11,623 875 Updated Sep 23, 2024

llm-export can export llm model to onnx.

Python 197 21 Updated Sep 19, 2024

export llama to onnx

Python 92 11 Updated May 27, 2024
Python 474 39 Updated Jun 7, 2024

GPT based autonomous agent designed to create personalized newspapers tailored to user preferences.

Python 1,163 160 Updated Jun 21, 2024

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Python 10,942 849 Updated Sep 17, 2024

OCR, layout analysis, reading order, line detection in 90+ languages

Python 9,914 646 Updated Sep 21, 2024

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,413 105 Updated Jul 5, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,527 1,101 Updated Sep 24, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 8,500 518 Updated Sep 26, 2024

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vi…

Python 3,587 310 Updated Sep 26, 2024

llm deploy project based mnn.

C++ 1,429 155 Updated Sep 26, 2024

A real time Multimodal Emotion Recognition web app for text, sound and video inputs

Jupyter Notebook 860 285 Updated Apr 29, 2021

[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".

Python 158 25 Updated May 15, 2024

Kaggle | 1st place solution for Freesound Audio Tagging 2019

Python 313 55 Updated Jun 22, 2022

Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras

Python 571 229 Updated Nov 3, 2023

Question and Answer based on Anything.

Python 11,485 1,112 Updated Sep 23, 2024

Community list of startups working with AI in audio and music technology

1,531 135 Updated Aug 9, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 18,090 1,830 Updated Sep 26, 2024

Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别

Python 975 217 Updated Mar 25, 2023

Open-source IoT Platform - Device management, data collection, processing and visualization.

Java 17,305 5,095 Updated Sep 26, 2024

RestAI is an AIaaS (AI as a Service) open-source platform. Built on top of LlamaIndex, Ollama and HF Pipelines. Supports any public LLM supported by LlamaIndex and any local LLM suported by Ollama.…

Python 356 72 Updated Sep 18, 2024

Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs

Python 772 49 Updated Jun 15, 2024

优质稳定的OpenAI的API接口-For企业和开发者。OpenAI的api proxy,支持ChatGPT的API调用,支持openai的API接口,支持:gpt-4,gpt-3.5。不需要openai Key, 不需要买openai的账号,不需要美元的银行卡,通通不用的,直接调用就行,稳定好用!!智增增

PHP 598 49 Updated Sep 24, 2024

LLM based autonomous agent that does online comprehensive research on any given topic

Python 14,169 1,844 Updated Sep 25, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 46,587 6,576 Updated Sep 27, 2024

✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.

TypeScript 2,081 155 Updated Apr 29, 2024

Stable Diffusion web UI

Python 140,225 26,549 Updated Sep 9, 2024

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…

TypeScript 41,785 9,423 Updated Sep 27, 2024
Next