Skip to content
View leo3349's full-sized avatar

Block or report leo3349

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 1,626 131 Updated Sep 11, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 3,816 293 Updated Sep 19, 2024

[WIP] Streaming Audio Models Examples in JS

JavaScript 8 1 Updated Mar 29, 2024

🥚 Transform PDF to JSON or Markdown with ease and speed 🐣

Python 476 46 Updated Sep 21, 2024

Noise supression using deep filtering

Python 2,369 219 Updated Jul 31, 2024

Python text-to-speech library with built-in voice effects and support for multiple TTS engines

Python 15 3 Updated Jun 2, 2024

Lightweight, performant, deep table extraction

Python 262 17 Updated Sep 20, 2024

AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…

Python 2,840 436 Updated Sep 20, 2024

Real time interactive streaming digital human

Python 3,499 488 Updated Sep 21, 2024

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 4,863 303 Updated Oct 18, 2023

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,004 395 Updated Sep 11, 2024

实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果

Python 202 31 Updated Jul 1, 2024

Recurrent neural network for audio noise reduction

C 3,997 889 Updated Aug 24, 2024

Table Recognition and Content Extraction in PDF Files

Python 23 7 Updated Apr 22, 2019

OpenCV-Python图像处理教程

Python 2 1 Updated Nov 30, 2018

darknet text detect and darknet cnn ocr

C 1,137 287 Updated Oct 12, 2021

yolo3+ocr

Python 5,918 1,729 Updated Aug 29, 2022

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 6,986 367 Updated Sep 21, 2024
Python 469 39 Updated Jun 7, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 4,803 327 Updated Sep 20, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 11,455 859 Updated Sep 20, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,573 2,495 Updated Aug 28, 2024

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…

Python 1,795 297 Updated Sep 1, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 33,582 4,083 Updated Aug 16, 2024

Brand new TTS solution

Python 12,170 923 Updated Sep 20, 2024

Instant voice cloning by MIT and MyShell.

Python 28,463 2,784 Updated Aug 21, 2024

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 4,421 554 Updated Aug 9, 2024

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Python 52,035 6,775 Updated Sep 12, 2024

使用GAN擦出文档印章 remove stamp by GAN

Python 150 29 Updated May 20, 2021

多平台容器镜像代理服务,支持 Docker Hub, GitHub, Google, k8s, Quay, Microsoft 等镜像仓库.

Shell 1,114 108 Updated Sep 3, 2024
Next