A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,121 653 Updated Sep 27, 2024

rayliuca / T-Ragx

Enhancing Translation with RAG-Powered Large Language Models

Python 60 3 Updated Aug 11, 2024

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,818 224 Updated Sep 1, 2024

jianfch / stable-ts

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Python 1,497 170 Updated Sep 12, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,085 3,802 Updated Sep 17, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

11,906 768 Updated Sep 25, 2024

shibeiing / Personality-aware-Training-PAT

speaker adaptation, ASR, personality

Python 2 Updated Mar 7, 2024

Hypotheses-Paradise / UADF

Python 7 Updated May 5, 2024

YuanGongND / ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 366 32 Updated Apr 24, 2024

BriansIDP / WhisperBiasing

Jupyter Notebook 62 2 Updated Sep 12, 2023

raotnameh / End-to-end-E2E-Named-Entity-Recognition-from-English-Speech

Python 27 14 Updated Dec 2, 2020

Lhx94As / Awesome-Spoken-Language-Identification

An awesome spoken LID repository. (Working in progress

Python 93 10 Updated Apr 22, 2024

metame-ai / faster-distil-whisper

Forked from SYSTRAN/faster-whisper

Faster distil-whisper transcription with CTranslate2

Python 10 Updated Jan 23, 2024

ddlBoJack / Speech-Resources

语音方向实验室/公司/资源/实习等，欢迎推荐或自荐

489 62 Updated Sep 24, 2024

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python 8,341 2,396 Updated Aug 13, 2024

espnet / espnet_model_zoo

ESPnet Model Zoo

Python 241 39 Updated Jul 9, 2023

pswietojanski / slurp

Repository for SLURP paper

Python 96 19 Updated Apr 20, 2022

karpathy / randomfun

Notebooks and various random fun

Jupyter Notebook 1,075 128 Updated Apr 18, 2023

MingLunHan / CIF-ColDec

[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection

23 3 Updated May 18, 2023

thuhcsi / Contextual-Biasing-Dataset

open-source Mandarian biased word dataset

10 Updated Sep 21, 2023

Lionelsy / Conference-Accepted-Paper-List

Some Conferences' accepted paper lists (including AI, ML, Robotic)

942 74 Updated Aug 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lishaojun412

Achievements

Achievements

Block or report lishaojun412

Stars

wdndev / llm_interview_note

X-LANCE / SLAM-LLM

lyogavin / airllm

FlagOpen / FlagEmbedding

naver / bergen

kvcache-ai / Mooncake

hsing-wang / Awesome-LLM-MT

lizhe2004 / Awesome-LLM-RAG-Application

Amiannn / Dancer

modelscope / FunASR