Skip to content
View lishaojun412's full-sized avatar

Block or report lishaojun412

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 2,830 329 Updated Aug 19, 2024

Speech, Language, Audio, Music Processing with Large Language Model

Python 509 42 Updated Sep 27, 2024

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 4,380 354 Updated Sep 25, 2024

Retrieval and Retrieval-augmented LLMs

Python 6,965 506 Updated Sep 26, 2024

Benchmarking library for RAG

Jupyter Notebook 89 8 Updated Sep 25, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

1,060 22 Updated Jul 31, 2024

the resources about the application based on LLM with RAG pattern

767 48 Updated Aug 30, 2024

Named Entity Correctior for ASR system.

Python 5 Updated Apr 4, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,121 653 Updated Sep 27, 2024

Enhancing Translation with RAG-Powered Large Language Models

Python 60 3 Updated Aug 11, 2024

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,818 224 Updated Sep 1, 2024

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Python 1,497 170 Updated Sep 12, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,085 3,802 Updated Sep 17, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,906 768 Updated Sep 25, 2024

speaker adaptation, ASR, personality

Python 2 Updated Mar 7, 2024
Python 7 Updated May 5, 2024

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 366 32 Updated Apr 24, 2024
Jupyter Notebook 62 2 Updated Sep 12, 2023

An awesome spoken LID repository. (Working in progress

Python 93 10 Updated Apr 22, 2024

Faster distil-whisper transcription with CTranslate2

Python 10 Updated Jan 23, 2024

语音方向实验室/公司/资源/实习等,欢迎推荐或自荐

489 62 Updated Sep 24, 2024

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python 8,341 2,396 Updated Aug 13, 2024

ESPnet Model Zoo

Python 241 39 Updated Jul 9, 2023

Repository for SLURP paper

Python 96 19 Updated Apr 20, 2022

Notebooks and various random fun

Jupyter Notebook 1,075 128 Updated Apr 18, 2023

[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection

23 3 Updated May 18, 2023

open-source Mandarian biased word dataset

10 Updated Sep 21, 2023

Some Conferences' accepted paper lists (including AI, ML, Robotic)

942 74 Updated Aug 14, 2024
Next