eonglints
71 stars written in Python

Robust Speech Recognition via Large-Scale Weak Supervision

Python 68,048 8,039 Updated Sep 10, 2024

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 52,241 8,731 Updated Aug 14, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,219 6,380 Updated Sep 27, 2024

Machine Learning Engineering Open Book

Python 11,067 662 Updated Sep 19, 2024

End-to-End Speech Processing Toolkit

Python 8,325 2,160 Updated Sep 26, 2024

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 6,499 1,057 Updated Jun 13, 2024

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 5,824 1,190 Updated Mar 31, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,482 383 Updated Sep 23, 2024

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,808 810 Updated Jul 5, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,438 307 Updated Jan 4, 2024

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,126 253 Updated Sep 6, 2023

torch-optimizer -- collection of optimizers for Pytorch

Python 3,020 295 Updated Mar 22, 2024

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 2,930 376 Updated Sep 4, 2024

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,384 255 Updated Jan 27, 2024

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,218 482 Updated Sep 9, 2024

WaveRNN Vocoder + TTS

Python 2,129 698 Updated Jul 2, 2022

Audio generation using diffusion models, in PyTorch.

Python 1,924 167 Updated Jun 12, 2023

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,916 504 Updated Jul 27, 2024

An unofficial styleguide and best practices summary for PyTorch

Python 1,906 170 Updated Dec 28, 2021

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,818 187 Updated Sep 26, 2024

Official implementation of "Implicit Neural Representations with Periodic Activation Functions"

Python 1,735 247 Updated Jul 27, 2024

Spearmint Bayesian optimization codebase

Python 1,545 327 Updated Dec 27, 2019

gentle forced aligner

Python 1,433 296 Updated Apr 25, 2024

MT3: Multi-Task Multitrack Music Transcription

Python 1,410 185 Updated Sep 23, 2024
Python 1,368 185 Updated Feb 11, 2024

🧹 Python package for text cleaning

Python 947 79 Updated May 9, 2023

kapre: Keras Audio Preprocessors

Python 920 146 Updated Oct 23, 2023

Automatically create Faiss knn indices with the most optimal similarity search parameters.

Python 803 74 Updated May 21, 2024

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 756 111 Updated Mar 26, 2024

Collection of audio-focused loss functions in PyTorch

Python 719 66 Updated Jul 30, 2024