eonglints

Follow

Dan Lyth eonglints

Follow

Research engineer working on something new. Previously leading speech research at @Stability-AI and Rockstar Games.

29 followers · 2 following

Achievements

Achievements

Stars

71 stars written in Python

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 68,048 8,039 Updated Sep 10, 2024

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 52,241 8,731 Updated Aug 14, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,219 6,380 Updated Sep 27, 2024

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 11,067 662 Updated Sep 19, 2024

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,325 2,160 Updated Sep 26, 2024

AntixK / PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 6,499 1,057 Updated Jun 13, 2024

tyiannak / pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 5,824 1,190 Updated Mar 31, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,482 383 Updated Sep 23, 2024

TensorSpeech / TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,808 810 Updated Jul 5, 2024

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,438 307 Updated Jan 4, 2024

lucidrains / musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,126 253 Updated Sep 6, 2023

jettify / pytorch-optimizer

torch-optimizer -- collection of optimizers for Pytorch

Python 3,020 295 Updated Mar 22, 2024

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 2,930 376 Updated Sep 4, 2024

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,384 255 Updated Jan 27, 2024

s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,218 482 Updated Sep 9, 2024

fatchord / WaveRNN

WaveRNN Vocoder + TTS

Python 2,129 698 Updated Jul 2, 2022

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 1,924 167 Updated Jun 12, 2023

jik876 / hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,916 504 Updated Jul 27, 2024

IgorSusmelj / pytorch-styleguide

An unofficial styleguide and best practices summary for PyTorch

Python 1,906 170 Updated Dec 28, 2021

iver56 / audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,818 187 Updated Sep 26, 2024

vsitzmann / siren

Official implementation of "Implicit Neural Representations with Periodic Activation Functions"

Python 1,735 247 Updated Jul 27, 2024

HIPS / Spearmint

Spearmint Bayesian optimization codebase

Python 1,545 327 Updated Dec 27, 2019

lowerquality / gentle

gentle forced aligner

Python 1,433 296 Updated Apr 25, 2024

magenta / mt3

MT3: Multi-Task Multitrack Music Transcription

Python 1,410 185 Updated Sep 23, 2024

microsoft / NeuralSpeech

Python 1,368 185 Updated Feb 11, 2024

jfilter / clean-text

🧹 Python package for text cleaning

Python 947 79 Updated May 9, 2023

keunwoochoi / kapre

kapre: Keras Audio Preprocessors

Python 920 146 Updated Oct 23, 2023

criteo / autofaiss

Automatically create Faiss knn indices with the most optimal similarity search parameters.

Python 803 74 Updated May 21, 2024

lmnt-com / diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 756 111 Updated Mar 26, 2024

csteinmetz1 / auraloss

Collection of audio-focused loss functions in PyTorch

Python 719 66 Updated Jul 30, 2024