
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 2,930 376 Updated Sep 4, 2024

logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even when there are many audio tracks or stems.

Python 32 3 Updated Sep 11, 2024

Automatically create Faiss knn indices with the most optimal similarity search parameters.

Python 803 74 Updated May 21, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 30,664 3,572 Updated Sep 26, 2024

🧹 Python package for text cleaning

Python 947 79 Updated May 9, 2023

AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)

Python 93 6 Updated Sep 4, 2024

A browser extension that enhances search engines with ChatGPT

TypeScript 587 62 Updated Jul 3, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,482 383 Updated Sep 23, 2024

Machine Learning Engineering Open Book

Python 11,067 662 Updated Sep 19, 2024

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 3,795 206 Updated Jun 18, 2024

simple trainer for musicgen/audiocraft

Python 15 1 Updated Jul 14, 2023

Manage audio and video databases

Python 23 1 Updated Aug 22, 2024

Pitch Estimating Neural Networks (PENN)

Python 229 21 Updated Jul 31, 2024

A Python toolbox for speech features extraction

Python 158 21 Updated Feb 8, 2023

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in PyTorch

Python 3,126 253 Updated Sep 6, 2023

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,818 187 Updated Sep 26, 2024

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch

Python 2,384 255 Updated Jan 27, 2024

Audio generation using diffusion models, in PyTorch.

Python 1,924 167 Updated Jun 12, 2023

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,438 307 Updated Jan 4, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 68,042 8,038 Updated Sep 10, 2024

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Python 323 53 Updated Jul 12, 2023

Implementation of DiffWave and SaShiMi audio generation models

Python 112 13 Updated Apr 4, 2023

Large, modern dataset for speech recognition

Shell 631 62 Updated Feb 26, 2024
Python 460 45 Updated Jun 25, 2024
Python 1,368 185 Updated Feb 11, 2024

Collection of audio-focused loss functions in PyTorch

Python 719 66 Updated Jul 30, 2024

Structured state space sequence models

Jupyter Notebook 2,381 285 Updated Jul 17, 2024

Performant and accurate speech recognition built on PyTorch

Python 246 26 Updated May 19, 2022

A library for speech data augmentation in time-domain

Python 635 57 Updated Aug 30, 2021

Library for Textless Spoken Language Processing

Python 523 50 Updated Aug 29, 2023