-
UFMT
- Cuiabá, Mato Grosso - Brazil
- https://www.fredso.com.br
- @fred_s0
Highlights
-
-
BRSpeech-Dataset Public
BRSpeech: A Portuguese Dataset for Speech Synthesis
-
CML-TTS-Toolkit Public
CML-TTS Conversion Tools
-
CML-TTS-Dataset Public
CML-TTS: A Multilingual Dataset for Speech Synthesis
-
katube Public
KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a list of YouTube playlists or YouTube channels, KATube will ge…
-
BSpeech-MOS-Prediction Public
A model for predicting MOS that utilizes embeddings of supervised learning and self-supervised learning models, combined with embeddings of speaker verification models, to predict the MOS metric.
-
-
useful_audio_scripts Public
Some useful scripts for audio
-
Multilingual-PL-BERT Public
Forked from yl4579/PL-BERTPhoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
Python MIT License UpdatedMay 23, 2024 -
Train_Hifigan_XTTS Public
Forked from tuanh123789/Train_Hifigan_XTTSThis is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.
Python UpdatedMay 21, 2024 -
xtts-webui Public
Forked from daswer123/xtts-webuiWebui for using XTTS and for finetuning it
Python MIT License UpdatedMay 9, 2024 -
-
whisper-diarization Public
Forked from MahmoudAshraf97/whisper-diarizationAutomatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Jupyter Notebook BSD 2-Clause "Simplified" License UpdatedMar 12, 2024 -
-
FullSubNet-plus Public
Forked from RookieJunChen/FullSubNet-plusThe official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
Python Apache License 2.0 UpdatedMar 7, 2024 -
VALL-E-X Public
Forked from 0417keito/VALL-E-X-Trainer-by-CustomDataAn open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
Python MIT License UpdatedJan 27, 2024 -
-
coqui-TTS Public
Forked from coqui-ai/TTS🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Python Mozilla Public License 2.0 UpdatedDec 11, 2023 -
-
distil-whisper Public
Forked from shuaijiang/distil-whisperDistilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Python MIT License UpdatedDec 5, 2023 -
tacotron2 Public
Forked from NVIDIA/tacotron2Tacotron 2 - PyTorch implementation with faster-than-realtime inference adapted for brazilian portuguese.
-
capybara_dataset Public
This is a dataset composed of images of capybaras to be used for training a model for object detection
-
-
-
audio-slicer Public
Forked from flutydeer/audio-slicerA simple GUI application that slices audio with silence detection
Python MIT License UpdatedAug 1, 2023 -
so-vits-svc Public
Forked from svc-develop-team/so-vits-svcSoftVC VITS Singing Voice Conversion
Python GNU Affero General Public License v3.0 UpdatedJul 19, 2023 -
YourTTS Public
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
whisper_torchserve Public
Forked from egochao/whisper_torchserveTorchserve build for Whisper-Speech to text model.
-
stable-diffusion-webui Public
Forked from AUTOMATIC1111/stable-diffusion-webuiStable Diffusion web UI
Python GNU Affero General Public License v3.0 UpdatedApr 12, 2023 -
TriAAN-VC Public
Forked from winddori2002/TriAAN-VCTriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
Python MIT License UpdatedMar 31, 2023