Classifying Stuttering Events with Sep28K

This repository contains code for preprocessing the Sep28k corpus and training a model for both binary (fluency/disfluency) and multiclass ('Prolongation', 'Block', 'SoundRep', 'WordRep', 'Interjection', 'NoStutteredWords') classification.

The user is assumed to have downloaded the relevant files from the original Sep28k repo.

Install required libraries:

pip install requirements.txt

The first step is to run the following:

python preprocess.py

This returns a dictionary containing F0, MFB, and wav2vec 2.0 features, and dictionaries for the labels and audio file paths.

To train a model on these features:

python train.py --model --batch_size --num_epochs

This train.py allows one to select a model from the models.py file and generates the dataset from the dataset.py file.

Available models include: ConvLSTM(), LSTM_base(), ResNet()

Following training the utils.py contains a plotting function to show the binary and multiclass losses and F1 scores.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Classifying Stuttering Events with Sep28K

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
dataset.py		dataset.py
model_card.md		model_card.md
models.py		models.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
train.py		train.py
utils.py		utils.py

sfoley3/sep28k_stuttering_detection

Folders and files

Latest commit

History

Repository files navigation

Classifying Stuttering Events with Sep28K

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages