Skip to content

haloha123/ACSSR

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

Learning Adapters for Code-Switching Speech Recognition

This repository is developed under huggingface Framework

Requirements

python 3.8.10

  • datasets >= 1.18.0
  • torch >= 1.5
  • torchaudio
  • librosa
  • jiwer
  • evaluate
  • numpy
  • pandas
  • jieba
  • editdistance
  • tensorboard
  • fairscale
  • seaborn
  • accelerate
  • spacy

Installatation

  1. Install huggingface
> cd transformers
> pip install -e .

dataset and pretrained weight

  • dataset
MLLAB-public (\\mllab.asuscomm.com)W:\Chun-Yi_He\ASR_data\NTUT\dataset_NTUT
MLLAB-public (\\mllab.asuscomm.com)W:\Chun-Yi_He\ASR_data\ASCEND
  • pretrained weight
MLLAB-public (\\mllab.asuscomm.com)W:\Chun-Yi_He\pretrained_weight

FIle structure

|_ /ASCEND/
  |_ dataset_NTUT (NTUT AB01 dataset)  
  |_ waves (ASCEND dataset)
  |_ pretrained_weight

Model training

> cd examples/pytorch/speech-recognition/ASCEND/
> pip install -r requirements.txt
> bash run_train.sh

Model inference

python inference.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.9%
  • Jupyter Notebook 0.5%
  • Cuda 0.4%
  • Shell 0.2%
  • Dockerfile 0.0%
  • C++ 0.0%