Reverb

Open source inference and evaluation code for Rev's state-of-the-art speech recognition and diarization models. The speech recognition (ASR) code uses the WeNet framework and the speech diarization code uses the Pyannote framework. More detailed model descriptions can be found in our blog and the models can be downloaded from huggingface.

Installation

We recommend using a virtual environment with a tool such as anaconda. You might need to set your HUGGINGFACE_ACCESS_TOKEN as well since the model itself (ASR and diarization) is downloaded from the HF hub.

conda create -n reverb-env python=3.10
conda activate reverb-env

Then, in the root directory of this repository,

pip install -r asr/requirements.txt
pip install -r diarization/requirements.txt
export PYTHONPATH="$(pwd)"/asr:$PYTHONPATH  # adding this to make wenet/ work

To get the model files, make sure that git lfs is correctly installed on your system and clone the models from huggingface.

git lfs install
git clone https://huggingface.co/Revai/reverb-asr

Docker Image

Alternatively, you can use Docker to run ASR and/or diarization without needing to install dependencies (including the model files). directly on your system. First, make sure Docker is installed on your system. If you wish to run on NVIDIA GPU, more steps might be required. Then, run the following command to build the Docker image:

docker build -t reverb . --build-arg HUGGINGFACE_ACCESS_TOKEN=${YOUR_HUGGINGFACE_ACCESS_TOKEN}

And to run docker

sudo docker run --entrypoint "/bin/bash" --gpus all --rm -it reverb

Hosting the Model

If your usecase requires a to deploy these models at a larger scale and maintaining strict security requirements, consider using our other release: https://github.com/revdotcom/reverb-self-hosted. This setup will give you full control over the deployment of our models on your own infrastructure without the need for internet connectivity or cloud dependencies.

License

The license in this repository applies only to the code not the models. See LICENSE for details. For model licenses, check out their pages on HuggingFace.

Citations

If you make use of this model, please cite this paper

@article{bhandari2024reverb,
  title={Reverb: Open-Source ASR and Diarization from Rev},
  author={Bhandari, Nishchal and Chen, Danny and del Río Fernández, Miguel Ángel and Delworth, Natalie and Fox, Jennifer Drexler and Jetté, Miguel and McNamara, Quinten and Miller, Corey and Novotný, Ondřej and Profant, Ján and Qin, Nan and Ratajczak, Martin and Robichaud, Jean-Philippe},
  journal={arXiv preprint arXiv:2410.03930},
  year={2024}
}

Contributors

Nishchal Bhandari, Danny Chen, Miguel Del Rio, Natalie Delworth, Jennifer Drexler Fox, Miguel Jette, Quinn McNamara, Corey Miller, Ondrej Novotny, Jan Profant, Nan Qin, Martin Ratajczak, and Jean-Philippe Robichaud.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
asr		asr
diarization		diarization
resources		resources
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md

Model	Earnings21	Earnings22	Rev16
Reverb ASR	9.68	13.68	10.30
Whisper Large-v3	14.26	19.05	10.86
Canary-1B	14.40	19.01	13.82

Model	Earnings21	Rev16
Pyannote3.0	0.051	0.090
Reverb Diarization V1	0.047	0.077
Reverb Diarization V2	0.046	0.078

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reverb

Table of Contents

ASR

Diarization

Installation

Docker Image

Hosting the Model

License

Citations

Contributors

About

Releases

Packages

Contributors 6

Languages

License

revdotcom/reverb

Folders and files

Latest commit

History

Repository files navigation

Reverb

Table of Contents

ASR

Diarization

Installation

Docker Image

Hosting the Model

License

Citations

Contributors

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 6

Languages

Packages