forked from mozilla/g2p

Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation


navneeth1990/g2p

 
 

Steps

Dependencies

  1. A C++11 compiler with TR1 support (required by openfst)
  2. A Linux or OS X environment
  3. Install openfst from the deps/ folder (configure with --enable-far)
  4. Install phonetisaurus from the root folder
  5. Install mitlm from the deps/ folder (requires a Fortran compiler)
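
A quick preflight check can catch a missing toolchain before the three builds start. The sketch below is not part of this repository: the function names are hypothetical, and it assumes the C++ compiler is reachable as g++ or clang++ and the Fortran compiler as gfortran. It does not verify C++11 or TR1 support, only that the tools are on PATH.

```shell
#!/bin/sh
# Preflight sketch (hypothetical helper, not shipped with this repo).

# Succeeds only on Linux or OS X (Darwin), the environments listed above.
supported_os() {
    case "$(uname -s)" in
        Linux|Darwin) return 0 ;;
        *) return 1 ;;
    esac
}

# Succeeds if at least one of the given commands is on PATH.
have_any() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 && return 0
    done
    return 1
}

# Reports every missing prerequisite and returns non-zero if any is missing.
preflight() {
    rc=0
    supported_os || { echo "unsupported OS: $(uname -s)"; rc=1; }
    # Assumption: the compilers are installed under these common names.
    have_any g++ clang++ || { echo "no C++ compiler found (looked for g++, clang++)"; rc=1; }
    have_any gfortran || { echo "no Fortran compiler found (looked for gfortran)"; rc=1; }
    return $rc
}

preflight || echo "fix the items above before building openfst, phonetisaurus and mitlm"
```

If the check passes silently, the openfst, phonetisaurus and mitlm installs can proceed in the order listed.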

Install Sphinxbase (pay attention to the sphinxbase version)

  1. Sphinxbase: gecko-dev uses an outdated version of sphinxbase, so this specific commit must be used: https://github.com/cmusphinx/sphinxbase/tree/18aec4d11c5fc724a15f899bc1222bfcfe589def. Otherwise the generated models will be incompatible with Gecko's sphinxbase and will fail to load. Build with the traditional ./configure && make && make install.
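
The pinned build above might be scripted roughly as follows. This is a sketch under assumptions: the run() wrapper and DRY_RUN variable are illustrative additions that only print the commands by default, and the ./autogen.sh step assumes a fresh git checkout has no generated ./configure yet. Skip that step if ./configure is already present.

```shell
#!/bin/sh
# Sketch: fetch sphinxbase at the commit pinned above and build it.
# By default the commands are only printed; set DRY_RUN=0 to execute them.
SPHINXBASE_REPO=https://github.com/cmusphinx/sphinxbase
SPHINXBASE_COMMIT=18aec4d11c5fc724a15f899bc1222bfcfe589def

# Print the command in dry-run mode, otherwise run it in the current shell.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

build_sphinxbase() {
    run git clone "$SPHINXBASE_REPO" sphinxbase || return 1
    run cd sphinxbase || return 1
    run git checkout "$SPHINXBASE_COMMIT" || return 1
    # Assumption: the checkout needs its configure script generated first.
    run ./autogen.sh || return 1
    run ./configure || return 1
    run make || return 1
    # May need elevated privileges, depending on the install prefix.
    run make install || return 1
}

build_sphinxbase
```

Running the script with DRY_RUN=0 in the environment performs the real clone and build. Pinning by commit hash rather than by branch keeps the build reproducible even if the upstream repository moves on.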

Generating the models

  1. From inside the dicts/ folder, run gen_arpa.sh for each language for which you want ARPA models built
  2. The binary .dmp and lm.bin files (for faster loading) and their text version (model.arpa) are generated in the language's root folder
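
To build several languages in one pass, the two steps above can be wrapped in a loop. The sketch below rests on assumptions about the layout: it supposes each language has its own folder under dicts/ containing an executable gen_arpa.sh that takes no arguments. The gen_models function name is hypothetical; adjust the paths if the repository is organized differently.

```shell
#!/bin/sh
# Sketch: run gen_arpa.sh for selected languages and confirm model.arpa appears.
# Assumption: dicts/<language>/gen_arpa.sh exists, is executable, and takes
# no arguments.

# Usage: gen_models <dicts-dir> <language>...
gen_models() {
    dicts_dir=$1
    shift
    for lang in "$@"; do
        lang_dir="$dicts_dir/$lang"
        if [ ! -f "$lang_dir/gen_arpa.sh" ]; then
            echo "skip $lang: no gen_arpa.sh in $lang_dir"
            continue
        fi
        # Run in a subshell so the caller's working directory is unchanged.
        (cd "$lang_dir" && ./gen_arpa.sh) || { echo "fail $lang"; continue; }
        # model.arpa is the text model named in step 2 above.
        if [ -f "$lang_dir/model.arpa" ]; then
            echo "ok $lang"
        else
            echo "warn $lang: model.arpa not found"
        fi
    done
}
```

For example, `gen_models dicts en-US` would process a language folder named en-US (a hypothetical name) and print one status line for it.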
