This repository was archived by the owner on Jan 20, 2022. It is now read-only.

INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation


mozilla/g2p


Steps

Dependencies

  1. A C++11 compiler with TR1 support (required by openfst)
  2. A Linux or OS X environment
  3. Install openfst from the deps/ folder (configure with --enable-far)
  4. Install phonetisaurus from the root folder
  5. Install mitlm from the deps/ folder (requires a Fortran compiler)
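
As a rough illustration of step 3, the sketch below wraps the openfst build in a small shell function. The folder name `deps/openfst` and the function name `build_openfst` are assumptions made for this example; only the `--enable-far` flag comes from the steps above. Adjust the path to match the actual contents of deps/.

```shell
#!/bin/sh
# Sketch only: build and install openfst with FAR support enabled.
# ASSUMPTION: the openfst sources sit in deps/openfst; the real folder
# name inside deps/ may differ (for example, it may carry a version suffix).
OPENFST_DIR=deps/openfst

# build_openfst is a hypothetical helper name, not part of this repository.
build_openfst() {
    # Use a subshell so the caller's working directory is left unchanged.
    (
        cd "$1" || exit 1
        # --enable-far is the flag required by the dependency list.
        # "make install" may need root privileges on your system.
        ./configure --enable-far && make && make install
    )
}

# Uncomment to run the build:
# build_openfst "$OPENFST_DIR"
```

The function returns a non-zero status if the folder is missing or any build step fails, so it can be chained with the remaining install steps.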

Install Sphinxbase (pay attention to the sphinxbase version)

  1. Sphinxbase: gecko-dev uses an outdated version of sphinxbase, so this specific commit (https://github.com/cmusphinx/sphinxbase/tree/18aec4d11c5fc724a15f899bc1222bfcfe589def) must be used. Models generated with any other version will be incompatible with Gecko's sphinxbase and will fail to load. Build with the traditional ./configure && make && make install
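
One possible way to pin the build to that commit is sketched below. The helper names `fetch_sphinxbase` and `build_sphinxbase` are made up for this example, and the `autogen.sh` fallback is an assumption about what a fresh git checkout needs before `./configure` exists; the commit hash and the build commands are taken from the step above.

```shell
#!/bin/sh
# Sketch only: fetch sphinxbase at the commit named above and build it.
SPHINXBASE_REPO=https://github.com/cmusphinx/sphinxbase.git
SPHINXBASE_COMMIT=18aec4d11c5fc724a15f899bc1222bfcfe589def

# Hypothetical helper: clone into the given folder and check out the pinned commit.
fetch_sphinxbase() {
    git clone "$SPHINXBASE_REPO" "$1" &&
    git -C "$1" checkout "$SPHINXBASE_COMMIT"
}

# Hypothetical helper: run the traditional build inside the given folder.
build_sphinxbase() {
    (
        cd "$1" || exit 1
        # ASSUMPTION: a git checkout may ship without a generated configure
        # script, in which case autogen.sh is expected to create it.
        if [ ! -x ./configure ]; then
            ./autogen.sh || exit 1
        fi
        # "make install" may need root privileges on your system.
        ./configure && make && make install
    )
}

# Uncomment to run:
# fetch_sphinxbase sphinxbase && build_sphinxbase sphinxbase
```

Checking out by full hash rather than by branch keeps the build reproducible even if the upstream repository moves on.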

Generating the models

  1. From inside the dicts/ folder, execute gen_arpa.sh for each language you want ARPA models to be built for
  2. The binary .dmp and lm.bin (for faster loading) files and their text version (model.arpa) are generated in the language's root folder
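
The two steps above could be scripted roughly as follows. This is a sketch under assumptions: it supposes each language has its own subfolder under dicts/ containing a gen_arpa.sh, and that the script runs under plain `sh`. The function name `gen_models` and the language folder name `en-US` in the usage line are hypothetical.

```shell
#!/bin/sh
# Sketch only: run gen_arpa.sh for each requested language folder.
# ASSUMPTION: the layout is dicts/<language>/gen_arpa.sh; verify this
# against the actual contents of dicts/ before relying on it.
gen_models() {
    dicts_dir=$1
    shift
    for lang in "$@"; do
        script="$dicts_dir/$lang/gen_arpa.sh"
        if [ ! -f "$script" ]; then
            # Warn and move on rather than aborting the whole run.
            echo "skipping $lang: $script not found" >&2
            continue
        fi
        # Run from the language folder so the outputs land in its root.
        # ASSUMPTION: the script is sh-compatible; swap in bash if it is not.
        ( cd "$dicts_dir/$lang" && sh ./gen_arpa.sh ) || return 1
    done
}

# Uncomment to run for one or more languages:
# gen_models dicts en-US
```

Languages without a gen_arpa.sh are skipped with a warning on stderr, while a failing script stops the loop so a broken model is not silently ignored.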
