Sphinx-4 Transcriber Demo
A simple Sphinx-4 application that transcribes a continuous audio file
that has multiple utterances. The audio file should contain connected
digits data. The default file, called "10001-90210-01803.wav", contains
three utterances, separated by silences.
To transcribe non-digits data, modify the config.xml file to use the
appropriate grammar, language model, and linguist. Refer to the
Programmer's Guide for how to modify the configuration file for your
purposes.
Check whether the bin directory already contains the Transcriber.jar file.
If not, type the following in the top-level directory:
ant -find demo.xml
First, make sure that JSAPI is set up correctly. Then, to run the demo, type:
sphinx4> java -jar bin/Transcriber.jar
You will see the following result, with each utterance on its own line:
one zero zero zero one
nine oh two one oh
zero one eight zero three
NOTE: Make sure that lib/sphinx4.jar is built. If not, go to the top-level
directory and type:
ant
To transcribe a different audio file, for example test.wav, type:
java -jar bin/Transcriber.jar test.wav
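
For the default file, the expected transcription can be sanity-checked against the file name, since "10001-90210-01803" appears to spell out the three digit strings that are spoken. The following is a minimal sketch of such a check, not part of the demo itself: the class name DigitTranscript, its methods, and the word-to-digit table (including treating both "oh" and "zero" as 0) are assumptions made for illustration and do not come from the Sphinx-4 API.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.StringJoiner;

// Hypothetical helper (not part of Sphinx-4): turns transcribed digit words
// back into digit strings so they can be compared with the audio file name.
class DigitTranscript {

    // Assumed vocabulary for connected-digits output; "oh" and "zero" both mean 0.
    private static final Map<String, Character> WORD_TO_DIGIT = new HashMap<>();
    static {
        WORD_TO_DIGIT.put("oh", '0');
        WORD_TO_DIGIT.put("zero", '0');
        WORD_TO_DIGIT.put("one", '1');
        WORD_TO_DIGIT.put("two", '2');
        WORD_TO_DIGIT.put("three", '3');
        WORD_TO_DIGIT.put("four", '4');
        WORD_TO_DIGIT.put("five", '5');
        WORD_TO_DIGIT.put("six", '6');
        WORD_TO_DIGIT.put("seven", '7');
        WORD_TO_DIGIT.put("eight", '8');
        WORD_TO_DIGIT.put("nine", '9');
    }

    // Converts one transcribed utterance, e.g. "nine oh two one oh", to "90210".
    static String toDigits(String utterance) {
        StringBuilder digits = new StringBuilder();
        for (String word : utterance.trim().split("\\s+")) {
            Character digit = WORD_TO_DIGIT.get(word.toLowerCase());
            if (digit == null) {
                throw new IllegalArgumentException("Not a digit word: " + word);
            }
            digits.append(digit);
        }
        return digits.toString();
    }

    // Joins the digit strings of several utterances with '-' in the order given.
    static String toFileStem(String... utterances) {
        StringJoiner stem = new StringJoiner("-");
        for (String utterance : utterances) {
            stem.add(toDigits(utterance));
        }
        return stem.toString();
    }

    public static void main(String[] args) {
        // The three utterances from the expected demo output.
        System.out.println(toFileStem(
                "one zero zero zero one",
                "nine oh two one oh",
                "zero one eight zero three")); // prints 10001-90210-01803
    }
}
```

If the printed string matches the stem of the audio file name, the transcription agrees with the default recording. A word outside the assumed table raises an IllegalArgumentException, which makes a misrecognized or non-digit word easy to spot.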