Provides a tool that generates different types of features (MFCC, PLP, spectrum) from audio files.

To create an MFCC cepstrum file using this program, go to the edu/cmu/sphinx/tools/feature directory and type:

ant -Dinput="input file" -Doutput="output file" cepstra_producer

To create a binary PLP cepstrum file using this program, type:

ant -Dinput="input file" -Doutput="output file" plp_producer

To create a binary spectra file, type:

ant -Dinput="input file" -Doutput="output file" spectra_producer
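When many files need to be processed, the three commands above can be wrapped in a small script. The following is only a sketch: the short labels ("mfcc", "plp", "spectrum") and the function names are hypothetical, and it assumes that all three targets are run from the edu/cmu/sphinx/tools/feature directory, as stated for the MFCC case.

```python
import subprocess

# Ant targets taken from the commands above, keyed by hypothetical short labels.
TARGETS = {
    "mfcc": "cepstra_producer",
    "plp": "plp_producer",
    "spectrum": "spectra_producer",
}


def build_ant_command(feature, input_path, output_path):
    """Return the ant argument list for one input/output pair."""
    return [
        "ant",
        f"-Dinput={input_path}",
        f"-Doutput={output_path}",
        TARGETS[feature],  # raises KeyError for an unknown label
    ]


def run_feature_dump(feature, input_path, output_path,
                     tool_dir="edu/cmu/sphinx/tools/feature"):
    """Run ant from the tool directory (assumed location of the build file).

    Raises subprocess.CalledProcessError if ant exits with an error.
    """
    command = build_ant_command(feature, input_path, output_path)
    subprocess.run(command, cwd=tool_dir, check=True)
```

For example, build_ant_command("plp", "a.wav", "a.plp") yields the same arguments as the PLP command shown above, with the two file names filled in.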

As you might notice by comparing the files cepstra_dump.props and spectra_dump.props in the "edu/cmu/sphinx/tools/feature" directory, the only difference in setup between dumping different types of features is the sequence of data processors specified in the properties file. If you give the tool a different data processor sequence, it produces different output.

Binary File Format

The first 4 bytes of the binary file are an integer giving the total number of data points in the file. A program that reads the file can use this value to check the file's endianness by comparing it with the file size. The rest of the file is simply the data points. Each data point is a 4-byte floating point number, in big-endian order.
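The layout described above can be read with a few lines of code. The following is a minimal sketch, not the reader used by the tool itself: the function names are hypothetical, the header is assumed to be a signed 32-bit integer, and the data points are assumed to share whichever byte order makes the header agree with the file size.

```python
import struct


def write_feature_file(path, values):
    """Write a header count followed by 4-byte floats, all big-endian."""
    with open(path, "wb") as f:
        f.write(struct.pack(">i", len(values)))
        f.write(struct.pack(f">{len(values)}f", *values))


def read_feature_file(path):
    """Return the data points in a feature file as a list of floats."""
    with open(path, "rb") as f:
        raw = f.read()
    if len(raw) < 4 or (len(raw) - 4) % 4 != 0:
        raise ValueError("file size does not match header + 4-byte data points")

    # Number of data points implied by the file size alone.
    expected = (len(raw) - 4) // 4

    # Try big-endian first (the documented order), then little-endian.
    for order in (">", "<"):
        (count,) = struct.unpack(order + "i", raw[:4])
        if count == expected:
            return list(struct.unpack(f"{order}{count}f", raw[4:]))

    raise ValueError("header count does not match file size in either byte order")
```

A file holding N data points should therefore be exactly 4 + 4 * N bytes long; any other size causes the sketch to raise ValueError rather than guess.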