The Sphinx-4 audio endpointer is composed of three data processors that carry out different functions:
SpeechClassifier - classifies chunks of audio into speech and non-speech.
SpeechMarker - divides the audio stream into speech and non-speech regions by inserting SpeechStartSignals and SpeechEndSignals into the data stream, leaving a 'cushion' of extra audio around each speech region.
NonSpeechDataFilter - removes the non-speech regions from the audio.
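
The three stages above can be illustrated with a small, self-contained sketch. This is not the Sphinx-4 implementation and does not use the Sphinx-4 API: the class EndpointerSketch, its methods, the energy-threshold rule, the string tokens "START" and "END", and the cushion measured in frames are all assumptions made for illustration. It only shows how classification, marking with a cushion, and filtering might compose.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical toy pipeline; names and logic are illustrative, not Sphinx-4's.
public class EndpointerSketch {

    // Stage 1 (classifier role): label a frame as speech when its energy
    // exceeds a fixed threshold. A simple threshold is an assumption here.
    static boolean[] classify(double[] frameEnergies, double threshold) {
        boolean[] isSpeech = new boolean[frameEnergies.length];
        for (int i = 0; i < frameEnergies.length; i++) {
            isSpeech[i] = frameEnergies[i] > threshold;
        }
        return isSpeech;
    }

    // Stage 2 (marker role): widen each speech region by `cushion` frames on
    // both sides, then insert "START" and "END" tokens around each region.
    static List<String> mark(boolean[] isSpeech, int cushion) {
        int n = isSpeech.length;
        boolean[] keep = new boolean[n];
        for (int i = 0; i < n; i++) {
            if (isSpeech[i]) {
                int from = Math.max(0, i - cushion);
                int to = Math.min(n - 1, i + cushion);
                for (int j = from; j <= to; j++) {
                    keep[j] = true;
                }
            }
        }
        List<String> out = new ArrayList<>();
        boolean inSpeech = false;
        for (int i = 0; i < n; i++) {
            if (keep[i] && !inSpeech) {
                out.add("START");
                inSpeech = true;
            } else if (!keep[i] && inSpeech) {
                out.add("END");
                inSpeech = false;
            }
            out.add("frame" + i);
        }
        if (inSpeech) {
            out.add("END"); // close a region that runs to the end of the stream
        }
        return out;
    }

    // Stage 3 (filter role): drop every frame that lies outside a
    // START..END region. This sketch keeps the markers themselves.
    static List<String> filter(List<String> marked) {
        List<String> out = new ArrayList<>();
        boolean inSpeech = false;
        for (String item : marked) {
            if (item.equals("START")) {
                inSpeech = true;
                out.add(item);
            } else if (item.equals("END")) {
                inSpeech = false;
                out.add(item);
            } else if (inSpeech) {
                out.add(item);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        double[] energies = {0.1, 0.1, 0.1, 0.9, 0.8, 0.1, 0.1, 0.1};
        List<String> result = filter(mark(classify(energies, 0.5), 1));
        System.out.println(result);
        // prints [START, frame2, frame3, frame4, frame5, END]
    }
}
```

With a cushion of one frame, the two high-energy frames (3 and 4) are padded with frames 2 and 5, and everything outside that region is discarded by the final stage.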