Segmentation technique increasing the active vocabulary of speech recognizers
摘要
The present invention relates to a speech recognition system and a method executed by a speech recognition system focusing on the vocabulary of said speech recognition system and its usage during the speech recognition process. A segmented vocabulary and its exploitation is suggested comprising a multitude of entries an entry being either identical to a legal word or a constituent of a legal word of said language and said constituent being an arbitrary sub-component of said legal word according to the orthography. A constituent can comprise any number of characters not limited to a syllable of a legal word or a recognition-unit of the speech recognition system. Said vocabulary is used to recognize constituents of said vocabulary for recombination of said constituents into legal words if a constituent combination table indicates that said recognized constituents are legal concatenation in said language. <IMAGE>