摘要 |
<p>A method and system for creating a compressed data representation of a human speech utterance which may be utilized to accurately regenerate the human speech utterance. First, the location and occurrence of each period of silence, voiced sound and unvoiced sound within the speech utterance is detected. Next, a single representative data frame which may be repetitively utilized to approximate each voiced sound is iteratively determined, along with the duration of each voiced sound. The spectral content of each unvoiced sound, along with variations in the amplitude thereof is also determined. A compressed data presentation is then created which includes encoded representations of a duration of each period of silence, a duration and single representative data frame for each voiced sound and a spectral content and amplitude variations for each unvoiced sound. The compressed data representation may then be utilized to regenerate the speech utterance without substantial loss in intelligibility. <IMAGE></p> |