发明名称 METHOD AND APPARATUS FOR SPEECH RECONSTRUCTION IN A DISTRIBUTED SPEECH RECOGNITION SYSTEM
摘要 A method of reconstructing speech input at a communication device comprises receiving, at the communication device, encoded data that includes encoded spectral data and encoded energy data of the speech input, the encoded spectral data being encoded as a series of mel-frequency cepstral coefficients. The method further comprises decoding, at the communication device, the encoded spectral data and encoded energy data to determine the spectral data and energy data, wherein decoding comprises: performing an inverse discrete cosine transform on the mel-frequency cepstral coefficients at harmonic mel-frequencies corresponding to a pitch period of the speech input to determine log-spectral magnitudes of the speech input at the harmonic mel-frequencies, and exponentiating the log-spectral magnitudes to determine the spectral magnitudes of the speech input. The method also comprises combining the spectral data and energy data to reconstruct the speech input at the communication device. A communication device for use in distributed speech recognition system is also disclosed.
申请公布号 EP2945154(A1) 申请公布日期 2015.11.18
申请号 EP20150173401 申请日期 2002.01.18
申请人 MOTOROLA MOBILITY LLC 发明人 KUSHNER, WILLIAM M;MEUNIER, JEFFREY;JASIUK, MARK A;RAMABADRAN, TENKASI V.
分类号 G10L15/00;G10L15/28;G10L15/30;G10L19/00;G10L19/08;G10L19/093;G10L25/18 主分类号 G10L15/00
代理机构 代理人
主权项
地址