发明名称 PITCH QUANTIZATION FOR DISTRIBUTED SPEECH RECOGNITION
摘要 A system, method and computer readable medium for quantizing pitch information of audio is disclosed. The method includes capturing audio representing a numbered frame of a plurality of numbered frames. The method further includes calculating a class of the frame, wherein a class is any one of a voiced or unvoiced class. If the frame is a voiced class, a pitch is calculated for the frame (903). If the frame is an even numbered frame and a voiced class, a codeword of first length is calculated by absolutely quantizing the frame pitch (910). If the frame is an odd numbered frame and a voiced class and a reliable frame is available, a codeword of a second length is calculated by differentially quantizing the frame pitch (905). If there is no reliable frame available, a codeword of the second length is calculated by absolutely quantizing the frame pitch.
申请公布号 KR20050097929(A) 申请公布日期 2005.10.10
申请号 KR20057012455 申请日期 2005.06.30
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION;MOTOROLA INC. 发明人 RAMABADRAN TENKASI V.;SORIN ALEXANDER
分类号 G10L15/28;G10L19/08;(IPC1-7):G10L11/04;G10L11/00 主分类号 G10L15/28
代理机构 代理人
主权项
地址