发明名称 Pitch quantization for distributed speech recognition
摘要 A system, method and computer readable medium for quantizing pitch information of audio is disclosed. The method includes capturing audio representing a numbered frame of a plurality of numbered frames. The method further includes calculating a class of the frame, wherein a class is any one of a voiced or unvoiced class. If the frame is a voiced class, a pitch is calculated for the frame. If the frame is an even numbered frame and a voiced class, a codeword of a first length is calculated by absolutely quantizing the frame pitch. If the frame is an odd numbered frame and a voiced class and a reliable frame is available, a codeword of a second length is calculated by differentially quantizing the frame pitch. If there is no reliable frame available, a codeword of the second length is calculated by absolutely quantizing the frame pitch.
申请公布号 US6915256(B2) 申请公布日期 2005.07.05
申请号 US20030360581 申请日期 2003.02.07
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 RAMABADRAN TENKASI V.;SORIN ALEXANDER
分类号 G10L15/28;G10L19/08;(IPC1-7):G10L11/04 主分类号 G10L15/28
代理机构 代理人
主权项
地址