发明名称 Class quantization for distributed speech recognition
摘要 A system, method and computer readable medium for quantizing class information and pitch information of audio is disclosed. The method on an information processing system includes receiving audio and capturing a frame of the audio. The method further includes determining a pitch of the frame and calculating a codeword representing the pitch of the frame, wherein a first codeword value indicates an indefinite pitch. The method further includes determining a class of the frame, wherein the class is any one of at least two classes indicating an indefinite pitch and at least one class indicating a definite pitch. The method further includes calculating a codeword representing the class of the frame, wherein the codeword length is the maximum of the minimum number of bits required to represent the at least two classes and the minimum number of bits required to represent the at least one class.
申请公布号 US2004158461(A1) 申请公布日期 2004.08.12
申请号 US20030360582 申请日期 2003.02.07
申请人 MOTOROLA, INC.;INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 RAMABADRAN TENKASI V.;SORIN ALEXANDER
分类号 G10L;G10L11/00;G10L11/04;G10L11/06;G10L15/28;G10L21/00;(IPC1-7):G10L11/04 主分类号 G10L
代理机构 代理人
主权项
地址