发明名称 |
CLASS QUANTIZATION FOR DISTRIBUTED SPEECH RECOGNITION |
摘要 |
A system, method and computer readable medium for quantizing class information and pitch information of audio is disclosed. The method on an information processing system includes receiving audio and capturing a frame of the audio. The method further includes determining a pitch of the frame and calculating a codeword representing the pitch of the frame, wherein a first codeword value indicates an indefinite pitch. The method further includes determining a class of the frame, wherein the class is any one of at least two classes indicating an indefinite pitch and at least one class indicating a definite pitch. The method further includes calculating a codeword representing the class of the frame, wherein the codeword length is the maximum of the minimum number of bits required to represent the at least two classes and the minimum number of bits required to represent the at least one class. |
申请公布号 |
EP1595249(A2) |
申请公布日期 |
2005.11.16 |
申请号 |
EP20040708622 |
申请日期 |
2004.02.05 |
申请人 |
MOTOROLA, INC.;INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
RAMABADRAN, TENKASI, V.;SORIN, ALEXANDER |
分类号 |
G10L25/93;G10L;G10L11/00;G10L11/04;G10L11/06;G10L15/28;G10L15/30;G10L21/00;G10L25/90 |
主分类号 |
G10L25/93 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|