发明名称 Apparatus for performing speaker identification and speaker searching in speech or sound image data, and method thereof
摘要 A process of identifying a speaker in coded speech data and a process of searching for the speaker are efficiently performed with fewer computations and with a smaller storage capacity. In an information search apparatus, an LSP decoding section extracts and decodes only LSP information from coded speech data which is read for each block. An LPC conversion section converts the LSP information into LPC information. A Cepstrum conversion section converts the obtained LPC information into an LPC Cepstrum which represents features of speech. A vector quantization section performs vector quantization on the LPC Cepstrum. A speaker identification section identifies a speaker on the basis of the result of the vector quantization. Furthermore, the identified speaker is compared with a search condition in a condition comparison section, and based on the result, the search result is output.
申请公布号 US7315819(B2) 申请公布日期 2008.01.01
申请号 US20020201069 申请日期 2002.07.23
申请人 SONY CORPORATION 发明人 TOGURI YASUHIRO;NISHIGUCHI MASAYUKI
分类号 G10L17/00;G10L15/02;G10L15/10;G10L19/04;H03M7/30;H03M7/36 主分类号 G10L17/00
代理机构 代理人
主权项
地址