发明名称 APPARATUS AND METHOD FOR DETECTING INFORMATION
摘要 <p>PROBLEM TO BE SOLVED: To reduce recognition errors in a part sufficiently including a mixed voice due to a background noise and a plurality of speakers when the statistical speech frequency of the speakers in a AV data is detected. SOLUTION: In an information detecting apparatus 10, a voice signal D11 of the AV data inputted from an inputting part 11 is LPC-analyzed by a LPC analyzing part 12. A LPC coefficient of a block determined as a voiced sound block by a voiced sound determining part 14 is inputted to a cepstrum converting part 17 and converted into a LPC cepstrum coefficient. The LPC cepstrum coefficient D12 is vector-quantized by a vector-quantizating part 18. A quantization distortion D18 is inputted to and evaluated by a speaker identifying part 19 for identifying and determining the speaker per the predetermined recognition block. The identified speaker D20 is inputted to a part for calculating the frequency of determining the speaker 20. The part 20 calculates the frequency of determining the speaker respectively recognized in an interval per the predetermined evaluating interval and outputs as frequency information of appearance of the speaker D21.</p>
申请公布号 JP2003036087(A) 申请公布日期 2003.02.07
申请号 JP20010225050 申请日期 2001.07.25
申请人 SONY CORP 发明人 TOKURI YASUHIRO;NISHIGUCHI MASAYUKI
分类号 G10L11/06;G10L15/00;G10L15/02;G10L15/06;G10L15/10;G10L17/00;(IPC1-7):G10L11/06 主分类号 G10L11/06
代理机构 代理人
主权项
地址