发明名称 SPEECH DATABASE REGISTRATION PROCESSING METHOD, SPEECH GENERATION SOURCE RECOGNIZING METHOD, SPEECH GENERATION SECTION RETRIEVING METHOD, SPEECH DATABASE REGISTRATION PROCESSING DEVICE, SPEECH GENERATION SOURCE RECOGNIZING DEVICE, SPEECH GENERATION SECTION RETRIEVING DEVICE, PROGRAM THEREFOR, AND RECORDING MEDIUM FOR SAME PROGRAM
摘要 <p><P>PROBLEM TO BE SOLVED: To provide a means making it possible to precisely retrieve a speaking section of a desired speaker even when video and audio include a part wherein a plurality of speakers speak at the same time. <P>SOLUTION: In a speaker speech registration phase, not only feature quantities of the voice of a speaker himself/herself, but also feature quantities of a voice composed of speech signals of a plurality of speakers are extracted and registered in a speech database 1. In a speaker retrieval phase, an input speech signal to be retrieved is segmented into short sections and feature quantities of the respective short sections are collated with feature quantities in the speech database 1 to recognize speakers. In a speaking section determination phase, retrieval results of speakers of the respective short sections are totalized in every fixed number of short sections and speaking sections of the speakers are found according to appearance frequencies of the speakers. In a speaker information display phase, the retrieval results of the speaking section are displayed. <P>COPYRIGHT: (C)2004,JPO</p>
申请公布号 JP2004145161(A) 申请公布日期 2004.05.20
申请号 JP20020312074 申请日期 2002.10.28
申请人 NIPPON TELEGR & TELEPH CORP <NTT> 发明人 OSADA HIDENOBU;KOSUGI NAOKO
分类号 G10L15/06;G10L11/02;G10L15/00;G10L15/04;G10L17/00;(IPC1-7):G10L15/06 主分类号 G10L15/06
代理机构 代理人
主权项
地址