Title Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data
Abstract A multimedia search apparatus and method for searching multimedia content using speaker detection to segment the multimedia content. The multimedia search apparatus receives a search request from a user device. The search request identifies the target speaker for whom the search is to be conducted. Based on the search request, the multimedia search apparatus retrieves multimedia content from a multimedia database. The multimedia search apparatus also retrieves models, such as Gaussian Mixture Models (GMMs), corresponding to the target speaker and to background data from a model storage device. Based on the retrieved models, the multimedia search apparatus searches the audio data of the multimedia content and segments the audio data. The segments are identified by calculating an average normalized score for a block of frames of the audio data and determining whether the average normalized score for the block of frames exceeds one or more predetermined thresholds.
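The block-level thresholding step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that per-frame log-likelihoods under the target-speaker model and the background model are already available, that the "normalized score" is the difference between the two, and that a single threshold is used. The function names `block_average_scores` and `detect_segments`, and the parameters `block_size` and `threshold`, are hypothetical.

```python
from typing import List, Sequence, Tuple


def block_average_scores(
    target_ll: Sequence[float],
    background_ll: Sequence[float],
    block_size: int,
) -> List[float]:
    """Average the per-frame normalized score over consecutive blocks.

    Assumption: the normalized score of a frame is its target-model
    log-likelihood minus its background-model log-likelihood.
    """
    if len(target_ll) != len(background_ll):
        raise ValueError("score sequences must have the same length")
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    normalized = [t - b for t, b in zip(target_ll, background_ll)]
    averages = []
    for start in range(0, len(normalized), block_size):
        block = normalized[start:start + block_size]  # last block may be short
        averages.append(sum(block) / len(block))
    return averages


def detect_segments(
    target_ll: Sequence[float],
    background_ll: Sequence[float],
    block_size: int,
    threshold: float,
) -> List[Tuple[int, int]]:
    """Return (start_frame, end_frame) pairs, end exclusive, covering runs of
    consecutive blocks whose average normalized score exceeds the threshold."""
    averages = block_average_scores(target_ll, background_ll, block_size)
    segments: List[Tuple[int, int]] = []
    seg_start = None
    for i, avg in enumerate(averages):
        if avg > threshold:
            if seg_start is None:
                seg_start = i * block_size  # a new segment opens at this block
        elif seg_start is not None:
            segments.append((seg_start, i * block_size))
            seg_start = None
    if seg_start is not None:
        # The audio ended while a segment was still open.
        segments.append((seg_start, len(target_ll)))
    return segments


# Toy scores: the target model fits frames 2 through 5 better than the background.
target = [-5.0, -5.0, -1.0, -1.0, -1.0, -1.0, -5.0, -5.0]
background = [-3.0] * 8
print(detect_segments(target, background, block_size=2, threshold=0.5))
```

Averaging over a block before thresholding smooths frame-level noise, so a single poorly scoring frame does not split a segment. The abstract mentions "one or more" thresholds; the single-threshold version here is a simplification.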
Publication number US6317710(B1) Publication date 2001.11.13
Application number US19990353192 Filing date 1999.07.14
Applicant AT&T CORP. Inventors HUANG QIAN;MAGRIN-CHAGNOLLEAU IVAN;PARTHASARATHY SARANGARAJAN;ROSENBERG AARON EDWARD
Classification G10L17/00;(IPC1-7):G01L17/00;G01L15/10 Main classification G10L17/00
Agency Agent
Main claim
Address