Title: SYSTEM AND METHOD FOR IMPROVING SPEAKER SEGMENTATION AND RECOGNITION ACCURACY IN A MEDIA PROCESSING ENVIRONMENT
Abstract: A method is provided that includes estimating an approximate list of potential speakers in a file from one or more applications. The file (e.g., an audio file, a video file, or any suitable combination thereof) includes a recording of a plurality of speakers. The method also includes segmenting the file according to the approximate list of potential speakers such that each segment corresponds to at least one speaker, and recognizing particular speakers in the file based on the approximate list of potential speakers.
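The three steps named in the abstract (estimate a candidate list, segment, recognize) can be illustrated with a toy sketch. This is not the patented implementation: it assumes each speaker is represented by a single enrolled feature vector, that each audio frame is already a feature vector, and that the "applications" supply plain lists of names (for example, a meeting invite). The names `estimate_candidates`, `segment_and_recognize`, `Segment`, and all sample data are hypothetical.

```python
from dataclasses import dataclass
from math import dist


@dataclass
class Segment:
    start: int    # index of the first frame in the segment (inclusive)
    end: int      # index one past the last frame (exclusive)
    speaker: str  # candidate speaker assigned to the segment


def estimate_candidates(app_records, enrolled_models):
    """Build the approximate speaker list from application data.

    Keeps only names that have an enrolled model, in first-seen order.
    """
    names = []
    for record in app_records:
        for name in record:
            if name in enrolled_models and name not in names:
                names.append(name)
    return names


def segment_and_recognize(frames, candidates, enrolled_models):
    """Label each frame with the nearest candidate, then merge runs.

    Only speakers on the candidate list are considered, which is the
    point of estimating that list first.
    """
    if not candidates:
        raise ValueError("candidate list is empty")
    segments = []
    for i, frame in enumerate(frames):
        best = min(candidates, key=lambda n: dist(frame, enrolled_models[n]))
        if segments and segments[-1].speaker == best:
            segments[-1].end = i + 1  # extend the current segment
        else:
            segments.append(Segment(i, i + 1, best))
    return segments


# Hypothetical data: three enrolled speakers, two application records.
models = {"alice": (0.0, 0.0), "bob": (1.0, 1.0), "carol": (0.1, 0.1)}
app_records = [["alice", "bob"], ["bob", "dave"]]
frames = [(0.1, 0.0), (0.05, 0.1), (0.9, 1.0), (1.0, 0.8)]

candidates = estimate_candidates(app_records, models)
segments = segment_and_recognize(frames, candidates, models)
recognized = sorted({s.speaker for s in segments})
```

In this toy data the second frame lies closer to the enrolled model for "carol" than to the one for "alice", but "carol" appears in no application record, so she is never considered. Restricting the search to the candidate list is what keeps that frame attributed to "alice".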
Publication number: US2014074471(A1) Publication date: 2014.03.13
Application number: US201213608420 Filing date: 2012.09.10
Applicant: SANKAR ANANTH;KAJAREKAR SACHIN;GANNU SATISH K.;CISCO TECHNOLOGY, INC. Inventor: SANKAR ANANTH;KAJAREKAR SACHIN;GANNU SATISH K.
Classification: G10L17/00 Main classification: G10L17/00
Agency: Agent:
Main claim:
Address: