发明名称 |
SYSTEM AND METHOD FOR IMPROVING SPEAKER SEGMENTATION AND RECOGNITION ACCURACY IN A MEDIA PROCESSING ENVIRONMENT |
摘要 |
A method is provided and includes estimating an approximate list of potential speakers in a file from one or more applications. The file (e.g., an audio file, video file, or any suitable combination thereof) includes a recording of a plurality of speakers. The method also includes segmenting the file according to the approximate list of potential speakers such that each segment corresponds to at least one speaker; and recognizing particular speakers in the file based on the approximate list of potential speakers. |
申请公布号 |
US2014074471(A1) |
申请公布日期 |
2014.03.13 |
申请号 |
US201213608420 |
申请日期 |
2012.09.10 |
申请人 |
SANKAR ANANTH;KAJAREKAR SACHIN;GANNU SATISH K.;CISCO TECHNOLOGY, INC. |
发明人 |
SANKAR ANANTH;KAJAREKAR SACHIN;GANNU SATISH K. |
分类号 |
G10L17/00 |
主分类号 |
G10L17/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|