Title of invention: EXTENDED VIDEOLENS MEDIA ENGINE FOR AUDIO RECOGNITION
Abstract: A system, method, and computer program product for automatically analyzing the audio content of multimedia data are disclosed. Embodiments receive multimedia data, detect portions having specified audio features, and output a corresponding subset of the multimedia data together with generated metadata. Audio content features, including voices, non-voice sounds, and closed captioning, from downloaded or streaming movies or video clips are identified much as a human would identify them, but in essentially real time. Particular speakers, the most meaningful content sounds and words, and their corresponding time-stamps are recognized via database comparison and may be presented in order of match probability. Embodiments responsively pre-fetch related data, recognize locations, and provide related advertisements. The content features may also be sent to search engines so that further related content may be identified. User feedback and verification may improve the embodiments over time.
Publication number: US2013006625(A1)  Publication date: 2013.01.03
Application number: US201113171246  Filing date: 2011.06.28
Applicant: SONY CORPORATION;GUNATILAKE PRIYAN;NGUYEN DJUNG;PATIL ABHISHEK;SAHA DIPENDU  Inventor: GUNATILAKE PRIYAN;NGUYEN DJUNG;PATIL ABHISHEK;SAHA DIPENDU
Classification: G10L15/26;G10L17/00  Main classification: G10L15/26
Agency:  Agent:
Principal claim:
Address:
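
The abstract describes recognizing speakers and sounds "via database comparison" and presenting results "in order of match probability" with time-stamps. The record does not say how that comparison is performed, so the following is only a minimal sketch under stated assumptions: audio segments are assumed to be already reduced to numeric feature vectors, cosine similarity stands in for the match probability, and the names `Match`, `cosine`, and `rank_matches` are hypothetical and not taken from the patent.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class Match:
    label: str        # database entry that was compared (hypothetical naming)
    score: float      # similarity score, used here as a stand-in for match probability
    timestamp: float  # position of the audio segment in the media, in seconds


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_matches(segment_features, timestamp, database, min_score=0.0):
    """Compare one segment against every database entry and rank by score.

    `database` maps a label to a reference feature vector (an assumption;
    the source does not describe the database layout).
    """
    matches = [
        Match(label, cosine(segment_features, reference), timestamp)
        for label, reference in database.items()
    ]
    kept = [m for m in matches if m.score >= min_score]
    # Best match first, mirroring "presented in order of match probability".
    return sorted(kept, key=lambda m: m.score, reverse=True)


# Toy reference data, invented purely for illustration.
database = {
    "speaker_a": [1.0, 0.0],
    "speaker_b": [0.0, 1.0],
    "speaker_c": [1.0, 1.0],
}
ranked = rank_matches([1.0, 0.1], timestamp=12.5, database=database)
for m in ranked:
    print(f"{m.timestamp:.1f}s  {m.label}  {m.score:.3f}")
```

A real system would presumably use calibrated probabilities from a trained speaker or sound model rather than raw cosine similarity; the sketch only illustrates the ranking and time-stamping idea.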