Abstract |
<p>A method receives video content and metadata associated with the video content. The method then extracts visual, audio, and textual features of the video content based on the metadata. Portions of the visual, audio, and textual features are fused into composite features, each of which includes multiple features drawn from the visual, audio, and textual features. A set of video segments of the video content is identified based on the composite features. The segments may also be identified based on a user query.</p> |
Applicant |
ARRIS ENTERPRISES, INC. |
Inventors |
ISHTIAQ, FAISAL;FONSECA, BENEDITO J., JR.;BAUM, KEVIN L.;BRASKICH, ANTHONY J.;EMEOTT, STEPHEN P.;GANDHI, BHAVAN;LI, RENXIANG;SMITH, ALFONSO MARTINEZ;NEEDHAM, MICHAEL L.;OULD DELLAHY, ISSELMOU |