Title: Videolens media engine
Abstract: A system, method, and computer program product for automatically analyzing multimedia data are disclosed. Embodiments receive multimedia data, detect portions having specified features, and output a corresponding subset of the multimedia data. Content features from downloaded or streaming movies or video clips are identified much as a human viewer probably would identify them, but in essentially real time. Embodiments then generate an index or menu based on individual consumer preferences. Consumers can peruse the index, produce customized trailers, or edit and tag content with metadata as desired. The tool can categorize and cluster content by feature to assemble a library of scenes or scene clusters according to user-selected criteria.
Publication number: US9594959(B2) Publication date: 2017.03.14
Application number: US201414289942 Filing date: 2014.05.29
Applicant: Sony Corporation Inventor: Gunatilake Priyan
Classification: G06K9/00; G06F17/30; G10L15/26; G06T7/00; G06T7/20; H04N5/91; G10L25/78 Main classification: G06K9/00
Agency: Fitch, Even, Tabin & Flannery LLP Agent: Fitch, Even, Tabin & Flannery LLP
Main claim: 1. A method for automated analysis of multimedia data, the method comprising: receiving multimedia data at a computing device including a computer processor programmed to analyze the multimedia data; identifying at least one multimedia data portion having specified content features via the computer processor analyzing the multimedia data by identifying: at least one action scene in the multimedia data based on audio signal amplitude and motion vector magnitude; at least one low motion scene in the multimedia data based on macro-block size and motion vector magnitude; at least one previewing frame in the multimedia data based on color histograms taken from sampled candidate frames; and at least one human dialogue in the multimedia data based on mel frequency cepstrum coefficients (MFCC) of an audio sample; and responsively outputting the at least one identified multimedia data portion.
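Illustrative note (not part of the patent text): the detection steps recited in the claim can be pictured with a minimal Python sketch. Everything specific here is an assumption made for illustration: the `Segment` fields, the threshold values, the direction of each comparison (loud and fast for action, large macro-blocks and little motion for low motion), and the rule of choosing the candidate frame whose color histogram is closest to the average histogram. The claim names only the input signals, not how they are combined. MFCC-based dialogue detection is omitted because it would need an audio-processing library.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int, int]  # (r, g, b), each 0..255


@dataclass
class Segment:
    """Hypothetical per-segment measurements; the field names are assumptions."""
    start_s: float
    audio_amplitude: float   # normalized mean audio amplitude, 0..1
    motion_magnitude: float  # mean motion vector magnitude
    macroblock_size: float   # mean macro-block size in pixels


def label_segment(seg: Segment,
                  audio_thresh: float = 0.6,
                  motion_hi: float = 8.0,
                  motion_lo: float = 1.0,
                  block_thresh: float = 16.0) -> str:
    """Label a segment using assumed thresholds on the signals the claim names."""
    # Assumed rule: loud audio together with large motion vectors suggests action.
    if seg.audio_amplitude >= audio_thresh and seg.motion_magnitude >= motion_hi:
        return "action"
    # Assumed rule: large macro-blocks together with small motion vectors
    # suggest a low motion scene.
    if seg.macroblock_size >= block_thresh and seg.motion_magnitude <= motion_lo:
        return "low_motion"
    return "other"


def color_histogram(pixels: Sequence[Pixel], bins: int = 4) -> List[float]:
    """Normalized RGB histogram with bins**3 buckets."""
    step = 256 // bins
    hist = [0] * (bins ** 3)
    for r, g, b in pixels:
        hist[(r // step) * bins * bins + (g // step) * bins + (b // step)] += 1
    total = len(pixels) or 1
    return [count / total for count in hist]


def pick_preview_frame(frames: Sequence[Sequence[Pixel]]) -> int:
    """Return the index of the candidate frame closest to the mean histogram.

    The selection rule (smallest L1 distance to the mean) is an assumption.
    """
    hists = [color_histogram(frame) for frame in frames]
    mean = [sum(column) / len(hists) for column in zip(*hists)]
    dists = [sum(abs(a - b) for a, b in zip(h, mean)) for h in hists]
    return dists.index(min(dists))


if __name__ == "__main__":
    segments = [
        Segment(0.0, 0.9, 12.0, 8.0),
        Segment(10.0, 0.1, 0.2, 16.0),
        Segment(20.0, 0.3, 4.0, 8.0),
    ]
    for seg in segments:
        print(seg.start_s, label_segment(seg))
```

In practice the thresholds would have to be tuned per codec and per content type; the fixed defaults above exist only to make the sketch runnable.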
Address: Tokyo, JP