Title: Content identification system
Abstract: The content of a media program is recognized by analyzing its audio content to extract therefrom prescribed features, which are compared to a database of features associated with identified content. The identity of the content within the database that has features that most closely match the features of the media program being played is supplied as the identity of the program being played. The features are extracted from a frequency domain version of the media program by a) filtering the coefficients to reduce their number, e.g., using triangular filters; b) grouping a number of consecutive outputs of triangular filters into segments; and c) selecting those segments that meet prescribed criteria, such as those segments that have the largest minimum segment energy with prescribed constraints that prevent the segments from being too close to each other. The triangular filters may be log-spaced and their output may be normalized.
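Steps a) to c) of the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names, the normalization by the sum of the outputs, and the reading of "minimum segment energy" as the smallest per-frame energy inside a segment are all assumptions made for illustration.

```python
import math

def log_spaced_edges(f_lo, f_hi, n_filters):
    """Return n_filters + 2 log-spaced edge frequencies from f_lo to f_hi."""
    ratio = (f_hi / f_lo) ** (1.0 / (n_filters + 1))
    return [f_lo * ratio ** i for i in range(n_filters + 2)]

def triangular_filterbank(power, bin_hz, edges):
    """Reduce a power spectrum to len(edges) - 2 triangular-filter outputs."""
    out = []
    for lo, mid, hi in zip(edges, edges[1:], edges[2:]):
        acc = 0.0
        for k, p in enumerate(power):
            f = k * bin_hz  # centre frequency of spectral bin k
            if lo < f <= mid:
                acc += p * (f - lo) / (mid - lo)   # rising edge
            elif mid < f < hi:
                acc += p * (hi - f) / (hi - mid)   # falling edge
        out.append(acc)
    return out

def normalize(outputs):
    """Scale filter outputs to sum to 1 (assumed normalization)."""
    total = sum(outputs)
    return [v / total for v in outputs] if total > 0 else list(outputs)

def select_segments(frame_energy, seg_len, n_select, min_gap):
    """Pick start indices of the segments with the largest minimum
    frame energy, keeping chosen starts at least min_gap frames apart."""
    scored = [(min(frame_energy[s:s + seg_len]), s)
              for s in range(len(frame_energy) - seg_len + 1)]
    scored.sort(key=lambda t: (-t[0], t[1]))  # best energy first
    chosen = []
    for _, start in scored:
        if all(abs(start - c) >= min_gap for c in chosen):
            chosen.append(start)
            if len(chosen) == n_select:
                break
    return sorted(chosen)
```

A greedy pass over the ranked segments is one simple way to honor the spacing constraint; the abstract does not say how the constraint is enforced.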
Publication number: US9336794(B2)  Publication date: 2016.05.10
Application number: US201414538450  Filing date: 2014.11.11
Applicant: Alcatel Lucent  Inventors: Ben Jan I.; Burges Christopher J.; Mousavi Madjid S.; Nohl Craig R.
Classification: G10L15/00; G10L25/48; G11B27/11; G11B27/28  Main classification: G10L15/00
Agency: Tong, Rea, Bentley & Kim, LLP  Agent: Tong, Rea, Bentley & Kim, LLP
Principal claim: 1. An apparatus for use in recognizing the content of a media program, the apparatus comprising: processing circuitry configured to digitize an audio representation of a media program to form a digitized audio representation of the media program; a memory configured to store the digitized audio representation of the media program; and a processor communicatively connected to the memory, the processor configured to: divide the digitized audio representation into time domain blocks of a prescribed number of samples; smooth the time domain blocks using a filter; convert the smoothed time domain blocks into frequency domain blocks, wherein the smoothed time domain blocks are represented by frequency coefficients; filter each first frequency domain representation of blocks of the media program using a plurality of filters to develop a respective second frequency domain representation of each of the blocks of the media program, the second frequency domain representation of each of the blocks having a reduced number of frequency coefficients with respect to the first frequency domain representation; group frequency coefficients of the second frequency domain representation of the blocks to form frequency coefficient segments; select a plurality of the segments as representing the media program; compare the selected segments to frequency coefficient segments of stored programs to provide corresponding matching scores; and determine the media program using the matching scores.
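The front-end steps of the claim (divide, smooth, convert) and its final steps (compare, determine) can be sketched as follows. This is an illustration under stated assumptions, not the claimed apparatus: the claim does not name the smoothing filter, the transform, or the scoring rule, so the Hamming window, the naive DFT, the non-overlapping blocks, and the squared Euclidean distance used here are all hypothetical choices, as are the function names.

```python
import cmath
import math

def split_blocks(samples, block_len):
    """Divide samples into consecutive non-overlapping blocks
    (assumed layout); a short tail is dropped."""
    return [samples[i:i + block_len]
            for i in range(0, len(samples) - block_len + 1, block_len)]

def hamming_smooth(block):
    """Taper a block with a Hamming window (assumed smoothing filter).
    Requires len(block) >= 2."""
    n = len(block)
    return [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
            for i, x in enumerate(block)]

def power_spectrum(block):
    """Naive O(n^2) DFT; returns power for bins 0 .. n // 2."""
    n = len(block)
    out = []
    for k in range(n // 2 + 1):
        s = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                for i, x in enumerate(block))
        out.append(abs(s) ** 2)
    return out

def match_score(seg_a, seg_b):
    """Squared Euclidean distance between two segments, each a list
    of per-block coefficient lists; lower means a closer match."""
    return sum((a - b) ** 2
               for fa, fb in zip(seg_a, seg_b)
               for a, b in zip(fa, fb))

def identify(query_segments, database):
    """database maps a program name to its stored segments.
    Returns (best_name, best_total_score)."""
    best = None
    for name, stored in database.items():
        # Each query segment is scored against its closest stored segment.
        score = sum(min(match_score(q, s) for s in stored)
                    for q in query_segments)
        if best is None or score < best[1]:
            best = (name, score)
    return best
```

In a fuller pipeline, the spectra from `power_spectrum` would be reduced by the plurality of filters and grouped into segments before `identify` is called; those steps are omitted here to keep the sketch short.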
Address: Boulogne-Billancourt, FR