发明名称 Audio fingerprint extraction by scaling in time and resampling
摘要 An audio fingerprint is extracted from an audio sample, where the fingerprint contains information that is characteristic of the content in the sample. The fingerprint may be generated by computing an energy spectrum for the audio sample, resampling the energy spectrum, transforming the resampled energy spectrum to produce a series of feature vectors, and computing the fingerprint using differential coding of the feature vectors. The generated fingerprint can be compared to a set of reference fingerprints in a database to identify the original audio content.
申请公布号 US9093120(B2) 申请公布日期 2015.07.28
申请号 US201113025060 申请日期 2011.02.10
申请人 YAHOO! INC. 发明人 Bilobrov Sergiy
分类号 G10L19/02;G10L25/18;G11B27/28;G10L25/54 主分类号 G10L19/02
代理机构 Greenberg Traurig, LLP 代理人 DeCarlo James J.;Greenberg Traurig, LLP
主权项 1. A method for extracting an audio fingerprint from an audio frame, the method comprising: filtering, by a processor, the audio frame into a plurality of frequency bands by varying the frequency bands with time to produce a corresponding plurality of filtered audio signals; scaling in time, by the processor, for fingerprinting, a size of each filtered audio signal based on a frequency of the corresponding frequency band, the scaling based on at least one of a mid-frequency and a frequency range of the corresponding frequency band; resampling, by the processor, the scaled filtered audio signals by sampling from different corresponding frequency bands as time changes to produce resampled audio signals; transforming, by the processor, the resampled audio signals to produce a feature vector for each resampled audio signal; and computing, by the processor, the audio fingerprint based on the feature vectors.
地址 Sunnyvale CA US