发明名称 Finding differences in nearly-identical audio recordings
摘要 Systems and techniques are provided for finding differences in nearly-identical audio recordings. A first version of an audio recording may be received. A second version of the audio recording may be received. A difference between the first version of the audio recording and the second version of the audio recording may be determined using time domain analysis and frequency domain analysis. The difference may be stored in a difference set. The difference set may allow the first version of the audio recording to be distinguished from the second version of the audio recording. The audio recording may be a music track. The first version of the audio recording may be an explicit version of the music track. The second version of the audio recording may be an edited version of the music track.
申请公布号 US9536546(B2) 申请公布日期 2017.01.03
申请号 US201414453762 申请日期 2014.08.07
申请人 GOOGLE INC. 发明人 Motta Giovanni;Lu Yang
分类号 G06F17/00;G10L25/51;G06F17/30 主分类号 G06F17/00
代理机构 Morris & Kamlay LLP 代理人 Morris & Kamlay LLP
主权项 1. A computer-implemented method performed by a data processing apparatus, the method comprising: receiving a first version of an audio recording; receiving a second version of the audio recording; determining at least one difference between the first version of the audio recording and the second version of the audio recording using one or more of time domain analysis and frequency domain analysis; and storing the at least one difference in a difference set, wherein the difference set allows the first version of the audio recording to be distinguished from the second version of the audio recording, wherein determining at the least one difference between the first version of the audio recording and the second version of the audio recording using time domain analysis comprises: partitioning the first version of the audio recording and the second version of the audio recording into non-overlapping blocks of fixed lengths,aligning the blocks for the second version of the audio recording with corresponding blocks for the first version of the audio recording to form block pairs,subtracting the block for the second version of the audio recording from the corresponding block for the first version of the audio recording for each block pair to obtain a residual signal,subtracting a weighted spectrum of the second version of the audio recording from the residual signal to obtain a difference signal,squaring the difference signal to obtain a squared difference signal,determining a mean value for the difference signal to obtain a threshold, andinspecting each peak in the squared difference signal that is greater than the threshold to determine if each peak represents one of the at least one differences.
地址 Mountain View CA US