Title of invention: Pitch shift resistant audio matching
Abstract: Systems and methods are provided herein relating to audio matching. Both melody fingerprints and audio-id fingerprints can be used to improve an audio matching system's resistance to pitch shifts. A melody fingerprint can be used to identify a set of potential melody matches. Varying pitch shifted audio-id reference fingerprints can be generated for audio-id fingerprints associated with the potential matches identified in melody matching. Additional pitch shifted audio-id fingerprints of a reference sample are generated and used in matching only if an audio sample has previously been matched to a melody fingerprint of the same reference sample. A reference index need not be expanded to include pitch shifted variations of each reference sample as pitch shifted variations of audio-id fingerprint reference samples are generated and used only if their associated melody fingerprint is deemed a potential match.
Publication number: US9052986(B1) Publication date: 2015.06.09
Application number: US201213450422 Filing date: 2012.04.18
Applicant: Google Inc. Inventors: Postelnicu Gheorghe; Sharifi Matthew
Classification: G06F17/00 Main classification: G06F17/00
Agency: Amin, Turocy & Watson, LLP Agent: Amin, Turocy & Watson, LLP
Main claim: 1. A system comprising: a memory that has stored thereon computer executable components; and a processor that executes the following computer executable components stored in the memory: an input component that receives a video sample; a fingerprint component that generates a melody fingerprint and an audio-id fingerprint based on an audio track of the video sample; a melody matching component that identifies a set of potential audio matches for the audio track based on comparing the melody fingerprint to reference melody fingerprints for the potential audio matches of the set; an audio-id matching component that identifies reference audio-id fingerprints respectively associated with the potential audio matches of the set; a pitch shift evaluation component that determines an estimated amount of pitch shift between the audio track and the reference audio-id fingerprints; and a pitch variation component that generates sets of pitch modified fingerprints for each of the reference audio-id fingerprints based on the estimated amount of pitch shift, wherein the audio-id matching component identifies a subset of the set of the potential audio matches based on comparing the audio-id fingerprint to the reference audio-id fingerprints and the sets of the pitch modified fingerprints.
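The two-stage flow recited in the claim can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: audio is reduced to a list of log-frequency bin numbers (so a pitch shift is a constant offset), the melody fingerprint is a simple up/down contour, the audio-id fingerprint is the raw bin sequence, and the names `melody_fingerprint`, `audio_id_fingerprint`, `estimate_shift`, `shifted_variants`, `match`, and `REFERENCE_INDEX` are hypothetical. The point it shows is that pitch-modified reference fingerprints are built only for references that already passed melody matching, so the index itself stores one entry per reference.

```python
from collections import Counter

# Hypothetical reference index: one unshifted bin sequence per reference sample.
REFERENCE_INDEX = {
    "song_a": [10, 12, 15, 13],
    "song_b": [10, 11, 17, 12],  # same up/up/down contour as song_a
    "song_c": [10, 8, 6, 9],
}

def melody_fingerprint(bins):
    # Toy melody fingerprint: sign of each step (+1 up, -1 down, 0 same).
    # A constant pitch offset leaves it unchanged.
    return tuple((b > a) - (b < a) for a, b in zip(bins, bins[1:]))

def audio_id_fingerprint(bins):
    # Toy audio-id fingerprint: the absolute bins, so it is pitch sensitive.
    return tuple(bins)

def estimate_shift(sample_bins, ref_bins):
    # Estimated shift: the most common position-wise bin difference.
    diffs = Counter(s - r for s, r in zip(sample_bins, ref_bins))
    return diffs.most_common(1)[0][0] if diffs else 0

def shifted_variants(ref_bins, estimate, spread=1):
    # Pitch-modified reference fingerprints around the estimated shift.
    return [
        tuple(b + d for b in ref_bins)
        for d in range(estimate - spread, estimate + spread + 1)
    ]

def match(sample_bins, reference_index):
    sample_melody = melody_fingerprint(sample_bins)
    sample_audio = audio_id_fingerprint(sample_bins)
    # Stage 1: melody matching yields the set of potential matches.
    candidates = [
        name for name, ref in reference_index.items()
        if melody_fingerprint(ref) == sample_melody
    ]
    # Stage 2: audio-id matching against the reference and its
    # pitch-modified variants, generated only for the candidates.
    confirmed = []
    for name in candidates:
        ref = reference_index[name]
        variants = shifted_variants(ref, estimate_shift(sample_bins, ref))
        if sample_audio == audio_id_fingerprint(ref) or sample_audio in variants:
            confirmed.append(name)
    return confirmed

# song_a raised by three bins: song_b passes stage 1 but fails stage 2.
print(match([13, 15, 18, 16], REFERENCE_INDEX))
```

In this toy setup exact equality stands in for whatever similarity scoring a real matcher would use, and the `spread` parameter is an illustrative way to tolerate a slightly wrong shift estimate.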
Address: Mountain View CA US