发明名称 METHOD AND SYSTEM FOR ROBUST AUDIO HASHING
摘要 Method and system for channel-invariant robust audio hashing, the method comprising: a robust hash extraction step wherein a robust hash is extracted from audio content, said step comprising: dividing the audio content in frames;applying a transformation procedure on said frames to compute, for each frame, transformed coefficients;applying a normalization procedure on the transformed coefficients to obtain normalized coefficients, wherein said normalization procedure comprises computing the product of the sign of each coefficient of said transformed coefficients by an amplitude-scaling-invariant function of any combination of said transformed coefficients;applying a quantization procedure on said normalized coefficients to obtain the robust hash of the audio content; anda comparison step wherein the robust hash is compared with reference hashes to find a match.
申请公布号 US2014188487(A1) 申请公布日期 2014.07.03
申请号 US201114123865 申请日期 2011.06.06
申请人 Perez Gonzalez Fernando;Comesana Alfaro Pedro;Perez Freire Luis;Perez Vieites Diego 发明人 Perez Gonzalez Fernando;Comesana Alfaro Pedro;Perez Freire Luis;Perez Vieites Diego
分类号 G10L19/00 主分类号 G10L19/00
代理机构 代理人
主权项 1. A method for audio content identification based on robust audio hashing, comprising: a robust hash extraction step wherein a robust hash (110) is extracted from audio content (102,106); a comparison step wherein the robust hash (110) is compared with at least one reference hash (302) to find a match;characterized in that the robust hash extraction step comprises: dividing the audio content (102,106) in at least one frame; applying a transformation procedure (206) on said at least one frame to compute, for each frame, at least one transformed coefficient (208); applying a normalization procedure (212) on the at least one transformed coefficient (208) to obtain at least one normalized coefficient (214), wherein said normalization procedure (212) comprises computing the product of the sign of each coefficient of said at least one transformed coefficient (208) by an amplitude-scaling-invariant function of any combination of said at least one transformed coefficient (208); applying a quantization procedure (220) on said at least one normalized coefficient (214) to obtain the robust hash (110) of the audio content (102,106).
地址 Vigo ES