Abstract |
An apparatus and method for fingerprinting an audio signal utilize a programmed machine to identify overlapping windows in a time-domain representation of the audio signal, establish a frequency-domain representation of the overlapping windows, convolve a set of two-dimensional kernels with the frequency-domain representation to provide a convolutional layer as an output stage, reduce the dimensionality of the convolutional layer to provide one or more further output stages, and perform further processing to output a decision regarding a plurality of the overlapping windows, the decision comprising either a specific content ID that matches the audio signal or a failure-to-match indication. |
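
The stages named in the abstract can be illustrated with a minimal NumPy sketch. It is not the claimed implementation: the function names, the window length and hop size, the Hann weighting, the log scaling, the ReLU, the max pooling used for dimensionality reduction, and the cosine-similarity threshold standing in for the unspecified "further processing" are all assumptions chosen for illustration. The kernels are supplied by the caller; a trained system would learn them.

```python
import numpy as np


def frame_signal(x, win_len=256, hop=128):
    """Split a 1-D signal into overlapping windows, one window per row."""
    n = 1 + (len(x) - win_len) // hop
    idx = np.arange(win_len)[None, :] + hop * np.arange(n)[:, None]
    return x[idx]


def spectrogram(frames):
    """Frequency-domain representation: magnitude spectrum of each window."""
    weighted = frames * np.hanning(frames.shape[1])  # assumed Hann weighting
    return np.abs(np.fft.rfft(weighted, axis=1))


def conv2d_valid(img, kernels):
    """Convolve each 2-D kernel with img over the 'valid' region, then ReLU."""
    kh, kw = kernels.shape[1:]
    patches = np.lib.stride_tricks.sliding_window_view(img, (kh, kw))
    # Flipping the kernels turns the sliding dot product into a convolution.
    out = np.einsum("ijkl,nkl->nij", patches, kernels[:, ::-1, ::-1])
    return np.maximum(out, 0.0)


def max_pool(maps, size=2):
    """Reduce dimensionality by non-overlapping max pooling on each map."""
    n, h, w = maps.shape
    h2, w2 = h // size, w // size
    blocks = maps[:, : h2 * size, : w2 * size].reshape(n, h2, size, w2, size)
    return blocks.max(axis=(2, 4))


def embed(x, kernels):
    """Map a signal to a unit-length feature vector, one entry per kernel."""
    spec = np.log1p(spectrogram(frame_signal(x)))
    pooled = max_pool(conv2d_valid(spec, kernels))
    feat = pooled.mean(axis=(1, 2))  # aggregate over all windows and bins
    norm = np.linalg.norm(feat)
    return feat / norm if norm > 0 else feat


def decide(x, kernels, database, threshold=0.95):
    """Return the best-matching content ID, or None for failure-to-match."""
    query = embed(x, kernels)
    best_id, best_score = None, -1.0
    for content_id, reference in database.items():
        score = float(query @ reference)  # cosine similarity of unit vectors
        if score > best_score:
            best_id, best_score = content_id, score
    return best_id if best_score >= threshold else None
```

A hypothetical use would build `database` by calling `embed` on each reference recording, keyed by content ID, and then call `decide` on a query clip. The result covers many overlapping windows at once, because `embed` averages over every window in the clip before the comparison is made.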