发明名称 Method for extracting representative segments from music
摘要 A method for extracting the most representative segments of a musical composition, represented by an audio signal, according to which the audio signal is preprocessed by a set of preprocessors, each if which is adapted to identify a rhythmic pattern. The output of the preprocessors that provided the most periodic or rhythmical patterns in the musical composition selected and the musical composition is divided into bars with rhythmic patterns, while iteratively checking and scoring their quality and detecting a section that is a sequence of bars with score above a predetermined threshold. Checking and scoring is iteratively repeated until all sections are detected. Then similarity matrices between all bars that belong to the musical composition are constructed, based on MFCCs of the processed sound, chromograms and the rhythmic patterns. Then equivalent classes of similar sections are extracted along the musical composition. Substantial transitions between sections represented as blocks in the similarity matrices are collected and a representative segment is selected from each class with the highest number of sections.
申请公布号 US9099064(B2) 申请公布日期 2015.08.04
申请号 US201214362129 申请日期 2012.11.29
申请人 Play My Tone Ltd. 发明人 Sheffer Ohad;Calev Kobi;Alloro Omri Cohen
分类号 A63H5/00;G04B13/00;G10H7/00;G10H1/00;G10H1/36 主分类号 A63H5/00
代理机构 Lowenstein Sandler LLP 代理人 Lowenstein Sandler LLP
主权项 1. A method for extracting the most representative segments of a musical composition, represented by an audio signal, comprising: a) preprocessing said audio signal by a set of preprocessors, each of which is adapted to identify a rhythmic pattern within the musical composition; b) selecting an output of the preprocessors that provides the most periodic or rhythmical patterns within said musical composition; c) dividing said musical composition into bars having rhythmic patterns, while iteratively checking and scoring their quality and detecting a section being a sequence of bars with a score above a predetermined threshold; d) iteratively repeating the preceding step until all sections are detected; e) constructing similarity matrices between all bars that belong to said musical composition, based on MFCCs of the processed sound, chromograms and said rhythmic patterns; f) extracting equivalent classes of similar sections along said musical composition; g) collecting substantial transitions between sections represented as blocks in said similarity matrices; and h) selecting a representative segment from each class having the highest number of sections.
地址 Tel Aviv IL