发明名称 METHOD AND APPARATUS FOR AUDIO CLIP CLASSIFICATION
摘要 <p>A method of classifying an audio clip into one of a plurality of predefined classes is disclosed. The method separates an audio clip into a plurality of segments, and each of the segments into a plurality of frames (102). The method then extracts Mel Frequency Cepstral Coefficients (MFCC) and Linear Prediction Cepstral Coefficients (LPCC) as the audio features from each frame within one segment (103). Segment characteristics for a segment are determined by deriving one or more measures of said audio features from each frame within the segment. Two or more Support Vector Machine (SVM) classifiers at two or more stages in a segment level classification process are utilised to determine a decision function value (104). For each segment of the audio clip, a class label is determined based on said decision function value. The decision function values of all the segments in said audio clip are mapped to associated segment confidence levels by using a sigmoid function model. Finally, post-processing (105) is performed on the classes and the confidence levels of all the segments to produce a class and associated confidence level for said audio clip (106).</p>
申请公布号 WO2006132596(A1) 申请公布日期 2006.12.14
申请号 WO2005SG00180 申请日期 2005.06.07
申请人 MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD.;ZHAO, YING;CHONG, KOK SENG;NEO, SUA HONG 发明人 ZHAO, YING;CHONG, KOK SENG;NEO, SUA HONG
分类号 G06F17/30;G06F15/18;G06N7/00;G10L15/08 主分类号 G06F17/30
代理机构 代理人
主权项
地址