发明名称 Extracting classifying data in music from an audio bitstream
摘要 The method of the present invention utilizes machine-learning techniques, particularly Support Vector Machines in combination with a neural network, to process a unique machine-learning enabled representation of the audio bitstream. Using this method, a classifying machine is able to autonomously detect characteristics of a piece of music, such as the artist or genre, and classify it accordingly. The method includes transforming digital time-domain representation of music into a frequency-domain representation, then dividing that frequency data into time slices, and compressing it into frequency bands to form multiple learning representations of each song. The learning representations that result are processed by a group of Support Vector Machines, then by a neural network, both previously trained to distinguish among a given set of characteristics, to determine the classification.
申请公布号 US7295977(B2) 申请公布日期 2007.11.13
申请号 US20010939954 申请日期 2001.08.27
申请人 NEC LABORATORIES AMERICA, INC. 发明人 WHITMAN BRIAN;FLAKE GARY W.;LAWRENCE STEPHEN R.
分类号 G06F17/30;G10L15/16;G06N3/00;G10H7/10;G10L11/00;G10L17/00;G10L19/00;H03M7/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址