Title MONAURAL SPEECH FILTER
Abstract A system receives monaural sound which includes speech and background noises. The received sound is divided by frequency and time into time-frequency units (TFUs). Each TFU is classified as speech or non-speech by a processing unit. The processing unit for each frequency range includes at least one of a deep neural network (DNN) or a linear support vector machine (LSVM). The DNN extracts and classifies the features of the TFU and includes a pre-trained stack of Restricted Boltzmann Machines (RBMs), and each RBM includes a visible and a hidden layer. The LSVM classifies each TFU based on extracted features from the DNN, including those from the visible layer of the first RBM, and those from the hidden layer of the last RBM in the stack. The LSVM and DNN are trained with a plurality of training noises. Each TFU classified as speech is output.
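The classification flow described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the patented implementation: the function names (`stack_features`, `classify_channel`, `apply_mask`), the layer sizes, and the random untrained weights are all assumptions made for the example, and pre-training of the RBM stack and training of the LSVM are omitted. It shows only how features from the visible layer of the first layer and the hidden layer of the last layer might be concatenated, passed to a linear classifier per frequency channel, and used to keep the TFUs labelled as speech.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def stack_features(visible, weights, biases):
    """Propagate per-TFU features up a stack of RBM-style layers and return
    the visible-layer input concatenated with the last hidden layer."""
    hidden = visible
    for W, b in zip(weights, biases):
        hidden = sigmoid(hidden @ W + b)
    return np.concatenate([visible, hidden], axis=1)

def classify_channel(visible, weights, biases, svm_w, svm_b):
    """Linear decision per TFU in one frequency channel: True means speech."""
    feats = stack_features(visible, weights, biases)
    return feats @ svm_w + svm_b > 0

def apply_mask(tf_units, mask):
    """Keep the TFUs labelled as speech and zero out the rest."""
    return np.where(mask, tf_units, 0.0)

# Hypothetical sizes and random (untrained) parameters, for illustration only.
rng = np.random.default_rng(0)
n_channels, n_frames, n_feat = 4, 10, 6
sizes = [n_feat, 8, 5]  # visible layer, then two hidden layers

tf_units = rng.random((n_channels, n_frames))  # stand-in TFU energies
mask = np.zeros((n_channels, n_frames), dtype=bool)

for c in range(n_channels):  # one processing unit per frequency channel
    visible = rng.random((n_frames, n_feat))  # stand-in extracted features
    weights = [rng.normal(size=(a, b)) for a, b in zip(sizes[:-1], sizes[1:])]
    biases = [np.zeros(b) for b in sizes[1:]]
    svm_w = rng.normal(size=n_feat + sizes[-1])
    mask[c] = classify_channel(visible, weights, biases, svm_w, 0.0)

filtered = apply_mask(tf_units, mask)
```

In a real system the weights would come from RBM pre-training followed by supervised training on speech mixed with a variety of noises, and the masked TFUs would be resynthesized into a waveform.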
Publication number WO2013149123(A1) Publication date 2013.10.03
Application number WO2013US34564 Filing date 2013.03.29
Applicant THE OHIO STATE UNIVERSITY Inventors WANG, YUXUAN; WANG, DELIANG
Classification G10L15/16 Main classification G10L15/16
Agency Agent
Main claim
Address