发明名称 Sparse Auditory Reproducing Kernel (SPARK) Features for Noise-Robust Speech and Speaker Recognition
摘要 The speech feature extraction algorithm is based on a hierarchical combination of auditory similarity and pooling functions. Computationally efficient features referred to as "Sparse Auditory Reproducing Kernel" (SPARK) coefficients are extracted under the hypothesis that the noise-robust information in speech signal is embedded in a reproducing kernel Hilbert space (RKHS) spanned by overcomplete, nonlinear, and time-shifted gammatone basis functions. The feature extraction algorithm first involves computing kernel based similarity between the speech signal and the time-shifted gammatone functions, followed by feature pruning using a simple pooling technique ("MAX" operation). Different hyper-parameters and kernel functions may be used to enhance the performance of a SPARK based speech recognizer.
申请公布号 US2013297299(A1) 申请公布日期 2013.11.07
申请号 US201313788385 申请日期 2013.03.07
申请人 BOARD OF TRUSTEES OF MICHIGAN STATE UNIVERSITY 发明人 CHAKRABARTTY SHANTANU;FAZELDEHKORDI AMIN
分类号 G10L15/02 主分类号 G10L15/02
代理机构 代理人
主权项
地址