发明名称 System and method of using neural transforms of robust audio features for speech processing
摘要 A system and method for processing speech includes receiving a first information stream associated with speech, the first information stream comprising micro-modulation features and receiving a second information stream associated with the speech, the second information stream comprising features. The method includes combining, via a non-linear multilayer perceptron, the first information stream and the second information stream to yield a third information stream. The system performs automatic speech recognition on the third information stream. The third information stream can also be used for training HMMs.
申请公布号 US9280968(B2) 申请公布日期 2016.03.08
申请号 US201314046393 申请日期 2013.10.04
申请人 AT&T Intellectual Property I, L.P. 发明人 Bocchieri Enrico Luigi;Dimitriadis Dimitrios
分类号 G10L15/16;G10L21/02;G10L19/00;G10L15/02;G10L21/0208;G10L15/14 主分类号 G10L15/16
代理机构 代理人
主权项 1. The method comprising: receiving, via a communication network, a first information stream associated with speech, the first information stream comprising micro-modulation features modeled in a first time scale; receiving, via the communication network, a second information stream associated with the speech, the second information stream comprising cepstral features modeled in a second time scale, wherein the first time scale is distinct from the second time scale; combining, via a non-linear multilayer perceptron, the first information stream and the second information stream, to yield a third information stream; and performing, via a hardware processor, automatic speech recognition on the third confirmation stream.
地址 Atlanta GA US