Title of invention |
System and method of using neural transforms of robust audio features for speech processing |
Abstract |
A system and method for processing speech includes receiving a first information stream associated with speech, the first information stream comprising micro-modulation features, and receiving a second information stream associated with the speech, the second information stream comprising cepstral features. The method includes combining, via a non-linear multilayer perceptron, the first information stream and the second information stream to yield a third information stream. The system performs automatic speech recognition on the third information stream. The third information stream can also be used for training hidden Markov models (HMMs). |
Publication number |
US9280968(B2) |
Publication date |
2016.03.08 |
Application number |
US201314046393 |
Filing date |
2013.10.04 |
Applicant |
AT&T Intellectual Property I, L.P. |
Inventors |
Bocchieri Enrico Luigi;Dimitriadis Dimitrios |
Classification codes |
G10L15/16;G10L21/02;G10L19/00;G10L15/02;G10L21/0208;G10L15/14 |
Main classification code |
G10L15/16 |
Agency |
|
Agent |
|
Principal claim |
1. A method comprising:
receiving, via a communication network, a first information stream associated with speech, the first information stream comprising micro-modulation features modeled in a first time scale;
receiving, via the communication network, a second information stream associated with the speech, the second information stream comprising cepstral features modeled in a second time scale, wherein the first time scale is distinct from the second time scale;
combining, via a non-linear multilayer perceptron, the first information stream and the second information stream, to yield a third information stream; and
performing, via a hardware processor, automatic speech recognition on the third information stream. |
Address |
Atlanta GA US |
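
The combining step recited in the principal claim can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the function names (`upsample`, `init_layer`, `dense`, `combine_streams`), the feature dimensions, the use of frame repetition to reconcile the two time scales, the single tanh hidden layer, and the random untrained weights. It only shows the shape of the data flow, two feature streams in and one fused stream out; the recognition step itself is not shown.

```python
import math
import random

def upsample(stream, n_frames):
    """Repeat frames so the stream has n_frames entries (assumed alignment method)."""
    return [stream[i * len(stream) // n_frames] for i in range(n_frames)]

def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases; a real system would train these."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out

def dense(x, layer, activation=None):
    """One fully connected layer, optionally followed by a non-linearity."""
    weights, biases = layer
    out = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]
    return [activation(o) for o in out] if activation else out

def combine_streams(stream_a, stream_b, hidden, output):
    """Fuse two per-frame feature streams into a third stream with a small MLP."""
    n_frames = max(len(stream_a), len(stream_b))
    a_aligned = upsample(stream_a, n_frames)
    b_aligned = upsample(stream_b, n_frames)
    fused = []
    for frame_a, frame_b in zip(a_aligned, b_aligned):
        x = frame_a + frame_b               # concatenate the two feature vectors
        h = dense(x, hidden, math.tanh)     # non-linear hidden layer
        fused.append(dense(h, output))      # linear output layer
    return fused

rng = random.Random(0)
# Hypothetical inputs: 4 slow-rate frames of 3 features, 8 fast-rate frames of 2 features.
modulation = [[rng.random() for _ in range(3)] for _ in range(4)]
cepstral = [[rng.random() for _ in range(2)] for _ in range(8)]
hidden_layer = init_layer(3 + 2, 5, rng)
output_layer = init_layer(5, 4, rng)
third_stream = combine_streams(modulation, cepstral, hidden_layer, output_layer)
```

In this sketch `third_stream` holds one fused feature vector per frame of the faster stream; in a full pipeline such vectors would be handed to a recognizer or used as training features.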