摘要 |
A system and method for extracting acoustic features and speech activity on a device and transmitting them in a distributed voice recognition system. The distributed voice recognition system includes a local VR engine in a subscriber unit (102) and a server VR engine in a server (160). The local VR engine comprises a feature extraction (FE) module (104) that extracts features from a speech signal, and a voice activity detection module (VAD) (106) that detects voice activity within a speech signal. The system includes filters, framing and windowing modules, power spectrum analyzers, a neural network, a nonlinear element, and other components to selectively provide an advanced front end vector including predetermined portions of the voice activity detection indication and extracted features from the subscriber unit (104) to the server (160). The system also includes a module to generate additional feature vectors on the server from the received features using a feed-forward multilayer perception (MLP) and providing the same to the speech server (160). |
申请人 |
QUALCOMM INCORPORATED |
发明人 |
GARUDADRI, HARINATH;HERMANSKY, HYNEK;BURGET, LUKAS;JAIN, PRATIBHA;KAJAREKAR, SACHIN;SIVADAS, SUNIL;DUPONT, STEPHANE, N.;ORTUZAR, MARIA, CARMEN, BENITEZ;MORGAN, NELSON, H. |