发明名称 Distributed realtime speech recognition system
摘要 A real-time system incorporating speech recognition and linguistic processing for recognizing a spoken query by a user and distributed between client and server, is disclosed. The system accepts user's queries in the form of speech at the client where minimal processing extracts a sufficient number of acoustic speech vectors representing the utterance. These vectors are sent via a communications channel to the server where additional acoustic vectors are derived. Using Hidden Markov Models (HMMs), and appropriate grammars and dictionaries conditioned by the selections made by the user, the speech representing the user's query is fully decoded into text (or some other suitable form) at the server. This text corresponding to the user's query is then simultaneously sent to a natural language engine and a database processor where optimized SQL statements are constructed for a full-text search from a database for a recordset of several stored questions that best matches the user's query. Further processing in the natural language engine narrows the search to a single stored question. The answer corresponding to this single stored question is next retrieved from the file path and sent to the client in compressed form. At the client, the answer to the user's query is articulated to the user using a text-to-speech engine in his or her native natural language. The system requires no training and can operate in several natural languages.</PTEXT>
申请公布号 US6633846(B1) 申请公布日期 2003.10.14
申请号 US19990439145 申请日期 1999.11.12
申请人 PHOENIX SOLUTIONS, INC. 发明人 BENNETT IAN M.;BABU BANDI RAMESH;MORKHANDIKAR KISHOR;GURURAJ PALLAKI
分类号 G06F3/16;G06F17/28;G06F17/30;G09B7/02;G10L13/00;G10L15/00;G10L15/02;G10L15/14;G10L15/18;G10L15/20;G10L15/22;G10L15/28;G10L21/02;(IPC1-7):G10L15/02 主分类号 G06F3/16
代理机构 代理人
主权项
地址