发明名称 |
Use of metadata to post process speech recognition output |
摘要 |
A method of utilizing metadata stored in a computer-readable medium to assist in the conversion of an audio stream to a text stream. The method compares personally identifiable data, such as a user's electronic address book and/or Caller/Recipient ID information (in the case of processing voice mail to text), to the n-best results generated by a speech recognition engine for each word that is output by the engine. A goal of this comparison is to correct a possible misrecognition of a spoken proper noun such as a name or company with its proper textual form or a spoken phone number to correctly formatted phone number with Arabic numerals to improve the overall accuracy of the output of the voice recognition system. |
申请公布号 |
US8676577(B2) |
申请公布日期 |
2014.03.18 |
申请号 |
US20090415874 |
申请日期 |
2009.03.31 |
申请人 |
JABLOKOV IGOR RODITIS;STROHOFER, III CLIFFORD J.;WHITE MARC;JABLOKOV VICTOR RODITIS;CANYON IP HOLDINGS, LLC |
发明人 |
JABLOKOV IGOR RODITIS;STROHOFER, III CLIFFORD J.;WHITE MARC;JABLOKOV VICTOR RODITIS |
分类号 |
G10L15/00 |
主分类号 |
G10L15/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|