Title of invention |
GRAMMAR FRAGMENT ACQUISITION USING SYNTACTIC AND SEMANTIC CLUSTERING |
Abstract |
A method and apparatus are provided for automatically acquiring grammar fragments for recognizing and understanding fluently spoken language. Grammar fragments representing a set of syntactically and semantically similar phrases may be generated using three probability distributions: of succeeding words, of preceding words, and of associated call-types. The similarity between phrases may be measured by applying the Kullback-Leibler distance to these three probability distributions. Phrases that are close in all three distances may be clustered into a grammar fragment. |
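The clustering idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the add-alpha smoothing, the symmetrized form of the Kullback-Leibler distance, the single shared threshold, the greedy grouping rule, and the toy corpus are all assumptions made for illustration only.

```python
import math
from collections import Counter

def distribution(counts, vocab, alpha=0.5):
    """Add-alpha smoothed probabilities over a shared vocabulary (assumed smoothing)."""
    total = sum(counts.values()) + alpha * len(vocab)
    return {v: (counts.get(v, 0) + alpha) / total for v in vocab}

def kl(p, q):
    """Kullback-Leibler divergence D(p || q) for distributions on the same support."""
    return sum(p[v] * math.log(p[v] / q[v]) for v in p)

def kl_distance(counts_a, counts_b, alpha=0.5):
    """Symmetrized KL distance between two count tables (symmetrization is an assumption)."""
    vocab = set(counts_a) | set(counts_b)
    if not vocab:
        return 0.0
    p = distribution(counts_a, vocab, alpha)
    q = distribution(counts_b, vocab, alpha)
    return kl(p, q) + kl(q, p)

def phrase_contexts(corpus, phrases):
    """Count preceding words, succeeding words, and call-types for each phrase."""
    ctx = {ph: {"prev": Counter(), "next": Counter(), "call": Counter()}
           for ph in phrases}
    for words, call_type in corpus:
        for ph in phrases:
            n = len(ph)
            for i in range(len(words) - n + 1):
                if tuple(words[i:i + n]) == ph:
                    ctx[ph]["prev"][words[i - 1] if i > 0 else "<s>"] += 1
                    ctx[ph]["next"][words[i + n] if i + n < len(words) else "</s>"] += 1
                    ctx[ph]["call"][call_type] += 1
    return ctx

def is_close(a, b, threshold):
    """True when all three distances (prev, next, call) fall below the threshold."""
    return all(kl_distance(a[k], b[k]) < threshold for k in ("prev", "next", "call"))

def cluster(ctx, threshold=1.0):
    """Greedy grouping: a phrase joins the first cluster holding a close member."""
    clusters = []
    for ph in ctx:
        for members in clusters:
            if any(is_close(ctx[ph], ctx[other], threshold) for other in members):
                members.append(ph)
                break
        else:
            clusters.append([ph])
    return clusters

# Toy corpus of (utterance words, call-type) pairs; purely hypothetical data.
corpus = [
    ("i want to make a collect call please".split(), "collect"),
    ("i would like to make a collect call please".split(), "collect"),
    ("i want to check my bill".split(), "billing"),
    ("i would like to check my bill".split(), "billing"),
]
phrases = [("i", "want", "to"), ("i", "would", "like", "to"), ("my", "bill")]
fragments = cluster(phrase_contexts(corpus, phrases))
```

In this toy data, "i want to" and "i would like to" occur with identical preceding words, succeeding words, and call-types, so all three distances are zero and the two phrases land in one fragment, while "my bill" stays separate. A real system would need a far larger corpus and tuned thresholds, possibly one per distance.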
Publication number |
US2014303978(A1) |
Publication date |
2014.10.09 |
Application number |
US201414196536 |
Filing date |
2014.03.04 |
Applicant |
AT&T INTELLECTUAL PROPERTY I, L.P. |
Inventors |
Arai Kazuhiro;Gorin Allen L.;Riccardi Giuseppe;Wright Jeremy H. |
Classification |
G10L15/06 |
Main classification |
G10L15/06 |
Agency |
|
Agent |
|
Independent claim |
1. A method comprising:
selecting candidate multi-word phrases from a set of words, wherein a maximum number of candidate multi-word phrases is based on a highest of a number of preceding contexts;
for each candidate multi-word phrase in the candidate multi-word phrases, generating a measurement associated with a succeeding context of a succeeding phrase and a preceding context of a preceding phrase using a similarity in the candidate multi-word phrases; and
clustering, via a processor, the candidate multi-word phrases into a grammar fragment based on the measurement, wherein the grammar fragment represents similar phrases that are both syntactically and semantically coherent. |
Address |
Atlanta GA US |