Title of invention: GRAMMAR FRAGMENT ACQUISITION USING SYNTACTIC AND SEMANTIC CLUSTERING
Abstract: A method and apparatus are provided for automatically acquiring grammar fragments for recognizing and understanding fluently spoken language. A grammar fragment, which represents a set of syntactically and semantically similar phrases, may be generated using three probability distributions: of succeeding words, of preceding words, and of associated call-types. The similarity between phrases may be measured by applying the Kullback-Leibler distance to these three probability distributions. Phrases that are close in all three distances may be clustered into a grammar fragment.
Publication number: US2014303978(A1) Publication date: 2014.10.09
Application number: US201414196536 Filing date: 2014.03.04
Applicant: AT&T INTELLECTUAL PROPERTY I, L.P. Inventors: Arai Kazuhiro; Gorin Allen L.; Riccardi Giuseppe; Wright Jeremy H.
Classification: G10L15/06 Main classification: G10L15/06
Agency: Agent:
Principal claim: 1. A method comprising: selecting candidate multi-word phrases from a set of words, wherein a maximum number of candidate multi-word phrases is based on a highest of a number of preceding contexts; for each candidate multi-word phrase in the candidate multi-word phrases, generating a measurement associated with a succeeding context of a succeeding phrase and a preceding context of a preceding phrase using a similarity in the candidate multi-word phrases; and clustering, via a processor, the candidate multi-word phrases into a grammar fragment based on the measurement, wherein the grammar fragment represents similar phrases that are both syntactically and semantically coherent.
Address: Atlanta GA US
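
Illustrative sketch (not part of the patent record): the abstract describes measuring phrase similarity with the Kullback-Leibler distance over three distributions (preceding words, succeeding words, call-types) and clustering phrases that are close in all three. The Python below is a minimal sketch of that idea under stated assumptions. The symmetrised form of the distance, the additive smoothing constant `eps`, the single shared `threshold`, the greedy grouping strategy, and all phrase counts and call-type names are hypothetical choices made for illustration; the record itself does not specify them.

```python
import math
from collections import Counter

CONTEXTS = ("preceding", "succeeding", "call_type")


def smoothed(counts, vocab, eps=1e-3):
    """Turn raw counts into a distribution over vocab.

    Additive smoothing (an assumption) keeps every probability nonzero
    so the logarithm in the KL distance is always defined.
    """
    total = sum(counts.values()) + eps * len(vocab)
    return {v: (counts.get(v, 0) + eps) / total for v in vocab}


def kl(p, q):
    """Kullback-Leibler distance D(p || q) over a shared vocabulary."""
    return sum(p[v] * math.log(p[v] / q[v]) for v in p)


def sym_kl(c1, c2, eps=1e-3):
    """Symmetrised KL distance between two count tables (an assumption)."""
    vocab = set(c1) | set(c2)
    p, q = smoothed(c1, vocab, eps), smoothed(c2, vocab, eps)
    return 0.5 * (kl(p, q) + kl(q, p))


def phrase_distances(a, b):
    """One distance per context type, in the order given by CONTEXTS."""
    return tuple(sym_kl(a[k], b[k]) for k in CONTEXTS)


def close(a, b, threshold):
    """True only if the phrases are close in all three distances."""
    return all(d < threshold for d in phrase_distances(a, b))


def cluster(stats, threshold):
    """Greedy grouping: a phrase joins every fragment holding a close member,
    and fragments linked through that phrase are merged."""
    fragments = []
    for name in stats:
        linked = [f for f in fragments
                  if any(close(stats[name], stats[m], threshold) for m in f)]
        merged = [name]
        for f in linked:
            merged.extend(f)
            fragments.remove(f)
        fragments.append(merged)
    return fragments


# Hypothetical toy counts, invented purely for illustration.
stats = {
    "calling card": {
        "preceding": Counter({"my": 3, "a": 1}),
        "succeeding": Counter({"number": 3, "please": 1}),
        "call_type": Counter({"CALLING_CARD": 4}),
    },
    "credit card": {
        "preceding": Counter({"my": 2, "a": 2}),
        "succeeding": Counter({"number": 3, "please": 1}),
        "call_type": Counter({"CALLING_CARD": 3, "BILLING": 1}),
    },
    "collect call": {
        "preceding": Counter({"make": 4}),
        "succeeding": Counter({"to": 3, "please": 1}),
        "call_type": Counter({"COLLECT": 4}),
    },
}

fragments = cluster(stats, threshold=2.0)
print(fragments)
```

With these toy counts, the two "card" phrases fall within the threshold on all three distances and are grouped into one fragment, while "collect call" stays on its own because its preceding-word distribution shares no mass with the others. Requiring closeness in all three distances, rather than in an average, reflects the abstract's wording that phrases must be similar both syntactically (word contexts) and semantically (call-types).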