发明名称 Method and apparatus for recognizing multiword expressions
摘要 <p>Words of an input string are morphologically analyzed to identify their alternative base forms and parts of speech. The analyzed words of the input string are used to compile the input string into a first finite-state network. The first finite-state network is matched with a second finite-state network of multiword expressions to identify all subpaths of the first finite-state network that match one or more complete paths in the second finite-state network. Each matching subpath of the first finite-state network and path of the second finite-state network identify a multiword expression in the input string. The morphological analysis is performed without disambiguating words and without segmenting the input string into sentences in the input string to compile the first finite-state network with at least one path that identifies alternative base forms or parts of speech of a word in the input string.</p>
申请公布号 EP1429257(B1) 申请公布日期 2015.08.12
申请号 EP20030028508 申请日期 2003.12.10
申请人 XEROX CORPORATION 发明人 PRIVAULT, CAROLINE;POIRIER, HERVE
分类号 G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址