发明名称 Method for recognizing compound terms in a document
摘要 A method is provided for identifying compound terms in a document that is represented by a stream of tokens. The stream of document tokens is scanned for an initial term associated with a compound term and a compound term template is accessed when the initial term is identified. The template includes content, retention, and token specifications for the compound term. The stream of tokens is compared with the template, and when the stream matches the content specification of the template, a token representing the compound term is tagged according to the retention specification and added to the stream of tokens. The tagged token is stopped according to the retention specification represented by its tag.
申请公布号 US5842217(A) 申请公布日期 1998.11.24
申请号 US19960773194 申请日期 1996.12.30
申请人 INTEL CORPORATION 发明人 LIGHT, JOHN
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址