Title: System and method for determining affixes of words
Abstract: A computer system and a method for analyzing text in one or more electronic documents are disclosed. The computer system comprises one or more system interfaces and an affix process that determines one or more affixes of one or more words in one or more of the documents and provides the affixes to the system interfaces. The preferred embodiment of the invention may be used to build a domain-specific morphology lexicon for NLP applications so that they can recognize out-of-vocabulary words. The disclosed procedure exploits the fact that the processes of discovering prefixes and suffixes are not independent. Many words, especially in technical documents, have complex morphological structures, so knowledge of prefixes helps the discovery of suffixes and vice versa.
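Publication number: US2005055200(A1) Publication date: 2005.03.10
Application number: US20030658968 Filing date: 2003.09.09
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION Inventor: PARK YOUNGJA
Classification number: G06F17/27;(IPC1-7):G06F17/28 Main classification number: G06F17/27
Agency: Agent:
Principal claim:
Address: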
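
Illustrative sketch (not part of the patent record): the abstract states that prefix discovery and suffix discovery inform each other, but this record does not describe the procedure itself. The Python below is therefore only a minimal sketch of that general idea under stated assumptions, not the claimed method. The function name `discover_affixes`, the parameters `min_count`, `min_stem`, and `rounds`, the counting heuristic, and the toy word list are all hypothetical.

```python
from collections import Counter


def discover_affixes(words, min_count=2, min_stem=3, rounds=5):
    """Alternately discover prefixes and suffixes from a word list.

    Hypothetical heuristic: a string counts as an affix candidate when
    removing it from a word (after optionally stripping an already known
    affix from the other end) leaves a stem that is itself in the vocabulary.
    """
    vocab = set(words)
    prefixes, suffixes = set(), set()

    for _ in range(rounds):
        prefix_counts, suffix_counts = Counter(), Counter()

        for word in vocab:
            # Suffix candidates: use known prefixes to expose the stem.
            bodies = {word} | {word[len(p):] for p in prefixes if word.startswith(p)}
            for body in bodies:
                for i in range(min_stem, len(body)):
                    if body[:i] in vocab:
                        suffix_counts[body[i:]] += 1

            # Prefix candidates: use known suffixes to expose the stem.
            bodies = {word} | {word[:-len(s)] for s in suffixes if word.endswith(s)}
            for body in bodies:
                for i in range(1, len(body) - min_stem + 1):
                    if body[i:] in vocab:
                        prefix_counts[body[:i]] += 1

        new_prefixes = {p for p, c in prefix_counts.items() if c >= min_count}
        new_suffixes = {s for s, c in suffix_counts.items() if c >= min_count}

        # Stop once a round adds nothing new.
        if new_prefixes <= prefixes and new_suffixes <= suffixes:
            break
        prefixes |= new_prefixes
        suffixes |= new_suffixes

    return prefixes, suffixes


# Toy vocabulary (made up): "remark", "marked", "reboot", "booted" are absent,
# so the suffix "ed" is only reachable after the prefix "re" has been found.
words = ["load", "reload", "pack", "repack", "mark", "remarked", "boot", "rebooted"]
print(discover_affixes(words))            # ({'re'}, {'ed'})
print(discover_affixes(words, rounds=1))  # ({'re'}, set())
```

In this toy run, a single pass finds only the prefix "re"; the suffix "ed" appears in the second pass, once "re" can be stripped from "remarked" and "rebooted" to expose the stems "mark" and "boot". That is the kind of dependency between prefix and suffix discovery the abstract refers to.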