Title of Invention: Method for identifying word patterns in text
Abstract: A method for identifying word patterns in text operates in real time and is well suited to network and Internet use. The method involves receiving a stream of text, breaking the stream into a plurality of threads, tokenizing the words in each thread, and comparing the words to identified words in a semantic network. Recognized words are then examined, together with surrounding words in the text, to determine whether they are part of a word pattern. Word patterns are located at nodes in the semantic network in a hierarchical structure, and certain word patterns correspond to objects of the semantic network. When all word patterns involving a word are located, links are followed to the objects corresponding to those word patterns. Several nodes may point to a single object, but each object is represented only once in the semantic network. Objects may thus be identified in real time, as the text streams through the text analysis module.
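The matching flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: it assumes the hierarchical node structure behaves like a word-level trie, and the names `SemanticNetwork`, `Node`, `SemanticObject`, `add_pattern`, `find_patterns`, `tokenize`, and `split_into_threads` are all hypothetical. Treating each non-empty line as one "thread" is likewise an assumption, since the abstract does not say how the stream is divided.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, Iterable, Iterator, List, Optional, Tuple


@dataclass
class SemanticObject:
    """A concept in the network; stored once, however many patterns reach it."""
    name: str


@dataclass
class Node:
    """One word position in the pattern hierarchy (assumed here to be a trie)."""
    children: Dict[str, "Node"] = field(default_factory=dict)
    obj: Optional[SemanticObject] = None  # set when a word pattern ends at this node


def tokenize(text: str) -> List[str]:
    """Lower-case the text and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def split_into_threads(stream: Iterable[str]) -> Iterator[str]:
    """Assumed stand-in for the thread-splitting step: one thread per non-empty line."""
    for line in stream:
        if line.strip():
            yield line


class SemanticNetwork:
    def __init__(self) -> None:
        self.root = Node()
        self.objects: Dict[str, SemanticObject] = {}

    def add_pattern(self, pattern: str, object_name: str) -> None:
        """Register a word pattern and link its final node to a shared object."""
        obj = self.objects.setdefault(object_name, SemanticObject(object_name))
        node = self.root
        for word in tokenize(pattern):
            node = node.children.setdefault(word, Node())
        node.obj = obj

    def find_patterns(
        self, words: List[str]
    ) -> Iterator[Tuple[int, int, SemanticObject]]:
        """Yield (start, end, object) for every pattern found in the token list."""
        for start, word in enumerate(words):
            node = self.root.children.get(word)
            end = start
            # Follow the surrounding words down the hierarchy while they match.
            while node is not None:
                end += 1
                if node.obj is not None:
                    yield start, end, node.obj
                node = node.children.get(words[end]) if end < len(words) else None


network = SemanticNetwork()
network.add_pattern("new york", "NYC")
network.add_pattern("new york city", "NYC")
network.add_pattern("big apple", "NYC")

for thread in split_into_threads(["I love New York City", "", "The Big Apple"]):
    for start, end, obj in network.find_patterns(tokenize(thread)):
        print(start, end, obj.name)
```

In this sketch three pattern nodes link to the same `SemanticObject` instance, mirroring the statement that several nodes may point to a single object while each object is represented only once.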
Publication No.: US7426505(B2)  Publication Date: 2008.09.16
Application No.: US20010801340  Filing Date: 2001.03.07
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION  Inventors: SIMPSON DON M.; USEY, JR. ROBERT W.
Classification: G06F17/30;G06F7/00;G06F17/00;G06F17/27  Main Classification: G06F17/30
Agency:  Agent:
Principal Claim:
Address: