发明名称 Method for segmentation of text
摘要 A computerized method, and a corresponding apparatus, for segmentation of a stream of text elements comprising analyzed tokens into one or more initial clauses is disclosed. In the method, a predetermined number of consecutive text elements of said stream of text elements are scanned, starting from a given position. The predetermined number of consecutive text elements are compared with each pattern of a set of patterns for beginnings of initial clauses, and a beginning of an initial clause is identified in the predetermined number of consecutive text elements, if the predetermined number of consecutive text elements match one pattern of the set of patterns for beginnings of initial clauses. The given position is then moved at least one position forward and the scanning, comparison and identification is repeated.
申请公布号 US6810375(B1) 申请公布日期 2004.10.26
申请号 US20000584135 申请日期 2000.05.31
申请人 HAPAX LTD 发明人 EJERHED EVA INGEGERD
分类号 G06F17/27;(IPC1-7):G06F17/28 主分类号 G06F17/27
代理机构 代理人
主权项
地址
您可能感兴趣的专利