发明名称 Method and system for morphologizing text
摘要 A method and system for morphologizing written or printed texts, including Japanese texts are obtained in accordance to codes. The longest morphemes are divided one at a time from the characters in a sentence. This is achieved by forming the longest morpheme from the remaining characters in the sentence which is listed in a dictionary of valid morphemes and determining if it is conjunctive with the previously divided morpheme. To determine if a formed morpheme is conjunctive, associated pairs of front and back connection codes are retrieved. If a front connection code of one retrieved pair and a back connection code of a pair of connection codes of the previously divided morpheme are co-listed in a table of permissible relationships, the formed morpheme is conjunctive. If no character may be divided from the remaining characters in the sentence, a previously divided morpheme is redivided. If a morpheme can be divided and is conjunctive with the previous morpheme, a connection action, describing the relationship between the formed morpheme and the previously divided morpheme, is recorded. In response to certain connection actions, the next morpheme is divided by forming it from a single character of the remaining characters and testing it. After all of the morphemes are divided, a word graph is constructed from the morphemes in accordance with the connection actions relating adjacent morphemes.
申请公布号 US5268840(A) 申请公布日期 1993.12.07
申请号 US19920876665 申请日期 1992.04.30
申请人 INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE 发明人 CHANG, DAVID;LEE, BING-HWANG;TSAUR, JIAN-MING;LIN, HUAN-CHAN
分类号 G06F3/00;G06F3/01;G06F17/27;G06F17/28;(IPC1-7):G06F15/38;G06F1/00 主分类号 G06F3/00
代理机构 代理人
主权项
地址