Title of invention DEVICE AND METHOD FOR AUTOMATIC EXTRACTION OF POSITIONAL INFORMATION, AND RECORDING MEDIUM
Abstract PROBLEM TO BE SOLVED: To automatically extract, with high probability, the positional information carried by address and regional-name expressions contained in a document. SOLUTION: An input sentence is decomposed into morphemes (101), and each morpheme is compared with regional name expressions (102). Exception decision processing then determines whether the expression containing a matched morpheme is a formal address expression (103). If it is, the morphemes are successively compared with all address expressions of Japan and an address expression is extracted (104). If it is not, the morphemes are matched against exceptional address expressions and an exceptional address expression is extracted; the names of the prefecture, city, and district are then added to it to convert it into a formal address expression (108-110). If a positional information complementary word appears within six words of the extracted positional information, the positional information is output together with that complementary word (106).
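The flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything in it is an assumption made for illustration: the input is taken to be already split into morphemes, the dictionaries are tiny romanized toy data, the exception decision (103) is reduced to "does the morpheme start a known formal address", and "within six words" is read as up to six morphemes before or after the extracted span. The names `FORMAL_ADDRESSES`, `EXCEPTIONAL`, `COMPLEMENT_WORDS`, `match_formal`, and `extract_positions` are hypothetical.

```python
from typing import List, Tuple

# Toy dictionaries (illustrative assumptions, not data from the patent).
FORMAL_ADDRESSES = {
    ("Tokyo", "Shinjuku-ku", "Nishi-Shinjuku"),
    ("Kanagawa", "Yokosuka-shi", "Hikarinooka"),
}
# First components of the formal addresses act as the regional names of step 102.
PREFECTURES = {addr[0] for addr in FORMAL_ADDRESSES}
# Informal place name -> (prefecture, city, district) used to complete it.
EXCEPTIONAL = {"Kabukicho": ("Tokyo", "Shinjuku-ku", "Kabukicho")}
# Words that refine a position when they appear close to it.
COMPLEMENT_WORDS = {"near", "north", "station"}
WINDOW = 6  # assumed reading of "within six words"


def match_formal(morphemes: List[str], start: int) -> Tuple[str, ...]:
    """Step 104: longest prefix of any formal address found at `start`."""
    best: Tuple[str, ...] = ()
    for addr in FORMAL_ADDRESSES:
        k = 0
        while (k < len(addr) and start + k < len(morphemes)
               and morphemes[start + k] == addr[k]):
            k += 1
        if k > len(best):
            best = addr[:k]
    return best


def extract_positions(morphemes: List[str]) -> List[str]:
    """Return extracted positions, each with nearby complementary words."""
    results: List[str] = []
    i = 0
    while i < len(morphemes):
        m = morphemes[i]
        if m in PREFECTURES:
            # Steps 103/104: treated as a formal address; match successively.
            parts = list(match_formal(morphemes, i))
            end = i + len(parts)
        elif m in EXCEPTIONAL:
            # Steps 108-110: complete the exceptional expression.
            parts = list(EXCEPTIONAL[m])
            end = i + 1
        else:
            i += 1
            continue
        # Step 106: collect complementary words within WINDOW morphemes.
        lo = max(0, i - WINDOW)
        hi = min(len(morphemes), end + WINDOW)
        nearby = morphemes[lo:i] + morphemes[end:hi]
        extras = [w for w in nearby if w in COMPLEMENT_WORDS]
        results.append(" ".join(parts + extras))
        i = end
    return results
```

With this toy data, `extract_positions(["Meet", "in", "Kabukicho"])` returns `["Tokyo Shinjuku-ku Kabukicho"]`: the exceptional expression is completed with its prefecture and city. A real system would replace the toy dictionaries with a full address gazetteer and a Japanese morphological analyzer.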
Publication number JP2000250931(A) Publication date 2000.09.14
Application number JP19990053137 Filing date 1999.03.01
Applicant NIPPON TELEGR & TELEPH CORP <NTT> Inventor SUGIURA HIRONOBU;TSUCHIYA HIDEYUKI
Classification G06F17/21;G06F17/27;G06F17/30 Main classification G06F17/21
Agency Agent
Principal claim
Address