Abstract |
PROBLEM TO BE SOLVED: To automatically extract, with high probability, the positional information expressed by the addresses and regional names contained in a document. SOLUTION: An input sentence is decomposed into morphemes (101), and each morpheme is compared with regional name expressions (102). Exception decision processing is then carried out to decide whether the expression containing a morpheme is a formal address expression (103). If it is, the morphemes are successively compared with all address expressions of Japan and an address expression is extracted (104). If it is not, the morphemes are matched against exceptional address expressions and an exceptional address expression is extracted. The names of a prefecture, a city and a district are then added to the extracted exceptional address expression to convert it into a formal address expression (108-110). If a positional information complementary word appears within six words of the extracted positional information, the positional information is output together with that complementary word (106). |
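
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything named here is an assumption made for illustration: the tiny dictionaries, the helper names, whitespace splitting as a stand-in for morphological analysis (101), the "starts with a prefecture name" test as a stand-in for exception decision processing (103), and the reading of "within six words" as six tokens on either side of the extracted expression.

```python
# Hypothetical toy data; a real system would use a morphological analyzer
# and a dictionary covering all address expressions of Japan.
FORMAL_ADDRESSES = [
    ("Tokyo", "Chiyoda", "Marunouchi"),
    ("Tokyo", "Chuo", "Ginza"),
]
PREFECTURES = {addr[0] for addr in FORMAL_ADDRESSES}
# Exceptional (abbreviated) expression -> names to prepend (steps 108-110).
EXCEPTIONAL = {addr[-1]: addr[:-1] for addr in FORMAL_ADDRESSES}
COMPLEMENT_WORDS = {"north", "south", "east", "west", "near"}
WINDOW = 6  # assumed meaning of "within six words"


def match_formal(tokens, i):
    """Return the longest formal address starting at token i, or None (104)."""
    for addr in sorted(FORMAL_ADDRESSES, key=len, reverse=True):
        if tuple(tokens[i:i + len(addr)]) == addr:
            return addr
    return None


def find_complement(tokens, start, end):
    """Return the first complementary word near tokens[start:end], or None (106)."""
    lo = max(0, start - WINDOW)
    hi = min(len(tokens), end + WINDOW)
    for j in list(range(lo, start)) + list(range(end, hi)):
        if tokens[j] in COMPLEMENT_WORDS:
            return tokens[j]
    return None


def extract_positions(sentence):
    """Return (formal address, complementary word or None) pairs."""
    tokens = sentence.split()  # stand-in for morphological analysis (101)
    results = []
    i = 0
    while i < len(tokens):
        address, length = None, 1
        if tokens[i] in PREFECTURES:
            # Assumed exception decision (103): looks like a formal address.
            formal = match_formal(tokens, i)
            if formal:
                address, length = formal, len(formal)
        elif tokens[i] in EXCEPTIONAL:
            # Exceptional expression: prepend the missing names (108-110).
            address = EXCEPTIONAL[tokens[i]] + (tokens[i],)
        if address:
            complement = find_complement(tokens, i, i + length)
            results.append((" ".join(address), complement))
        i += length
    return results
```

For example, `extract_positions("The office is near Marunouchi")` treats "Marunouchi" as an exceptional expression, expands it to "Tokyo Chiyoda Marunouchi", and pairs it with the nearby word "near".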