摘要 |
<p>Extracting and clarifying ambiguities of addresses in documents includes constructing a plurality of context clouds, where each of the context clouds corresponds to place names and homographs therefor, determining place names in each of the documents using a dictionary of geographic names, clarifying ambiguities for the place names determined in each of the documents using the context clouds and residual documents, where the residual documents correspond to each of the documents having detected place names removed therefrom, and providing a relevance score for each remaining one of the detected place names. Using the context clouds and residual documents may include determining the relevance score of each of the place names and determining a relevance score of corresponding homographs for each of the place names.</p> |