Abstract |
<p>A method and apparatus are provided for identifying a set of bilingual term pairs in a bilingual webpage and, from that set, identifying a set of candidate patterns related to the layout of the bilingual term pairs in the webpage. From the set of candidate patterns, one or more best patterns can be selected based on features identified in the candidate patterns. Using the one or more selected patterns, a set of translation pair candidates can be extracted. The translation pair candidates can be verified to determine the likelihood that each candidate is an accurate translation. Based on the verification, some or all of the translation pair candidates can be discarded as incorrect translations, and the remaining candidates can be identified as correct translation pairs.</p> |
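The four stages summarized above (seed term pairs, candidate layout patterns, pattern selection, extraction plus verification) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the source: the function names are hypothetical, a "pattern" is reduced to the short separator string found between a known term and its translation, frequency is the only feature used to pick the best pattern, and verification is a crude length-ratio check standing in for whatever likelihood model is actually used.

```python
import re
from collections import Counter


def candidate_patterns(text, seed_pairs, max_gap=3):
    """Count the short separator strings found between known term pairs.

    seed_pairs: (first_term, second_term) tuples in the order they appear
    on the page. Hypothetical simplification: a pattern is just the
    separator text between the two terms.
    """
    patterns = Counter()
    for first, second in seed_pairs:
        regex = re.escape(first) + r"(.{1,%d}?)" % max_gap + re.escape(second)
        for match in re.finditer(regex, text):
            patterns[match.group(1)] += 1
    return patterns


def select_best_pattern(patterns):
    """Pick the most frequent separator (assumed selection feature)."""
    if not patterns:
        return None
    return patterns.most_common(1)[0][0]


def extract_candidates(text, separator):
    """Extract (Chinese run, English phrase) pairs joined by the separator."""
    regex = (
        r"([\u4e00-\u9fff]+)"
        + re.escape(separator)
        + r"([A-Za-z][A-Za-z ]*[A-Za-z])"
    )
    return re.findall(regex, text)


def verify(candidates, min_ratio=1.0, max_ratio=8.0):
    """Keep pairs whose English/Chinese length ratio looks plausible.

    Assumed stand-in for a real translation-likelihood score.
    """
    kept = []
    for zh, en in candidates:
        ratio = len(en) / len(zh)
        if min_ratio <= ratio <= max_ratio:
            kept.append((zh, en))
    return kept


def mine_translation_pairs(text, seed_pairs):
    """Run the four stages end to end on one page of text."""
    best = select_best_pattern(candidate_patterns(text, seed_pairs))
    if best is None:
        return []
    return verify(extract_candidates(text, best))


if __name__ == "__main__":
    page = (
        "机器学习 (machine learning)\n"
        "人工智能 (artificial intelligence)\n"
        "深度学习 (deep learning)\n"
        "见 (see the appendix for the full list of terms)\n"
    )
    seeds = [("人工智能", "artificial intelligence")]
    for pair in mine_translation_pairs(page, seeds):
        print(pair)
```

In this toy page, the single seed pair is enough to learn the " (" separator, which then surfaces pairs that were not in the seed set. The last line of the page matches the layout pattern but fails the length-ratio check, which is the role the verification step plays in the summary above.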