发明名称 Systems and methods for using anchor text as parallel corpora for cross-language information retrieval
摘要 A system performs cross-language query translations. The system receives a search query that includes terms in a first language and determines possible translations of the terms of the search query into a second language. The system also locates documents for use as parallel corpora to aid in the translation by: (1) locating documents in the first language that contain references that match the terms of the search query and identify documents in the second language; (2) locating documents in the first language that contain references that match the terms of the query and refer to other documents in the first language and identify documents in the second language that contain references to the other documents; or (3) locating documents in the first language that match the terms of the query and identify documents in the second language that contain references to the documents in the first language. The system may use the second language documents as parallel corpora to disambiguate among the possible translations of the terms of the search query and identify one of the possible translations as a likely translation of the search query into the second language.
申请公布号 US8190608(B1) 申请公布日期 2012.05.29
申请号 US201113174209 申请日期 2011.06.30
申请人 GRAVANO LUIS;HENZINGER MONIKA H.;GOOGLE INC. 发明人 GRAVANO LUIS;HENZINGER MONIKA H.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址