发明名称 MULTI-LANGUAGE DOCUMENT CLUSTERING
摘要 A technique can include identifying a collection of documents to be clustered. The collection of documents can include foreign language documents and base language documents. The foreign language documents can be translated into the base language at a base language translation module. Keywords in the base language documents and keywords in the translated foreign language documents can be determined at a document indexing module. The base language documents can be clustered with the foreign language documents in a common set of document clusters based on the determined keywords in the base language documents and the determined keywords in the translated foreign language documents. In response to a search query in a first language, a listing of search results can be provided that includes documents in the first language and another language from the a common document cluster.
申请公布号 US2014019451(A1) 申请公布日期 2014.01.16
申请号 US201213549624 申请日期 2012.07.16
申请人 BURYAK KIRILL;GOOGLE INC. 发明人 BURYAK KIRILL
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址