发明名称 IDENTIFYING AND LINKING SIMILAR PASSAGES IN A DIGITAL TEXT CORPUS
摘要 A corpus contains digital text from multiple documents. A passage mining engine identifies similar passages in the documents and stores data describing the similarities. The passage mining engine groups similar passages into groups based on degree of similarity or other criteria. The passage mining engine ranks the similar passages found in the text corpus based on quality or other criteria. A user interface is presented that includes hypertext links associated with the similar passages that allow a user to navigate the documents.
申请公布号 CA2691278(C) 申请公布日期 2013.09.24
申请号 CA20082691278 申请日期 2008.07.18
申请人 GOOGLE INC. 发明人 SCHILIT, WILLIAM N.;KOLAK, OKAN;MATHES, ADAM B.
分类号 G06F17/27;G06F17/30 主分类号 G06F17/27
代理机构 代理人
主权项
地址