发明名称 System for identifying and investigating shared and derived content
摘要 A computer readable storage medium with computer readable program code. The computer readable program code may be configured to index a plurality of documents into a document library stored in a database. The computer readable program code may be configured to receive a query document and to compare the query document with each indexed document to generate a score for each indexed document. The score represents a measure of similarity between the query document and each indexed document. The computer readable program code may be configured to display a query result based on the score for each indexed document.
申请公布号 US9256644(B1) 申请公布日期 2016.02.09
申请号 US201313839943 申请日期 2013.03.15
申请人 CA, Inc. 发明人 Spellward Peter C.;Snart Woodhouse Howard C.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Baker Botts L.L.P. 代理人 Baker Botts L.L.P.
主权项 1. A method of searching documents, comprising: indexing a plurality of documents into a document library stored in a database; receiving a query document; comparing, using a processor, the query document with each indexed document to generate a score for each indexed document, the score representing a measure of similarity between the query document and each indexed document; determining a commonality among particular ones of the indexed documents, other than the measure of similarity; displaying, at a user interface, a query result based on the score for each indexed document and based on the commonality; calculating hash values for each indexed document over each of a plurality of alternative windows; storing the hash values for each indexed document over each of the plurality of alternative windows; receiving user input selecting a particular one of the plurality of alternative windows; in response to receiving the selection of the particular one of the plurality of alternative windows, calculating hash values for the query document using the particular one of the plurality of alternative windows; and comparing the hash values for the query document with the hash values corresponding to the particular one of the plurality of alternative windows for each of the indexed documents to determine a measure of similarity between the query document and each indexed document.
地址 New York NY US