发明名称 SYSTEM AND METHODS FOR CLUSTERING LARGE DATABASE OF DOCUMENTS
摘要 <p>In a computerized system, a method of organizing a plurality of documents within a dataset of documents wherein a plurality of documents within a class of the dataset each includes one or more citations to one or more other documents, comprising creating a set of fingerprints for each respective document in the class, wherein each fingerprint comprises one or more citations contained in the respective document, creating a plurality of clusters for the dataset based on the sets of fingerprints for the documents in the class, assigning each respective document in the dataset to one or more of the clusters, creating a descriptive label for each respective cluster, and presenting one or more of the labeled clusters to a user of the computerized system or providing the user with access to documents in at least one cluster.</p>
申请公布号 WO2009018223(A1) 申请公布日期 2009.02.05
申请号 WO2008US71375 申请日期 2008.07.28
申请人 SPARKIP, INC.;DORIE, VINCENT, JOSEPH;GIANNELLA, ERIC, R. 发明人 DORIE, VINCENT, JOSEPH;GIANNELLA, ERIC, R.
分类号 G06F7/00 主分类号 G06F7/00
代理机构 代理人
主权项
地址