发明名称 Computer-implemented system and method for generating a reference set via seed documents
摘要 A computer-implemented system and method for generating a reference set via seed documents is provided. A collection of documents is obtained. One or more seed documents are identified. The seed documents are compared with the document collection and those documents that are similar to the seed documents are identified as reference set candidates. A size threshold is applied to the reference set candidates, which are grouped as the reference set when the size threshold is satisfied.
申请公布号 US9275344(B2) 申请公布日期 2016.03.01
申请号 US201314108248 申请日期 2013.12.16
申请人 FTI Consulting, Inc. 发明人 Knight William C.;McNee Sean M.
分类号 G06F17/30;G06N99/00 主分类号 G06F17/30
代理机构 代理人 Inouye Patrick J. S.;Wittman Krista A.
主权项 1. A computer-implemented method for generating a reference set via seed documents, comprising: obtaining a collection of documents; identifying one or more seed documents related to an issue for which a reference set of documents is needed; selecting the seed documents from at least one of a current document set and a previously defined document set; comparing the seed documents to the document collection and identifying those documents similar to the seed documents as reference set candidates; applying a size threshold to the reference set candidates; and grouping the reference set candidates as the reference set when the size threshold is satisfied, wherein the reference set is reduced by identifying the reference set candidates that are closely related and common reference set candidates and by removing the closely related reference set candidates and the common reference set candidates.
地址 Annapolis MD US