发明名称 SYSTEMS AND METHODS FOR CONDUCTING AND TERMINATING A TECHNOLOGY-ASSISTED REVIEW
摘要 Systems and methods are provided for classifying electronic information and terminating a classification process which utilizes Technology-Assisted Review (“TAR”) techniques. In certain embodiments, the TAR process, which is an iterative process, is terminated based upon one more stopping criteria. In certain embodiments, use of the stopping criteria ensures that the TAR process will reliably achieve a level of quality (e.g., recall) with a certain probability. In certain embodiments, the TAR process is terminated when it independently identifies a target set of documents. In certain embodiments, the TAR process is terminated based upon whether the ratio of the slope of the TAR process's gain curve before an inflection point to the slope of the TAR process' gain curve after the inflection point exceeds a threshold. In certain embodiments, the TAR process is terminated when a review budget and slope ratio of the gain curve each exceed a respective threshold.
申请公布号 US2016371260(A1) 申请公布日期 2016.12.22
申请号 US201615186366 申请日期 2016.06.17
申请人 Cormack Gordon V.;Grossman Maura R. 发明人 Cormack Gordon V.;Grossman Maura R.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A system for terminating a classification process, the system comprising: at least one computing device having a processor and physical memory, the physical memory storing instructions that cause the processor to: execute the classification process, wherein the classification process utilizes an iterative search strategy to classify documents in a document collection and the documents are stored on a non-transitory storage medium;select a gain curve slope ratio threshold;compute points on a gain curve using a selected set of documents in the document collection and results from the classification process;detect an inflection point in the gain curve;determine a slope ratio for the detected inflection point using a slope of the gain curve before the detected inflection point, and a slope of the gain curve after the detected inflection point; andterminate the classification process based upon a determination that the slope ratio for the detected inflection point exceeds the selected slope ratio threshold.
地址 Waterloo CA