发明名称 DOCUMENT SORTING SYSTEM, DOCUMENT SORTING METHOD, AND DOCUMENT SORTING PROGRAM
摘要 It is possible to analyze digitized document information gathered to be provided as evidence in a legal action and to classify the document information to be easily accessible in the legal action. A document classification system includes a keyword database, a related term database, a first classification unit which extracts a document including a keyword recorded in the keyword database from document information and attaches a specific classification mark to the extracted document based on keyword-corresponding information, and a second classification unit which extracts a document including a related term recorded in the related term database from document information, to which the specific classification mark is not attached in the first classification unit, calculates a score based on an evaluated value of the related term included in the extracted document and the number of related terms, and attaches a predetermined classification mark to a document, for which the score exceeds a given value, among the documents including the related term based on the score and the related term-corresponding information.
申请公布号 US2016098478(A1) 申请公布日期 2016.04.07
申请号 US201514866016 申请日期 2015.09.25
申请人 UBIC, Inc. 发明人 Morimoto Masahiro;Shirai Yoshikatsu;Takeda Hideki;Hasuko Kazumi
分类号 G06F17/30;G06Q50/18 主分类号 G06F17/30
代理机构 代理人
主权项 1. A document classification system comprising a processing apparatus configured to: a document extraction circuitry which extracts a predetermined number of documents as a subject to be classified by a user by sampling the predetermined number of documents from document information; a display circuitry which displays a document display screen presenting the user the extracted documents and a classification mark which is an identifier to be used when classifying the documents; a classification mark accepting circuitry which accepts the classification mark attached to the displayed document by the user; a database which records words which appear in common in the documents to which the classification mark is attached; and a score calculation circuitry which calculates a score which is the evaluation of the relation between the document and the classification mark based on the amount of information which is exhibited by the recorded words in the document.
地址 Tokyo JP