发明名称 Document sorting system, document sorting method, and document sorting program
摘要 It is possible to analyze digitized document information gathered to be provided as evidence in a legal action and to classify the document information to be easily accessible in the legal action. A document classification system includes a keyword database, a related term database, a first classification unit which extracts a document including a keyword recorded in the keyword database from document information and attaches a specific classification mark to the extracted document based on keyword-corresponding information, and a second classification unit which extracts a document including a related term recorded in the related term database from document information, to which the specific classification mark is not attached in the first classification unit, calculates a score based on an evaluated value of the related term included in the extracted document and the number of related terms, and attaches a predetermined classification mark to a document, for which the score exceeds a given value, among the documents including the related term based on the score and the related term-corresponding information.
申请公布号 US9171074(B2) 申请公布日期 2015.10.27
申请号 US201314346364 申请日期 2013.04.01
申请人 UBIC, Inc. 发明人 Morimoto Masahiro;Shirai Yoshikatsu;Takeda Hideki;Hasuko Kazumi
分类号 G06F17/30;G06Q10/10;G06F7/36;G06Q50/18 主分类号 G06F17/30
代理机构 Reed Smith LLP 代理人 Kaufman Marc S.;Grewal Amardeep S.;Reed Smith LLP
主权项 1. A document classification system which acquires digital information recorded in a plurality of computers or servers, analyzes document information having a plurality of documents included in the acquired digital information, and attaches a classification mark representing the degree of association with a legal action to the document for ease of use in the legal action, the document classification system comprising: a keyword database which records a specific classification mark, a keyword described in a document, to which the specific classification mark is attached, and keyword-corresponding information representing the correspondence relationship between the specific classification mark and the keyword; a related term database which records a predetermined classification mark, a related term having words with a high appearance frequency in the document, to which the predetermined classification mark is attached, and related term-corresponding information representing the correspondence relationship between the predetermined classification mark and the related term; a first classification unit which extracts a document including the keyword recorded in the keyword database from the document information and attaches the specific classification mark to the extracted document based on the keyword-corresponding information; a second classification unit which extracts a document including the related term recorded in the related term database from the document information, to which the specific classification mark is not attached in the first classification unit, calculates a score based on an evaluated value of the related term included in the extracted document and the number of related terms, and attaches the predetermined classification mark to a document, for which the score exceeds a given value, among the documents including the related term based on the score and the related term-corresponding information; and a classification mark accepting unit which accepts the attachment of a classification mark from a user to a document, to which the predetermined classification mark is not attached in the second classification unit.
地址 Tokyo JP