Title of invention |
Document sorting system, document sorting method, and document sorting program |
Abstract |
It is possible to analyze digitized document information gathered to be provided as evidence in a legal action and to classify the document information to be easily accessible in the legal action. A document classification system includes a keyword database, a related term database, a first classification unit which extracts a document including a keyword recorded in the keyword database from document information and attaches a specific classification mark to the extracted document based on keyword-corresponding information, and a second classification unit which extracts a document including a related term recorded in the related term database from document information, to which the specific classification mark is not attached in the first classification unit, calculates a score based on an evaluated value of the related term included in the extracted document and the number of related terms, and attaches a predetermined classification mark to a document, for which the score exceeds a given value, among the documents including the related term based on the score and the related term-corresponding information. |
Publication number |
US9171074(B2) |
Publication date |
2015.10.27 |
Application number |
US201314346364 |
Filing date |
2013.04.01 |
Applicant |
UBIC, Inc. |
Inventors |
Morimoto Masahiro;Shirai Yoshikatsu;Takeda Hideki;Hasuko Kazumi |
Classification numbers |
G06F17/30;G06Q10/10;G06F7/36;G06Q50/18 |
Main classification number |
G06F17/30 |
Agency |
Reed Smith LLP |
Agents |
Kaufman Marc S.;Grewal Amardeep S.;Reed Smith LLP |
Principal claim |
1. A document classification system which acquires digital information recorded in a plurality of computers or servers, analyzes document information having a plurality of documents included in the acquired digital information, and attaches a classification mark representing the degree of association with a legal action to the document for ease of use in the legal action, the document classification system comprising:
a keyword database which records a specific classification mark, a keyword described in a document, to which the specific classification mark is attached, and keyword-corresponding information representing the correspondence relationship between the specific classification mark and the keyword;
a related term database which records a predetermined classification mark, a related term having words with a high appearance frequency in the document, to which the predetermined classification mark is attached, and related term-corresponding information representing the correspondence relationship between the predetermined classification mark and the related term;
a first classification unit which extracts a document including the keyword recorded in the keyword database from the document information and attaches the specific classification mark to the extracted document based on the keyword-corresponding information;
a second classification unit which extracts a document including the related term recorded in the related term database from the document information, to which the specific classification mark is not attached in the first classification unit, calculates a score based on an evaluated value of the related term included in the extracted document and the number of related terms, and attaches the predetermined classification mark to a document, for which the score exceeds a given value, among the documents including the related term based on the score and the related term-corresponding information; and
a classification mark accepting unit which accepts the attachment of a classification mark from a user to a document, to which the predetermined classification mark is not attached in the second classification unit. |
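The three-stage flow recited in the claim (keyword match, then related-term scoring against a threshold, then manual marking by a user) can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the patent: the names `Document`, `first_classification`, `score`, `second_classification`, and `accept_user_mark`, the dictionary layout of the two databases, plain substring matching, and in particular the scoring formula, which is assumed here to be the sum of each related term's evaluated value multiplied by its occurrence count. The claim states only that the score is based on the evaluated values and the number of related terms.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Document:
    text: str
    mark: Optional[str] = None  # classification mark; None until one is attached


def first_classification(docs, keyword_db):
    """Attach the specific mark to each document containing a recorded keyword.

    keyword_db maps keyword -> specific classification mark (assumed layout).
    """
    for doc in docs:
        for keyword, mark in keyword_db.items():
            if keyword in doc.text:
                doc.mark = mark
                break


def score(doc, related_terms):
    """Assumed formula: sum of (evaluated value * occurrence count) per term."""
    return sum(value * doc.text.count(term) for term, value in related_terms.items())


def second_classification(docs, related_db, threshold):
    """Score documents left unmarked by the first stage; mark those above threshold.

    related_db maps predetermined mark -> {related term: evaluated value}
    (assumed layout).
    """
    for doc in docs:
        if doc.mark is not None:
            continue  # already handled by the first classification unit
        for mark, terms in related_db.items():
            contains_term = any(term in doc.text for term in terms)
            if contains_term and score(doc, terms) > threshold:
                doc.mark = mark
                break


def accept_user_mark(doc, mark):
    """Accept a user-supplied mark for a document that is still unmarked."""
    if doc.mark is None:
        doc.mark = mark


# Hypothetical data, invented for this example only.
keyword_db = {"infringement": "A"}
related_db = {"B": {"license": 1.5, "royalty": 2.0}}
docs = [
    Document("notice of infringement"),
    Document("license and royalty terms, royalty due"),
    Document("lunch menu"),
]

first_classification(docs, keyword_db)
second_classification(docs, related_db, threshold=3.0)
accept_user_mark(docs[2], "C")  # reviewer marks the remaining document
marks = [d.mark for d in docs]
```

In this sketch the second document scores 1.5 × 1 + 2.0 × 2 = 5.5, which exceeds the assumed threshold of 3.0, so it receives mark "B"; the third document contains no related term and is left for the user.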
Address |
Tokyo JP |