发明名称 METHOD FOR THE AUTOMATED ANALYSIS OF TEXT DOCUMENTS
摘要 The invention concerns the automated analysis of text documents. Its use in development of new and improvement of the existing systems of verification of text documents for availability of phrases or parts of the text from other documents in them allows to expand the arsenal of technical facilities at the expense of creation of the comparatively quick and universal method, which allows to reveal expressions, phrases or even text fragments from other documents in the document. The method of automated analysis of text documents consists in the following: all electronic files of reference documents are transformed into the preset format, distinguishing meaningful fragments, called closures, in each; transformed electronic files of reference documents are stored in the database; each electronic file of the analyzed document is transferred into the preset format; coincidence of the distinguished closures in the electronic file of the analyzed document with the distinguished closures in the electronic files of reference documents is revealed; relative number of closures in the electronic file of the analyzed document is calculated, which coincided with the corresponding closures of each of the electronic files of reference documents; found relative numbers of coincidences are compared with the preset threshold value for revelation of availability of the text fragments from any of reference documents in the electronic file of the analyzed document.
申请公布号 EP2782023(A2) 申请公布日期 2014.09.24
申请号 EP20120849920 申请日期 2012.11.16
申请人 OBSHCHESTVO S OGRANICHENNOY OTVETSTVENNOST'YU "TSENTR INNOVATSIY NATAL'I KASPERSKAYA" 发明人 LAPSHIN, VLADIMIR ANATOL'YEVICH;PSHEKHOTSKAYA, YEKATERINA ALEKSANDROVNA;PEROV, DMITRIY VSEVOLODOVICH
分类号 G06F17/20;G06F17/22 主分类号 G06F17/20
代理机构 代理人
主权项
地址