Title of invention: METHOD FOR AUTOMATED ANALYSIS OF TEXT DOCUMENTS
Abstract: FIELD: information technology. SUBSTANCE: all electronic files of reference documents are first converted to a predetermined format, with comprehensible fragments, referred to as clauses, selected in each document, and the converted electronic files of reference documents are stored in a database. Each electronic file of an analysed document is converted to the same predetermined format. Matches are detected between the clauses selected in the electronic file of the analysed document and the clauses selected in the electronic files of the reference documents. For each electronic file of a reference document, the relative number of clauses in the electronic file of the analysed document that match corresponding clauses of that reference file is counted. The relative number of matches found is then compared with a predetermined threshold value in order to determine whether text excerpts of any of the reference documents are present in the electronic file of the analysed document. EFFECT: a broader range of available technical means, achieved by a relatively fast and universal method that makes it possible to detect, in a document, expressions, phrases or even text excerpts taken from other documents. 5 cl, 2 dwg
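The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how clauses are delimited, what the predetermined format is, or how the threshold is applied, so the choices below are assumptions. Clauses are taken to be fragments split on sentence punctuation, the "predetermined format" is approximated by lowercasing and stripping punctuation, the relative number is taken as matched clauses divided by all clauses of the analysed document, and a reference is flagged when that ratio is at or above the threshold. The function names (split_clauses, build_reference_db, match_ratios, detect_excerpts) are hypothetical.

```python
import re


def split_clauses(text):
    """Convert text to a normalized form and split it into clauses.

    Assumption: a clause ends at '.', '!', '?', ';' or a line break.
    """
    clauses = []
    for part in re.split(r"[.!?;\n]+", text.lower()):
        # Keep only word characters so punctuation and spacing do not matter.
        normalized = " ".join(re.findall(r"\w+", part))
        if normalized:
            clauses.append(normalized)
    return clauses


def build_reference_db(references):
    """Store the clause set of every reference document (an in-memory 'database')."""
    return {name: set(split_clauses(text)) for name, text in references.items()}


def match_ratios(analysed_text, reference_db):
    """Return, per reference, the share of analysed clauses found in that reference."""
    clauses = split_clauses(analysed_text)
    if not clauses:
        return {name: 0.0 for name in reference_db}
    return {
        name: sum(1 for clause in clauses if clause in ref_clauses) / len(clauses)
        for name, ref_clauses in reference_db.items()
    }


def detect_excerpts(analysed_text, reference_db, threshold=0.5):
    """List references whose match ratio reaches the (assumed) threshold."""
    ratios = match_ratios(analysed_text, reference_db)
    return sorted(name for name, ratio in ratios.items() if ratio >= threshold)


if __name__ == "__main__":
    db = build_reference_db({
        "contract": "The supplier shall deliver goods. Payment is due in 30 days.",
        "memo": "Lunch is at noon. Bring your badge.",
    })
    sample = "Hello team; payment is due in 30 days! The supplier shall deliver goods. See you soon."
    print(match_ratios(sample, db))
    print(detect_excerpts(sample, db, threshold=0.5))
```

Storing each reference as a set of normalized clauses keeps every lookup close to constant time, which is one plausible way to obtain the "relatively fast" behaviour the abstract mentions; exact matching of normalized clauses is itself a simplifying assumption.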
Publication number: RU2474870(C1) Publication date: 2013.02.10
Application number: RU20110146888 Filing date: 2011.11.18
Applicant: OBSHCHESTVO S OGRANICHENNOJ OTVETSTVENNOST'JU "TSENTR INNOVATSIJ NATAL'I KASPERSKOJ" Inventors: LAPSHIN VLADIMIR ANATOL'EVICH; PSHEKHOTSKAJA EKATERINA ALEKSANDROVNA; PEROV DMITRIJ VSEVOLODOVICH
Classification: G06F17/00 Main classification: G06F17/00
Agency: Agent:
Main claim:
Address: