Title of invention: ITERATIVE FILLING OF ELECTRONIC GLOSSARY
Abstract: FIELD: physics, computer engineering. SUBSTANCE: the invention relates to methods of filling electronic glossaries, that is, lists of terms with tags. The method of filling a glossary from a training set of electronic documents using a computer (personal computer, server, etc.) includes forming a training subset made up of the electronic documents whose text contains glossary terms. Characteristic selection criteria are applied to the words that occur in the training subset. Words selected by these criteria are assigned tags and, optionally, weights. The selected words are added to the glossary with their corresponding tags (and weights). EFFECT: more efficient use of electronic glossaries in text analysis tasks, achieved by enabling the assignment of intelligent weights to terms and the automatic filling of glossaries from a training set of texts. 16 claims, 13 drawings
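The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented method. The abstract does not state the selection criteria, the tagging rule, or the weighting formula, so the sketch assumes a document-frequency threshold (`min_doc_freq`), assigns each new word the tag it co-occurs with most often, and uses the share of training-subset documents as the weight. The names `fill_glossary`, `tokenize`, `min_doc_freq`, and `max_rounds` are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase alphabetic words (a deliberately crude tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def fill_glossary(glossary, documents, min_doc_freq=2, max_rounds=3):
    """Return a copy of `glossary` (word -> (tag, weight)) extended from `documents`.

    Assumptions, not taken from the patent: the selection criterion is a
    document-frequency threshold, and the weight is the fraction of
    training-subset documents in which the word co-occurs with its tag.
    """
    glossary = dict(glossary)  # do not mutate the caller's glossary
    doc_words = [set(tokenize(d)) for d in documents]

    for _ in range(max_rounds):
        # Training subset: documents whose text contains at least one glossary term.
        subset = [words for words in doc_words
                  if any(w in glossary for w in words)]
        if not subset:
            break

        # For every non-glossary word, count the tags it co-occurs with.
        tag_counts = {}
        for words in subset:
            tags = {glossary[w][0] for w in words if w in glossary}
            for w in words:
                if w not in glossary:
                    tag_counts.setdefault(w, Counter()).update(tags)

        # Apply the (assumed) selection criterion, then assign tag and weight.
        added = False
        for word, counts in sorted(tag_counts.items()):
            tag, freq = counts.most_common(1)[0]
            if freq >= min_doc_freq:
                glossary[word] = (tag, freq / len(subset))
                added = True
        if not added:
            break  # nothing new was selected, so further rounds would not change anything

    return glossary


seed = {"invoice": ("finance", 1.0)}
docs = [
    "invoice payment overdue",
    "invoice payment received",
    "football match tonight",
]
result = fill_glossary(seed, docs)
```

Starting from the single seed term, the word "payment" appears in both documents of the training subset and is added with the tag "finance". The third document contains no glossary term, so it never enters the training subset. A practical system would also need stop-word filtering and a more discriminating criterion than raw frequency.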
Publication number: RU2549118(C2); Publication date: 2015.04.20
Application number: RU20130123795; Filing date: 2013.05.24
Applicant: OBSHCHESTVO S OGRANICHENNOJ OTVETSTVENNOST'JU "ABI INFOPOISK"; Inventors: BOGDANOVA DAR'JA NIKOLAEVNA; KOPYLOV NIKOLAJ JUR'EVICH
Classification: G06F17/20; Main classification: G06F17/20
Agency: Agent:
Main claim:
Address: