发明名称 NORMALIZING NON-NUMERIC FEATURES OF FILES
摘要 Embodiments include method, computer program products and apparatuses for normalizing non-numeric features of files and corresponding apparatus Aspects include segmenting at least one pair of positive instances of a non-numeric feature of a file into a number of tokens and comparing the tokens in the at least one pair of positive instances to obtain matching tokens. Aspects also include calculating weights of their matching the file, for the matching tokens, and storing the tokens and their weights in a token base.
申请公布号 US2016154793(A1) 申请公布日期 2016.06.02
申请号 US201514967314 申请日期 2015.12.13
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 LI CHANG SHENG;MENG FAN JING;STERN EDITH HELEN;WANG HAN;XU JING MIN;YANG LIN;ZHUO XUEJUN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer-implemented method for normalizing non-numeric features of files, comprising: segmenting at least one pair of positive instances of a non-numeric feature of a file into a number of tokens; comparing the tokens in the at least one pair of positive instances to obtain matching tokens; and for each of the matching tokens, calculating weights of their matching the file, and storing the tokens and their weights in a token base.
地址 Armonk NY US