发明名称 PROBABILISTIC SURFACING OF POTENTIALLY SENSITIVE IDENTIFIERS
摘要 Probabilistic surfacing of potentially sensitive identifiers is provided. In one embodiment of the present invention, a method of and computer program product for surfacing of potentially sensitive identifiers are provided. An input string is read. The input string has a length. The input string is divided into a plurality of tokens. Each of the tokens has a predetermined length. A score is determined for each of the plurality of tokens. A composite score is determined based on the scores of each of the plurality of tokens. Whether the input string comprises an identifier is determined by comparing the composite score to a predetermined threshold.
申请公布号 US2016117522(A1) 申请公布日期 2016.04.28
申请号 US201414521288 申请日期 2014.10.22
申请人 International Business Machines Corporation 发明人 Bhagwan Varun;Chiticariu Laura;Gruhl Daniel F.
分类号 G06F21/62;G06N7/00;G06N99/00 主分类号 G06F21/62
代理机构 代理人
主权项 1. A method comprising: reading an input string, the input string having a length; dividing the input string into a plurality of tokens, each of the tokens having a predetermined length; determining a score for each of the plurality of tokens; determining a composite score based on the scores of each of the plurality of tokens; and determining whether the input string comprises an identifier by comparing the composite score to a predetermined threshold.
地址 Armonk NY US