Title of Invention |
PROBABILISTIC SURFACING OF POTENTIALLY SENSITIVE IDENTIFIERS |
Abstract |
Probabilistic surfacing of potentially sensitive identifiers is provided. In one embodiment of the present invention, a method of and computer program product for surfacing of potentially sensitive identifiers are provided. An input string is read. The input string has a length. The input string is divided into a plurality of tokens. Each of the tokens has a predetermined length. A score is determined for each of the plurality of tokens. A composite score is determined based on the scores of each of the plurality of tokens. Whether the input string comprises an identifier is determined by comparing the composite score to a predetermined threshold. |
Publication Number |
US2016117522(A1) |
Publication Date |
2016.04.28 |
Application Number |
US201414521288 |
Filing Date |
2014.10.22 |
Applicant |
International Business Machines Corporation |
Inventors |
Bhagwan Varun; Chiticariu Laura; Gruhl Daniel F. |
Classification Codes |
G06F21/62; G06N7/00; G06N99/00 |
Main Classification Code |
G06F21/62 |
Patent Agency |
|
Patent Agent |
|
Principal Claim |
1. A method comprising:
reading an input string, the input string having a length; dividing the input string into a plurality of tokens, each of the tokens having a predetermined length; determining a score for each of the plurality of tokens; determining a composite score based on the scores of each of the plurality of tokens; and determining whether the input string comprises an identifier by comparing the composite score to a predetermined threshold. |
Address |
Armonk NY US |
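
The steps recited in the principal claim (read a string, divide it into fixed-length tokens, score each token, combine the scores, compare against a threshold) can be illustrated with a minimal sketch. This is not the patented implementation. The record does not say how tokens are scored or combined, so the sketch assumes overlapping character bigrams as tokens, an add-one-smoothed negative log frequency learned from a small word list as the per-token score, and the arithmetic mean as the composite. All function names, the word list, and the threshold are hypothetical.

```python
import math
from collections import Counter


def tokenize(s, n=2):
    """Divide s into overlapping tokens of fixed length n (assumed: sliding window)."""
    return [s[i:i + n] for i in range(len(s) - n + 1)]


def train_counts(corpus, n=2):
    """Count how often each token occurs in a list of ordinary words."""
    counts = Counter()
    for word in corpus:
        counts.update(tokenize(word.lower(), n))
    return counts


def token_score(token, counts, total, vocab):
    """Assumed scoring rule: smoothed negative log frequency.

    Tokens that are rare or unseen in ordinary text get a higher score.
    """
    return -math.log((counts[token] + 1) / (total + vocab))


def composite_score(s, counts, n=2):
    """Assumed composite: the mean of the per-token scores."""
    tokens = tokenize(s.lower(), n)
    if not tokens:
        return 0.0  # string shorter than one token: nothing to score
    total = sum(counts.values())
    vocab = len(counts) + 1  # +1 reserves probability mass for unseen tokens
    return sum(token_score(t, counts, total, vocab) for t in tokens) / len(tokens)


def looks_like_identifier(s, counts, threshold, n=2):
    """Compare the composite score to a predetermined threshold."""
    return composite_score(s, counts, n) > threshold


# Hypothetical training data, for illustration only.
WORDS = ["patient", "record", "address", "information",
         "sensitive", "string", "identifier"]
COUNTS = train_counts(WORDS)
```

In this sketch a string such as "x9q7zk2" contains only bigrams absent from the word list, so its composite score is higher than that of an ordinary word such as "patient". A usable threshold would have to be tuned on representative data; none is given in the record.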