发明名称 |
SENSITIVE TEXT DETECTING METHOD AND APPARATUS |
摘要 |
A sensitive text detecting method and apparatus relate to the field of information processing technologies. The method includes: acquiring a feature text string of a currently detected text (101); detecting the feature text string according to a finite-state machine established in advance, so as to obtain frequency of occurrence of each keyword in the feature text string (102); calculating, for each keyword category of multiple keyword categories, a weight of the keyword category in the text based on the frequency of occurrence of each keyword corresponding to the keyword category and a preset weight of each keyword (103); and determining that the text is a sensitive text when the weight of at least one keyword category is greater than a preset threshold (104). |
申请公布号 |
US2016350282(A1) |
申请公布日期 |
2016.12.01 |
申请号 |
US201515110541 |
申请日期 |
2015.02.11 |
申请人 |
TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED |
发明人 |
ZHANG Honglin |
分类号 |
G06F17/27;G06F17/30;G06F17/22 |
主分类号 |
G06F17/27 |
代理机构 |
|
代理人 |
|
主权项 |
1. A sensitive text detecting method, comprising:
acquiring a feature text string of a currently detected text; detecting the feature text string according to a finite-state machine established in advance, so as to obtain frequency of occurrence of each keyword in the feature text string, the finite-state machine comprising multiple keywords; calculating, for each keyword category of multiple keyword categories, a weight of the keyword category in the text based on the frequency of occurrence of each keyword corresponding to the keyword category and a preset weight of each keyword; and determining that the text is a sensitive text when the weight of at least one keyword category is greater than a preset threshold. |
地址 |
Shenzhen, Guangdong CN |