Title of invention: SENSITIVE TEXT DETECTING METHOD AND APPARATUS
Abstract: A sensitive text detecting method and apparatus relate to the field of information processing technologies. The method includes: acquiring a feature text string of a currently detected text (101); detecting the feature text string according to a finite-state machine established in advance, so as to obtain the frequency of occurrence of each keyword in the feature text string (102); calculating, for each keyword category of multiple keyword categories, a weight of the keyword category in the text based on the frequency of occurrence of each keyword corresponding to the keyword category and a preset weight of each keyword (103); and determining that the text is a sensitive text when the weight of at least one keyword category is greater than a preset threshold (104).
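Step 102 can be illustrated with a short sketch. The record does not say how the finite-state machine is constructed, so the code below assumes an Aho-Corasick style automaton, a common choice for matching many keywords in a single pass. The function names `build_automaton` and `count_keywords` and the sample keywords are hypothetical and do not come from the patent.

```python
from collections import Counter, deque


def build_automaton(keywords):
    """Build an Aho-Corasick style automaton (an assumed construction).

    State 0 is the root. goto[s] maps a character to the next state,
    fail[s] is the fallback state on a mismatch, and out[s] lists the
    keywords that end at state s.
    """
    goto, fail, out = [{}], [0], [[]]
    for word in keywords:
        state = 0
        for ch in word:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].append(word)

    # Breadth-first pass: compute failure links and merge outputs.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def count_keywords(text, automaton):
    """Scan the text once and count every keyword occurrence."""
    goto, fail, out = automaton
    counts = Counter()
    state = 0
    for ch in text:
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        counts.update(out[state])
    return counts


# Hypothetical keyword list, for illustration only.
automaton = build_automaton(["he", "she", "hers", "his"])
frequencies = count_keywords("ushers", automaton)
```

Because the automaton is built once in advance, each detected text is scanned in time proportional to its length plus the number of matches, regardless of how many keywords are loaded.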
Publication number: US2016350282(A1) Publication date: 2016.12.01
Application number: US201515110541 Filing date: 2015.02.11
Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED Inventor: ZHANG Honglin
Classification: G06F17/27;G06F17/30;G06F17/22 Main classification: G06F17/27
Agency: Agent:
Principal claim: 1. A sensitive text detecting method, comprising: acquiring a feature text string of a currently detected text; detecting the feature text string according to a finite-state machine established in advance, so as to obtain frequency of occurrence of each keyword in the feature text string, the finite-state machine comprising multiple keywords; calculating, for each keyword category of multiple keyword categories, a weight of the keyword category in the text based on the frequency of occurrence of each keyword corresponding to the keyword category and a preset weight of each keyword; and determining that the text is a sensitive text when the weight of at least one keyword category is greater than a preset threshold.
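The last two steps of the claim (category weighting and the threshold decision) can be sketched as follows. The claim only says that the category weight is "based on" keyword frequencies and preset keyword weights, so the weighted sum used here is an assumption, not the claimed formula. The names `category_weights` and `is_sensitive`, and all sample categories, keywords, weights, and thresholds, are hypothetical.

```python
def category_weights(frequencies, categories, keyword_weights):
    """Compute one weight per keyword category.

    Assumption: the category weight is the sum over its keywords of
    (frequency of occurrence * preset keyword weight).
    """
    return {
        category: sum(
            frequencies.get(keyword, 0) * keyword_weights.get(keyword, 0)
            for keyword in keywords
        )
        for category, keywords in categories.items()
    }


def is_sensitive(frequencies, categories, keyword_weights, threshold):
    """Flag the text when at least one category weight exceeds the threshold."""
    weights = category_weights(frequencies, categories, keyword_weights)
    return any(weight > threshold for weight in weights.values())


# Hypothetical inputs, for illustration only.
frequencies = {"casino": 2, "bet": 1, "pill": 1}
categories = {"gambling": ["casino", "bet"], "drugs": ["pill"]}
keyword_weights = {"casino": 3, "bet": 2, "pill": 4}

weights = category_weights(frequencies, categories, keyword_weights)
flagged = is_sensitive(frequencies, categories, keyword_weights, threshold=5)
```

The comparison is strict, matching the claim's "greater than a preset threshold": a category whose weight equals the threshold does not trigger detection.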
Address: Shenzhen, Guangdong, CN