Classification-Based Redaction in Natural Language Text, Application No. US201113048003

Title of invention	Classification-Based Redaction in Natural Language Text
Abstract	When redacting natural language text, a classifier is used to provide a sensitive concepts model according to features in the natural language text, in which the classes employed are sensitive concepts reflected in that text. Similarly, the classifier is used to provide a utility concepts model based on utility concepts. Based on these models, and for one or more identified sensitive concepts and identified utility concepts, at least one feature in the natural language text is identified that implicates the at least one identified sensitive concept more than the at least one identified utility concept. At least some of the features thus identified may be perturbed, and the modified natural language text may then be provided as at least one redacted document. In this manner, features are perturbed to maximize classification error for the sensitive concepts while simultaneously minimizing classification error for the utility concepts.
Publication No.	US2012239380(A1)	Publication date	2012.09.20
Application No.	US201113048003	Filing date	2011.03.15
Applicant	CUMBY CHAD;GHANI RAYID;ACCENTURE GLOBAL SERVICES LIMITED	Inventor	CUMBY CHAD;GHANI RAYID
Classification	G06F17/27	Main classification	G06F17/27
Agency		Agent
Main claim
Address
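
The feature-selection idea in the abstract can be illustrated with a minimal sketch. This is not the patented method: it assumes a bag-of-words representation, uses smoothed per-word log-odds as a stand-in for the classifier models, handles one sensitive concept and one utility concept at a time, and perturbs features by masking them. The names class_weights and redact, the mask token, and the toy data are all hypothetical.

```python
import math
from collections import Counter

def class_weights(docs, labels, target, alpha=1.0):
    """Smoothed log-odds of each word under `target` versus all other labels.

    Stand-in for a trained classifier model (an assumption of this sketch).
    """
    in_counts, out_counts = Counter(), Counter()
    for words, label in zip(docs, labels):
        (in_counts if label == target else out_counts).update(words)
    vocab = set(in_counts) | set(out_counts)
    in_total = sum(in_counts.values()) + alpha * len(vocab)
    out_total = sum(out_counts.values()) + alpha * len(vocab)
    return {
        w: math.log((in_counts[w] + alpha) / in_total)
        - math.log((out_counts[w] + alpha) / out_total)
        for w in vocab
    }

def redact(words, sensitive_w, utility_w, mask="[REDACTED]"):
    """Mask words that implicate the sensitive concept more than the utility concept."""
    out = []
    for w in words:
        s = sensitive_w.get(w, 0.0)  # evidence for the sensitive concept
        u = utility_w.get(w, 0.0)    # evidence for the utility concept
        out.append(mask if s > 0 and s > u else w)
    return out

# Toy corpus: each document carries one sensitive label and one utility label.
docs = [
    ["patient", "hiv", "treatment", "billing"],
    ["patient", "flu", "treatment", "billing"],
    ["invoice", "hiv", "clinic", "shipping"],
    ["invoice", "flu", "clinic", "shipping"],
]
sensitive_labels = ["hiv", "other", "hiv", "other"]
utility_labels = ["billing", "billing", "shipping", "shipping"]

sensitive_w = class_weights(docs, sensitive_labels, target="hiv")
utility_w = class_weights(docs, utility_labels, target="billing")

redacted = redact(docs[0], sensitive_w, utility_w)
print(redacted)
```

In this toy corpus only the word "hiv" carries positive evidence for the sensitive concept while carrying none for the utility concept, so it is the only word masked; words such as "patient" and "billing", which help identify the utility concept, are left in place.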
