Title: Method for scanning, analyzing and rating digital information content
Abstract: Computer-implemented methods are described for, first, characterizing a specific category of information content (pornography, for example) and then accurately identifying instances of that category of content within a real-time media stream, such as a web page, e-mail, or other digital dataset. This content-recognition technology enables a new class of highly scalable applications to manage such content, including filtering, classifying, prioritizing, and tracking. An illustrative application of the invention is a software product for use in conjunction with web-browser client software for screening access to web pages that contain pornography or other potentially harmful or offensive content. A target attribute set of regular expressions, such as natural language words and/or phrases, is formed by statistical analysis of a number of samples of datasets characterized as "containing," and another set of samples characterized as "not containing," the selected category of information content. This list of expressions is refined by applying correlation analysis to the samples, or "training data." Neural-network feed-forward techniques are then applied, again using a substantial training dataset, for adaptively assigning relative weights to each of the expressions in the target attribute set, thereby forming a weighted list that is highly predictive of the information content category of interest.
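The pipeline in the abstract (select candidate expressions from labeled samples, learn a weight per expression, then score new content against the weighted list) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the function names, the toy "gambling" category and its sample texts, the use of a document-frequency difference in place of the statistical and correlation analysis, and a single sigmoid unit as the simplest possible feed-forward network.

```python
import math
import re


def tokenize(text):
    """Lowercase a document and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def select_expressions(positive, negative, top_k=5):
    """Pick words that occur in far more positive than negative samples.

    Assumption: a difference of document frequencies stands in for the
    statistical and correlation analysis named in the abstract.
    """
    def doc_freq(samples):
        counts = {}
        for doc in samples:
            for word in set(tokenize(doc)):
                counts[word] = counts.get(word, 0) + 1
        return {w: c / len(samples) for w, c in counts.items()}

    pos_df, neg_df = doc_freq(positive), doc_freq(negative)
    scored = {w: pos_df[w] - neg_df.get(w, 0.0) for w in pos_df}
    ranked = sorted(scored, key=lambda w: (-scored[w], w))
    return [w for w in ranked[:top_k] if scored[w] > 0]


def features(text, expressions):
    """Binary vector: 1.0 if the expression occurs in the text, else 0.0."""
    words = set(tokenize(text))
    return [1.0 if e in words else 0.0 for e in expressions]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_weights(samples, labels, expressions, epochs=200, lr=0.5):
    """Fit one weight per expression with stochastic gradient descent.

    Assumption: a single sigmoid output unit (logistic regression) is used
    as the smallest feed-forward network; the patent may use another design.
    """
    weights = [0.0] * len(expressions)
    bias = 0.0
    for _ in range(epochs):
        for text, label in zip(samples, labels):
            x = features(text, expressions)
            pred = sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))
            err = label - pred
            bias += lr * err
            weights = [w + lr * err * xi for w, xi in zip(weights, x)]
    return weights, bias


def rate(text, expressions, weights, bias):
    """Return a score in (0, 1); higher means more likely in the category."""
    x = features(text, expressions)
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


# Hypothetical toy training data for a "gambling" category.
positive = ["casino jackpot bonus tonight", "poker casino bonus", "jackpot poker bet now"]
negative = ["weather forecast tonight", "tomato soup recipe", "bus schedule for now"]

expressions = select_expressions(positive, negative)
weights, bias = train_weights(positive + negative, [1, 1, 1, 0, 0, 0], expressions)
weighted_list = dict(zip(expressions, weights))
```

A client-side filter of the kind the abstract mentions could then call `rate(page_text, expressions, weights, bias)` and block or flag the page when the score exceeds a chosen threshold such as 0.5. The threshold is likewise an assumption.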
Publication number: US6266664(B1) Publication date: 2001.07.24
Application number: US19980164940 Filing date: 1998.10.01
Applicant: RULESPACE, INC. Inventors: RUSSELL-FALLA ADRIAN PETER; HANSON ANDREW BARD
Classification: G06F17/30; (IPC1-7): G06F17/30 Main classification: G06F17/30
Agency: Agent:
Main claim:
Address: