Title: Filtering Content In An Online System Based On Text And Image Signals Extracted From The Content
Abstract: The disclosure relates to (a) a method and computer program product for training a content classifier and (b) a method and computer program product for using the trained content classifier to determine whether content items comply with a content policy of an online system. A content classifier is trained using two training sets, one containing not-safe-for-work (NSFW) content items and the other containing safe-for-work (SFW) content items. Content signals are extracted from each content item and used by the classifier to output a decision, which is compared against the item's known classification. Parameters used in the classifier are adjusted iteratively to improve classification accuracy. The trained classifier is then used to classify content items whose classifications are unknown, and appropriate action is taken for each content item in response to its classification. In alternative embodiments, multiple classifiers are implemented as part of a two-tier classification system, with text and image content classified separately.
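The training loop described in the abstract can be illustrated with a minimal sketch. The record does not name a model type, so a small logistic-regression classifier trained by stochastic gradient descent is assumed here purely for illustration. The signal vectors, the function names `predict` and `train`, and the learning rate and epoch count are all hypothetical.

```python
import math


def predict(weights, bias, signals):
    """Return a score in (0, 1); higher means more likely NSFW."""
    z = bias + sum(w * s for w, s in zip(weights, signals))
    return 1.0 / (1.0 + math.exp(-z))


def train(nsfw_items, sfw_items, epochs=200, lr=0.5):
    """Adjust parameters iteratively by comparing outputs with known labels."""
    # Known classifications: 1.0 for the NSFW set, 0.0 for the SFW set.
    labeled = [(s, 1.0) for s in nsfw_items] + [(s, 0.0) for s in sfw_items]
    weights = [0.0] * len(labeled[0][0])
    bias = 0.0
    for _ in range(epochs):
        for signals, label in labeled:
            # Difference between the classifier's output and the known label.
            error = predict(weights, bias, signals) - label
            weights = [w - lr * error * s for w, s in zip(weights, signals)]
            bias -= lr * error
    return weights, bias


# Hypothetical signal vectors (assumed, not from the record):
# [flagged-term ratio, image-derived score]
nsfw_set = [[0.9, 0.8], [0.7, 0.9], [0.8, 0.6]]
sfw_set = [[0.1, 0.2], [0.0, 0.1], [0.2, 0.3]]

weights, bias = train(nsfw_set, sfw_set)
```

Once trained, `predict(weights, bias, signals)` can be applied to the signal vector of an item whose classification is unknown, and the score compared against a threshold chosen by the operator.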
Publication No.: US2016323281(A1) Publication Date: 2016.11.03
Application No.: US201514702363 Filing Date: 2015.05.01
Applicant: Flipboard, Inc. Inventor: Griesmeyer Robert
Classification: H04L29/06; G06N7/00; G06N99/00 Main Classification: H04L29/06
Agency: Agent:
Main Claim: 1. A method for determining compliance of content with a content policy of the online system, the method comprising: receiving a content item comprising text and one or more images; extracting a plurality of text signals from the text; extracting a plurality of image signals from the one or more images; inputting the plurality of text signals and the plurality of image signals into a trained model, the model outputting a confidence value expressing a likelihood of compliance with a content policy of an online system; and determining, based on the confidence value, a compliance classification of the content item.
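The steps of claim 1 can be sketched as a short pipeline. This is only an illustration under stated assumptions: the claim does not specify which signals are extracted or what form the trained model takes, so the `ContentItem` type, the flagged-term vocabulary, the pixel-based image signal, the stand-in parameters `WEIGHTS` and `BIAS`, and the 0.5 threshold are all hypothetical.

```python
import math
from dataclasses import dataclass, field


@dataclass
class ContentItem:
    text: str
    # Each image is modeled as a list of (r, g, b) pixel tuples (assumption).
    images: list = field(default_factory=list)


FLAGGED_TERMS = {"explicit", "nsfw"}  # hypothetical vocabulary


def extract_text_signals(text):
    """Return [ratio of flagged words, 1.0 if any flagged word is present]."""
    words = text.lower().split()
    flagged = sum(1 for w in words if w in FLAGGED_TERMS)
    ratio = flagged / len(words) if words else 0.0
    return [ratio, float(flagged > 0)]


def extract_image_signals(images):
    """Return [share of pixels with r > g > b], a hypothetical image signal."""
    pixels = [p for image in images for p in image]
    if not pixels:
        return [0.0]
    warm = sum(1 for r, g, b in pixels if r > g > b)
    return [warm / len(pixels)]


def confidence_of_compliance(signals, weights, bias):
    """Stand-in for the trained model: a logistic score in (0, 1)."""
    z = bias + sum(w * s for w, s in zip(weights, signals))
    return 1.0 / (1.0 + math.exp(-z))


def classify(item, weights, bias, threshold=0.5):
    """Map the confidence value to a compliance classification."""
    signals = extract_text_signals(item.text) + extract_image_signals(item.images)
    confidence = confidence_of_compliance(signals, weights, bias)
    label = "compliant" if confidence >= threshold else "non-compliant"
    return label, confidence


# Stand-in parameters; a real system would obtain these from training.
WEIGHTS = [-4.0, -2.0, -3.0]
BIAS = 2.0

benign = ContentItem("a quiet walk in the park", [[(10, 200, 30)]])
flagged = ContentItem("explicit nsfw content", [[(200, 150, 100)]])
benign_label, _ = classify(benign, WEIGHTS, BIAS)
flagged_label, _ = classify(flagged, WEIGHTS, BIAS)
```

Concatenating the text and image signals into one vector mirrors the single-model wording of claim 1; the two-tier alternative mentioned in the abstract would instead score text and images with separate classifiers.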
Address: Palo Alto CA US