发明名称 MATCHING TEXT TO IMAGES
摘要 Text in web pages or other text documents may be classified based on the images or other objects within the webpage. A system for identifying and classifying text related to an object may identify one or more web pages containing the image or similar images, determine topics from the text of the document, and develop a set of training phrases for a classifier. The classifier may be trained and then used to analyze the text in the documents. The training set may include both positive examples and negative examples of text taken from the set of documents. A positive example may include captions or other elements directly associated with the object, while negative examples may include text taken from the documents, but from a large distance from the object. In some cases, the system may iterate on the classification process to refine the results.
申请公布号 US2013315480(A1) 申请公布日期 2013.11.28
申请号 US201313959724 申请日期 2013.08.05
申请人 MICROSOFT CORPORATION 发明人 BAKER SIMON;LIN DAHUA;KANNAN ANITHA;KE QIFA
分类号 G06K9/00 主分类号 G06K9/00
代理机构 代理人
主权项
地址