发明名称 EXTRACTION OF SNIPPET DESCRIPTIONS USING CLASSIFICATION TAXONOMIES
摘要 Systems and methods are presented for generating snippets from document data within the document and category taxonomies. In some embodiments, the system may receive a document comprising a set of paragraphs and sentences, identify text in the document relating to a set of categories, and score the paragraphs based on a relation between the paragraph and the set of categories to produce a section score. The system determines one or more sentences for inclusion in a snippet based in part on the section score. The system generates a snippet from the sentences determined for inclusion and associates the snippet with the document.
申请公布号 US2016078038(A1) 申请公布日期 2016.03.17
申请号 US201514852391 申请日期 2015.09.11
申请人 Solanki Sameep Navin;Nallapaneni Jagadish;King Tracy Holloway;Chittar Naren 发明人 Solanki Sameep Navin;Nallapaneni Jagadish;King Tracy Holloway;Chittar Naren
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method comprising: receiving a product listing from a client device, the product listing having a set of text sections associated with the product and a set of categories associated with the product, the text sections comprising a set of sentences; based on receiving the product listing, automatically generating a snippet by identifying, by a snippet server, text in the set of text sections relating to the set of categories;based on identifying the text relating to the set of categories, automatically scoring, by the snippet server, the set of text sections based on the relation between the identified text and the set of categories to produce a section score;based on the scoring of the set of text sections, automatically determining, by the snippet server, one or more sentences for inclusion in a snippet based in part on the section score of the text section to which the sentence corresponds; andgenerating the snippet from the one or more sentences determined for inclusion in the snippet; and associating the snippet with the product listing within a database of a network-based publication system for presentation in a graphical user interface.
地址 San Jose CA US