发明名称 Mapping Documents to Associated Outcome based on Sequential Evolution of Their Contents
摘要 A method and system is described for modeling the content evolution of an accessed document and predicting an associated outcome for said document. The system accesses a document but can further receive additional tags, metadata, or related information that characterizes the nature of such text collection. The invention applies various processing to separate the document into elements and performs semantic modeling to create a narrative model that describes the evolution of the contents of the elements in terms of their respective sequencing. This system then uses a set of training documents with target values assigned to them to predict an associated outcome for the accessed document. The most relevant subset of a training set can be selected by matching metadata information that characterize the accessed document and a collection of metadata that characterize other broad document sets. Such characterization is done using graph partitioning or other community detection methods from metadata information that characterize the document sets and relations between multiple sets of such documents. The outcome of the method may apply to prediction of economic value of a events described by the accessed document, success measures of the document quality, or discovery of related content with similar associated outcome to the accessed document.
申请公布号 US2016155067(A1) 申请公布日期 2016.06.02
申请号 US201514948122 申请日期 2015.11.20
申请人 Dubnov Shlomo;Dubnov Tammuz 发明人 Dubnov Shlomo;Dubnov Tammuz
分类号 G06N99/00;G06F17/28;G06N7/00 主分类号 G06N99/00
代理机构 代理人
主权项 1. A method comprising: accessing a document with attributes and tags that sequentially order elements of the document; extracting a selection of document text belonging to a specific set of attributes; creating a narrative model that represents evolution of semantics with respect to the sequentially ordered elements; accessing a set of target values and training documents, wherein the target value quantifies an outcome associated with one or more of the training documents in the set; and predicting an outcome associated with the accessed document.
地址 San Diego CA US