发明名称 Feedback enhanced attribute extraction
摘要 A data object submitted for storage is analyzed, and a set of values is extracted from the data object that can correspond to a set of attributes. The analysis of the data object can also identify possible new ontology terms. One or more extracted values are presented to the entity which submitted the data object for approval and feedback. This feedback can be used to characterize the data object with appropriate terms, train the extraction process for future extractions, and/or expand the set of known ontology terms.
申请公布号 US9087084(B1) 申请公布日期 2015.07.21
申请号 US201213620641 申请日期 2012.09.14
申请人 Google Inc. 发明人 Truher Joel
分类号 G06F17/30;G06F17/27 主分类号 G06F17/30
代理机构 Fish & Richardson P.C. 代理人 Fish & Richardson P.C.
主权项 1. A method comprising: receiving a data object from a user device, wherein the data object is an uploaded document; analyzing, using one or more computers having one or more processors and one or more non-transitory computer storage media, the data object to extract a plurality of values having corresponding attributes, wherein the analysis is based at least in part on an ontology and wherein the analysis is used to extract one or more values from the data object as corresponding to ontology terms and one or more candidate ontology terms associated with respective attributes, wherein the analysis of the data object further comprises associating the data object with one or more domain models generated through analysis of a plurality of objects from one or more corpora; providing a user interface to a client, the user interface configured to present at least one question as to whether an extracted value of the plurality of values paired with its corresponding attribute accurately characterizes the data object, wherein at least one extracted value paired with its corresponding attribute is a candidate ontology term not found in the ontology and is presented in the at least one question; receiving a response to the at least one question from the client though the user interface indicating that the attribute-value pair accurately characterizes the data object; in response to the indication that the attribute-value pair accurately characterizes the data object, analyzing the attribute-value pair to determine if the attribute and the value are in the ontology; if it is determined that the attribute is not in the ontology, updating the ontology to add the attribute to the ontology; if it is determined that the value is not in the ontology, updating the ontology to add the value to the ontology; and training the analysis to extract values from additional received data objects according to the updated ontology.
地址 Mountain View CA US