发明名称 Systems and methods for entity resolution using attributes from structured and unstructured data
摘要 In some aspects, the present disclosure relates to coreference resolution. In one embodiment, a method includes obtaining unstructured text data including a plurality of references corresponding to entities, and determining, from the unstructured text data, attributes associated with the entities. The method also includes obtaining structured data including predefined attributes associated with the entities, and comparing attributes associated with a first coreference unit with attributes associated with a second coreference unit. The first coreference unit is a sub-entity representation having the attributes determined from the unstructured text data and the second coreference unit is a sub-entity representation having the predefined attributes. The method further includes determining, based on the comparison, whether the first coreference unit and the second coreference unit both correspond to the same entity.
申请公布号 US9535902(B1) 申请公布日期 2017.01.03
申请号 US201514839348 申请日期 2015.08.28
申请人 DIGITAL REASONING SYSTEMS, INC. 发明人 Michalak Phillip Daniel;Graham Kenneth;Massey Keith Ellis;Zamata James;Gardner Holly
分类号 G06N5/02;G06F17/27;G06F17/30;G06F17/28 主分类号 G06N5/02
代理机构 Troutman Sanders LLP 代理人 Troutman Sanders LLP ;Schneider Ryan A.;Glass Christopher W.
主权项 1. A computer-implemented method, comprising: obtaining unstructured text data including a plurality of references corresponding to entities, wherein the unstructured text data is not pre-arranged with a predefined data model or schema; determining, from the unstructured text data, attributes associated with the entities; obtaining structured data including predefined attributes associated with the entities; comparing attributes associated with a first coreference unit with attributes associated with a second coreference unit, wherein the first coreference unit is a sub-entity representation having the attributes determined from the unstructured text data and the second coreference unit is a sub-entity representation having the predefined attributes; and determining, based on the comparison, whether the first coreference unit and the second coreference unit both correspond to the same entity.
地址 Franklin TN US