发明名称 SCHEMA GENERATION USING NATURAL LANGUAGE PROCESSING
摘要 In a method for generating a schema for a corpus of data, a first corpus of data is received, wherein the first corpus of data includes unstructured text. A processor identifies a set of one or more entity relationships within the first corpus of data, wherein an entity relationship comprises a first entity, a second entity, and a specified relationship between the entities. A processor compares the set of one or more entity relationships to a second corpus of data, wherein the second corpus of data includes text of a subject matter different than the corpus of data. A processor determines a score for each entity relationship based on the comparison to the second corpus of data. A processor generates a schema for the first corpus of data based on the score for each entity relationship of the set of one or more entity relationships.
申请公布号 US2016283523(A1) 申请公布日期 2016.09.29
申请号 US201514666532 申请日期 2015.03.24
申请人 International Business Machines Corporation 发明人 Farenden Matthew C.;Latty Gareth A.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址 Armonk NY US