IMPROVING ENTITY RECOGNITION IN NATURAL LANGUAGE PROCESSING SYSTEMS
Abstract
Mechanisms are provided for generating a dictionary data structure for analytical operations. A source terminology resource is ingested to generate a hierarchical representation of the source terminology resource comprising nodes for terms related to concepts in the source terminology resource. For a node of the nodes in the hierarchical representation of the source terminology resource, a permutation of a corresponding term associated with the node is generated. An expanded hierarchical representation of the source terminology resource is generated based on the generated permutation. An enhanced dictionary data structure is generated based on the expanded hierarchical representation and output to an analytics engine to perform analysis of a corpus of information using the enhanced dictionary data structure.
Application publication number
WO2014140977(A9)
Application publication date
2014.12.18
Application number
WO2014IB59310
Filing date
2014.02.27
Applicants
INTERNATIONAL BUSINESS MACHINES CORPORATION;IBM UNITED KINGDOM LIMITED;IBM (CHINA) INVESTMENT COMPANY LIMITED
Inventors
GERKEN III, JOHN, KENYON;ZBOICHYK, FIODAR;PRAGER, JOHN, MARTIN