Title of invention: BUILDING AND MAINTAINING INFORMATION EXTRACTION RULES
Abstract: Methods and arrangements for managing development of information extraction rules. One or more documents are opened for extraction. An interface is provided to create a label and thereupon label a portion of the document. The created label is stored, and an extractor is developed based on the labeling. A test interface is provided for the extractor, and results of a test conducted through the test interface are displayed. The extractor is exported. In accordance with at least one embodiment, developers are presented with automated guidance for writing extractors, which reduces the overall manual effort involved in extractor development. Generally, a focused, tutorial-type environment serves as a guide based on previously developed best practices.
Publication number: US2016371243(A1) Publication date: 2016.12.22
Application number: US201615253613 Filing date: 2016.08.31
Applicant: International Business Machines Corporation Inventors: Carreno-Fuentes Arnaldo; Chiticariu Laura; Kandogan Eser; Li Yunyao; Yang Huahai
Classification: G06F17/24; G06F17/30; G06F17/22; G06F17/21; G06F3/0486; G06F3/0482 Main classification: G06F17/24
Agency: Agent:
Principal claim: 1. A method comprising: opening one or more documents for extraction; providing an interface to create a label and thereupon label a portion of the document; storing the created label; developing an extractor based on the labeling; providing a test interface for the extractor; displaying results of a test conducted through the test interface; and exporting the extractor.
Address: Armonk NY US
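
The sequence of steps recited in claim 1 (open, label, store, develop, test, display, export) can be illustrated with a minimal Python sketch. This record does not disclose how the extractor is actually built, so everything here is an assumption made for illustration: the `Workspace` and `Label` classes, the function names, the choice of a regular-expression alternation over labeled phrases as the "extractor", and the JSON export format are all hypothetical and are not taken from the patent.

```python
from __future__ import annotations

import json
import re
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Label:
    """A named span in one document (hypothetical structure)."""
    name: str
    doc_id: str
    start: int
    end: int


@dataclass
class Workspace:
    """Holds opened documents and stored labels (hypothetical)."""
    documents: dict[str, str] = field(default_factory=dict)
    labels: list[Label] = field(default_factory=list)

    def open_document(self, doc_id: str, text: str) -> None:
        self.documents[doc_id] = text

    def add_label(self, name: str, doc_id: str, phrase: str) -> Label:
        """Label the first occurrence of `phrase` and store the label."""
        start = self.documents[doc_id].index(phrase)
        label = Label(name, doc_id, start, start + len(phrase))
        self.labels.append(label)
        return label

    def labeled_text(self, label: Label) -> str:
        return self.documents[label.doc_id][label.start:label.end]


def develop_extractor(ws: Workspace, name: str) -> dict:
    """Assumed strategy: match any phrase previously labeled `name`."""
    phrases = {ws.labeled_text(l) for l in ws.labels if l.name == name}
    if not phrases:
        raise ValueError(f"no labels named {name!r}")
    # Longest phrases first so they win over shorter overlapping ones.
    ordered = sorted(phrases, key=lambda p: (-len(p), p))
    return {"label": name, "pattern": "|".join(re.escape(p) for p in ordered)}


def run_extractor(extractor: dict, text: str) -> list[tuple[int, int, str]]:
    """Test step: return (start, end, matched text) for each hit."""
    return [(m.start(), m.end(), m.group())
            for m in re.finditer(extractor["pattern"], text)]


def export_extractor(extractor: dict) -> str:
    """Export step: serialize the extractor as JSON (assumed format)."""
    return json.dumps(extractor, sort_keys=True)


if __name__ == "__main__":
    ws = Workspace()
    ws.open_document("d1", "Contact IBM or Acme Corp today.")
    ws.add_label("Organization", "d1", "IBM")
    ws.add_label("Organization", "d1", "Acme Corp")
    extractor = develop_extractor(ws, "Organization")
    # Display the results of a test run on unseen text.
    for start, end, matched in run_extractor(extractor, "Acme Corp sued IBM."):
        print(start, end, matched)
    print(export_extractor(extractor))
```

A phrase-list extractor is used only because it is the simplest thing that can be derived from stored labels; a rule language or learned model could fill the same role in the workflow.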