发明名称 DICTIONARY REFINEMENT FOR INFORMATION EXTRACTION
摘要 A method for refining a dictionary for information extraction, the operations including: inputting a set of extracted results from execution of an extractor comprising the dictionary on a collection of text, wherein the extracted results are labeled as correct results or incorrect results; processing the extracted results using an algorithm configured to set a score of the extractor above a score threshold, wherein the score threshold balances a precision and a recall of the extractor; and outputting a set of candidate dictionary entries corresponding to a full set of dictionary entries, wherein the candidate dictionary entries are candidates to be removed from the dictionary based on the extracted results.
申请公布号 US2013318075(A1) 申请公布日期 2013.11.28
申请号 US201213480974 申请日期 2012.05.25
申请人 CHITICARIU LAURA;FELDMAN VITALY;REISS FREDERICK R.;ROY SUDEEPA;ZHU HUAIYU;INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 CHITICARIU LAURA;FELDMAN VITALY;REISS FREDERICK R.;ROY SUDEEPA;ZHU HUAIYU
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址