发明名称 INFORMATION EXTRACTION SYSTEM, INFORMATION EXTRACTION METHOD, INFORMATION EXTRACTION PROGRAM, AND INFORMATION SERVICE SYSTEM
摘要 <p>Words and phrases of the same type can be extracted from a plurality of documents of various forms. A storage device (400) stores the documents of various forms. A pattern candidate creating means (11) receives a list of input words selected as samples from words and phrases which are to be included in a dictionary. The pattern candidate creating means (11) selects one document, determines character strings before and after the input word in the document as pattern candidates, and stores them as pattern candidates (16). The pattern candidate creating means (11) performs this processing for each document. A word and phrase candidate creating means (12) extracts the words and phrases sandwiched between the patterns included in the pattern candidate (16) as word and phrase candidates which are to be outputted and stores them as a word and phrase candidate (17). A word and phrase selecting means (13) outputs a word and phrase candidate satisfying a predetermined condition out of the word and phrase candidates included in the word and phrase candidate (17) as an output word to an output device (300).</p>
申请公布号 WO2007108529(A1) 申请公布日期 2007.09.27
申请号 WO2007JP55958 申请日期 2007.03.23
申请人 NEC CORPORATION;MIZUGUCHI, HIRONORI;TSUCHIDA, MASAAKI;KUSUI, DAI;KAWAI, HIDEKI 发明人 MIZUGUCHI, HIRONORI;TSUCHIDA, MASAAKI;KUSUI, DAI;KAWAI, HIDEKI
分类号 G06F17/30;G06F17/21;G06Q30/02;G06Q30/06 主分类号 G06F17/30
代理机构 代理人
主权项
地址