Title of invention: METHOD FOR PRODUCING RULE FOR CLASSIFYING STRUCTURED DOCUMENTS, COMPUTER PROGRAM THEREFOR AND COMPUTER
Abstract:
PROBLEM TO BE SOLVED: To provide a method, a computer, and a computer program for producing a rule for efficiently classifying structured documents such as XML documents.
SOLUTION: A method is provided for producing a rule for classifying a plurality of digitized structured documents to which the same schema is applied. The method includes the steps of: scanning the schema to identify one or more variable portions defined by the schema; acquiring feature values of the identified variable portions from each of the plurality of structured documents and associating each acquired feature value with the structured document from which it was acquired; and producing the rule on the basis of the feature values associated with the structured documents. Furthermore, a computer for producing a rule for specifying a plurality of digitized structured documents to which the same schema is applied, and a computer program therefor, are provided.
COPYRIGHT: (C)2012,JPO&INPIT
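The three steps named in the abstract (scan the schema, acquire feature values per document, produce a rule) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the sample schema and documents, the function names `find_variable_portions`, `collect_features`, and `produce_rule`, the choice to treat any typed element without a `fixed` value as a "variable portion", and the heuristic of building the rule from the portion with the fewest distinct values.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

XS = "{http://www.w3.org/2001/XMLSchema}"

# Hypothetical schema: <version> is fixed, <region> and <id> may vary.
SCHEMA = """
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="order">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="version" type="xs:string" fixed="1.0"/>
        <xs:element name="region" type="xs:string"/>
        <xs:element name="id" type="xs:string"/>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>
"""

# Hypothetical documents that all follow the schema above.
DOCS = {
    "a.xml": "<order><version>1.0</version><region>EU</region><id>1</id></order>",
    "b.xml": "<order><version>1.0</version><region>EU</region><id>2</id></order>",
    "c.xml": "<order><version>1.0</version><region>US</region><id>3</id></order>",
}


def find_variable_portions(schema_text):
    """Step 1: scan the schema. Assumption: a typed element declaration
    without a fixed value counts as a variable portion."""
    root = ET.fromstring(schema_text)
    return [
        el.get("name")
        for el in root.iter(XS + "element")
        if el.get("type") is not None and el.get("fixed") is None
    ]


def collect_features(documents, portions):
    """Step 2: read each variable portion's value from every document and
    keep it associated with the document it came from."""
    features = {}
    for name, text in documents.items():
        doc = ET.fromstring(text)
        features[name] = {p: doc.findtext(p) for p in portions}
    return features


def produce_rule(features, portions):
    """Step 3 (assumed heuristic): choose the portion with the fewest
    distinct values, but more than one, and map each value to its documents."""
    best = None
    for p in portions:
        distinct = {f[p] for f in features.values()}
        if len(distinct) > 1 and (best is None or len(distinct) < best[1]):
            best = (p, len(distinct))
    if best is None:
        return None, {}
    groups = defaultdict(list)
    for name, f in features.items():
        groups[f[best[0]]].append(name)
    return best[0], dict(groups)


portions = find_variable_portions(SCHEMA)
features = collect_features(DOCS, portions)
rule_key, rule = produce_rule(features, portions)
```

In this sketch the rule classifies documents by `region`, since `id` differs in every document and `version` is excluded as fixed by the schema. The method claimed in the patent may select and combine feature values quite differently.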
Publication number: JP2012098797(A) Publication date: 2012.05.24
Application number: JP20100243910 Filing date: 2010.10.29
Applicant: INTERNATL BUSINESS MACH CORP <IBM> Inventors: TAKASE TOSHIRO; MISHINA TAKUYA
Classification: G06F17/30; G06F17/21 Main classification: G06F17/30
Agency: Agent:
Main claim:
Address: