发明名称 SYSTEM AND METHOD FOR AUTOMATIC WRAPPER INDUCTION BY APPLYING FILTERS
摘要 Information from a plurality of domains is automatically extracted according to an iterative application of rules. A first rule is generated based on a target string. The first rule comprises at least one filter. A domain of interest is identified and a training set is generated using the target string and at least one document in the domain of interest. The first rule is applied to each document in the training set to obtain a first of target results. The first set of target results are compared to desired set of target results. Based on the comparison, a second rule may is created and applied to the training set to yield an improved second set of target results.
申请公布号 CA2833355(A1) 申请公布日期 2014.05.14
申请号 CA20132833355 申请日期 2013.11.14
申请人 HOMER TLC, INC. 发明人 PAVAN KUMAR MALLAPRAGADA NAGA SURYA, SIVA KALYANA
分类号 G06F17/00;G06F9/44 主分类号 G06F17/00
代理机构 代理人
主权项
地址