发明名称 CATEGORIZATION BASED ON RECORD LINKAGE THEORY
摘要 The method and apparatus for categorizing an item based on Record Linkage Theory is disclosed. A related method and apparatus for assigning a confidence level to the categorization process is disclosed. In one aspect, the item to be categorized is parsed into at least one token. At least one category that contains the token in the training set is identified. A weight is calculated for each token with respect to a first category. Weights are combined to determine the total weight of the first category. The weighting process is repeated for each relevant category. Where one of a plurality of threshold values is met or exceeded, the item may be automatically assigned to the category with the highest total weight. The combination of threshold values may be selected based on the confidence level associated with that combination of threshold values. Weights for each relevant category, possibly ordered, may be presented.
申请公布号 WO02071273(A2) 申请公布日期 2002.09.12
申请号 WO2002US03815 申请日期 2002.02.05
申请人 VALITY TECHNOLOGY INC.;JARO, MATTHEW, A. 发明人 JARO, MATTHEW, A.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址