发明名称 Data Augmentation by Imputation
摘要 A computerized method of representing a dataset with a taxonomy includes obtaining a dataset comprising a plurality of records, the dataset being characterized by a vocabulary and each of the plurality of records being characterized by at least one term within the vocabulary; identifying nearest neighbors for each term within the vocabulary; imputing a degree of membership for each nearest neighbor identified for each term within the vocabulary; augmenting the obtained dataset with the imputed degree of membership; and generating a taxonomy of the augmented dataset.
申请公布号 US2007271266(A1) 申请公布日期 2007.11.22
申请号 US20060457103 申请日期 2006.07.12
申请人 发明人 ACHARYA CHIRANJIT;PURANG KHEMDUT;PLUTOWSKI MARK
分类号 G06F17/30;G06N99/00 主分类号 G06F17/30
代理机构 代理人
主权项
地址