Title of invention: DISCRIMINATIVE FEATURE SELECTION FOR DATA SEQUENCES
Abstract: A discriminative feature selection method for selecting a set of features from a set of training data sequences (420) is described. The training data sequences (420) are generated by at least two data sources, and each data sequence consists of a sequence of data symbols taken from an alphabet. The method is performed by first building a suffix tree (300) from the training data sequences. The suffix tree (300) contains only those suffixes of the data sequences whose empirical probability of occurrence from at least one of the sources is greater than a first predetermined threshold (430). Next, the suffix tree is pruned (310) of all suffixes for which there exists in the suffix tree (300) a shorter suffix having equivalent predictive capability for all of the data sources.
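The two steps in the abstract (threshold-based construction, then pruning) can be illustrated with a minimal Python sketch. It is not the patented implementation: a flat set of strings stands in for the suffix tree, and the names `count_contexts`, `next_dist`, `select_features`, `max_len`, `p_min`, and `eps` are hypothetical. In particular, "equivalent predictive capability" is assumed here to mean that the next-symbol distributions differ by at most `eps` in every source; the abstract does not state the actual criterion.

```python
from collections import Counter, defaultdict


def count_contexts(sequences, max_len):
    """Count every substring of length <= max_len and the symbols following it."""
    occurrences = Counter()          # substring -> number of occurrences
    followers = defaultdict(Counter)  # substring -> counts of the next symbol
    windows = Counter()              # length -> number of windows of that length
    for seq in sequences:
        for length in range(max_len + 1):
            for i in range(len(seq) - length + 1):
                s = seq[i:i + length]
                occurrences[s] += 1
                windows[length] += 1
                if i + length < len(seq):
                    followers[s][seq[i + length]] += 1
    return occurrences, followers, windows


def next_dist(followers, s, alphabet):
    """Empirical next-symbol distribution after s (uniform if s was never followed)."""
    counts = followers.get(s, Counter())
    total = sum(counts.values())
    if total == 0:
        # Assumption: fall back to a uniform distribution for unseen contexts.
        return {a: 1.0 / len(alphabet) for a in alphabet}
    return {a: counts[a] / total for a in alphabet}


def select_features(sources, max_len=3, p_min=0.05, eps=0.05):
    """sources maps a source name to a list of strings over a common alphabet."""
    stats = {name: count_contexts(seqs, max_len) for name, seqs in sources.items()}
    alphabet = sorted({c for seqs in sources.values() for seq in seqs for c in seq})

    # Step 1: keep suffixes whose empirical probability exceeds p_min
    # in at least one source.
    candidates = set()
    for occurrences, _, windows in stats.values():
        for s, n in occurrences.items():
            if s and n / windows[len(s)] > p_min:
                candidates.add(s)

    def equivalent(s, t):
        # Assumed criterion: distributions within eps of each other in all sources.
        for _, followers, _ in stats.values():
            p = next_dist(followers, s, alphabet)
            q = next_dist(followers, t, alphabet)
            if max(abs(p[a] - q[a]) for a in alphabet) > eps:
                return False
        return True

    # Step 2: drop a suffix if some shorter suffix of it (or the empty root)
    # predicts the next symbol equally well in every source.
    selected = []
    for s in sorted(candidates, key=lambda x: (len(x), x)):
        shorter = [s[k:] for k in range(1, len(s) + 1)
                   if s[k:] == "" or s[k:] in candidates]
        if not any(equivalent(s, t) for t in shorter):
            selected.append(s)
    return selected
```

For example, `select_features({"A": ["abababab"], "B": ["aabbaabb"]}, max_len=2, p_min=0.1)` retains `"ba"`: in source B the symbol following `"ba"` is always `a`, while the shorter suffix `"a"` is followed by `a` or `b` equally often, so the longer suffix carries extra predictive information for at least one source.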
Publication number: WO03058489(A1) Publication date: 2003.07.17
Application number: WO2002IL00279 Filing date: 2002.04.04
Applicant: YISSUM RESEARCH DEVELOPMENT COMPANY OF THE HEBREW;TISHBY, NAFTALY;SLONIM, NOAM;FINE, SHAI Inventor: TISHBY, NAFTALY;SLONIM, NOAM;FINE, SHAI
Classification: G06F17/27;(IPC1-7):G06F17/27 Main classification: G06F17/27
Agency: Agent:
Principal claim:
Address: