发明名称 |
DISCRIMINATIVE FEATURE SELECTION FOR DATA SEQUENCES |
摘要 |
A discriminative feature selection method for selecting a set of features from a set of training data sequences (420) is described. The training data sequences (420) are generated by at least two data sources, and each data sequences consists of a sequence of data symbols taken as an alphabet. The method is performed by first building a suffix tree (300) from the training. The suffix tree (300) contains only suffixes of the data sequences having an empirical probability of occurrence greater than a first predetermined threshold, (430) from at least one of the sources. Next the suffix tree is pruned (310) on all suffixes for which there exists in the suffix tree (300) a shorter suffix having equivalent predictive capability, for all of the data sources.
|
申请公布号 |
WO03058489(A1) |
申请公布日期 |
2003.07.17 |
申请号 |
WO2002IL00279 |
申请日期 |
2002.04.04 |
申请人 |
YISSUM RESEARCH DEVELOPMENT COMPANY OF THE HEBREW;TISHBY, NAFTALY;SLONIM, NOAM;FINE, SHAI |
发明人 |
TISHBY, NAFTALY;SLONIM, NOAM;FINE, SHAI |
分类号 |
G06F17/27;(IPC1-7):G06F17/27 |
主分类号 |
G06F17/27 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|