发明名称 |
GENERATING MACHINE-READABLE ASSOCIATION FILES |
摘要 |
Asociation files (153, 154, 155) are generated that are suitable for determining whether a data file (151) belongs to a predetermined category (A, B). A plurality of included files (156) belonging to the category are stored in combination with a plurality of excluded files (157) not belonging to the category. Included files (156) are processed to identify candidate terms for an association file (155). The suitability of candidate terms is assessed with references to occurrences in the included files (156) in addition, the suitability is also assessed with reference to occurrences in the excluded files (157) so as to provide definition terms for an association file. Thus, if a term identified as a candidate also appears frequently in the excluded files (157) it is likely to be assessed as unsuitable for inclusion within the new association file.
|
申请公布号 |
WO9956222(A1) |
申请公布日期 |
1999.11.04 |
申请号 |
WO1999GB01212 |
申请日期 |
1999.04.21 |
申请人 |
THE DIALOG CORPORATION PLC |
发明人 |
HAMMOND, RACHEL;FERNANDES, LLEWELYN, IGNAZIO |
分类号 |
G06F17/30;(IPC1-7):G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|