Title of invention Automatic classification of text files using word contexts
Abstract Text files (204) are categorised to improve searching of a text file database. Categorisation is performed by identifying occurrences of relevant words, such as company names, without directly specifying any company names. Instead, words or phrases that imply the occurrence of a relevant word are specified, and the text files are searched for those words or phrases. For example, the phrase "shares in" tends to immediately precede a company name, so a search for this phrase helps identify files that should be considered for classification as company information. However, if the relevant word is also detected in other contexts (206) which suggest that the word is not relevant, the probability of association with the preferred context (205) is reduced (210, 211) according to a scoring scheme.
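The context-based scoring described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the contrary cue phrases ("born in", "visited"), the reward and penalty weights, and the threshold are all assumptions chosen for illustration. Only the cue phrase "shares in" comes from the abstract.

```python
import re

# Cue phrases that tend to precede a relevant word (a company name).
# Only "shares in" is taken from the abstract.
POSITIVE_CUES = ["shares in"]
# Hypothetical contrary contexts suggesting the word is NOT a company name.
NEGATIVE_CUES = ["born in", "visited"]


def score_candidates(text, reward=1.0, penalty=0.5):
    """Score each capitalised word by the contexts it appears in.

    A word gains `reward` for each preferred-context match and loses
    `penalty` for each contrary-context match (assumed weights).
    Cue matching is case-sensitive in this sketch.
    """
    scores = {}
    for cues, delta in ((POSITIVE_CUES, reward), (NEGATIVE_CUES, -penalty)):
        for cue in cues:
            # The cue phrase, whitespace, then a capitalised candidate word.
            pattern = re.compile(r"\b" + re.escape(cue) + r"\s+([A-Z][A-Za-z]+)")
            for match in pattern.finditer(text):
                word = match.group(1)
                scores[word] = scores.get(word, 0.0) + delta
    return scores


def classify(text, threshold=1.0):
    """Return True if any candidate's net score reaches the threshold."""
    return any(s >= threshold for s in score_candidates(text).values())
```

With these assumed weights, a file that mentions "shares in Acme" twice scores Acme at 2.0 and is classified as company information, whereas a file in which "Paris" follows "shares in" once but also follows "born in" and "visited" nets 0.0 and is not.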
Publication number GB2336696(A) Publication date 1999.10.27
Application number GB19980008799 Filing date 1998.04.24
Applicant THE DIALOG CORPORATION PLC Inventors LLEWELYN IGNAZIO FERNANDES;RACHEL HAMMOND
Classification G06F17/30;G06K9/62;(IPC1-7):G06F17/30 Main classification G06F17/30
Agency Agent
Main claim
Address