发明名称 METHOD FOR GENERATING DESCRIPTORS FOR THE CLASSIFICATION OF TEXTS
摘要 The process for generating descriptors for text classification proposes the breakdown of complex word forms by matching with all the word forms occurring within a training text. No basis in morphological or linguistic knowledge is required for the preferably cyclically continued breakdown, nor for the accompanying drawing up of stop word prefix and suffix lists. Simple morphological knowledge is provided by the specification of minimum requirements for the form of descriptors and text sections. The process can adapted to new applications very flexibly and easily. The process is, moreover, very fault-tolerant and hence especially suitable for the classification of digitised texts obtained by character recognition processes from written texts or by means of speech recognition processes from spoken texts.
申请公布号 CA2200334(A1) 申请公布日期 1997.02.06
申请号 CA19962200334 申请日期 1996.06.18
申请人 DAIMLER BENZ AG 发明人 RENZ, INGRID
分类号 G06F17/27;G06F17/30;(IPC1-7):G06F17/30;G06F17/28 主分类号 G06F17/27
代理机构 代理人
主权项
地址