Title of invention: Method for processing text in a computer and a computer
Abstract: A method for processing text in a computer unit (1), and a computer unit (1), are proposed that enable key word lists to be generated efficiently. A first list (5) of key words is generated by partitioning a first text (10) into a plurality of text chunks, which are separated from one another by predefined text components from a text component list (20) stored in a memory (15) assigned to the computer unit (1). At least one portion of a text chunk is entered into the first list (5) of key words when its frequency of occurrence in the first text (10) exceeds a first predefined value. In a first step, all word groups in the text chunks that include a first predefined number of directly adjacent words are sought. Those word groups whose frequency of occurrence in the first text (10) exceeds the first predefined value are entered into the first list (5) of key words and are subsequently deleted from the text chunks. In a second step, all word groups in the remaining text chunks that include a second predefined number of directly adjacent words are sought, the second predefined number being smaller than the first predefined number.
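The procedure in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names, the whitespace tokenizer, the default group lengths and the threshold are all hypothetical choices made for illustration. As a simplification, frequencies are counted over the chunks that remain at each step rather than over the untouched first text, and punctuation is not handled.

```python
from collections import Counter


def split_into_chunks(words, separators):
    """Partition a word sequence into chunks delimited by separator words."""
    chunks, current = [], []
    for word in words:
        if word in separators:
            if current:
                chunks.append(current)
                current = []
        else:
            current.append(word)
    if current:
        chunks.append(current)
    return chunks


def word_groups(chunk, n):
    """Return all groups of n directly adjacent words in a chunk."""
    return [tuple(chunk[i:i + n]) for i in range(len(chunk) - n + 1)]


def remove_group(chunk, group):
    """Delete every occurrence of group from chunk, splitting it at the gaps."""
    n = len(group)
    pieces, current, i = [], [], 0
    while i < len(chunk):
        if tuple(chunk[i:i + n]) == group:
            if current:
                pieces.append(current)
                current = []
            i += n
        else:
            current.append(chunk[i])
            i += 1
    if current:
        pieces.append(current)
    return pieces


def extract_keywords(text, separators, max_len=3, min_count=1):
    """Collect frequent word groups, longest first (assumed parameters)."""
    words = text.lower().split()  # assumption: plain whitespace tokenization
    chunks = split_into_chunks(words, separators)
    keywords = []
    # Each pass uses a smaller group length than the previous one.
    for n in range(max_len, 0, -1):
        counts = Counter(g for c in chunks for g in word_groups(c, n))
        frequent = [g for g, k in counts.items() if k > min_count]
        keywords.extend(" ".join(g) for g in frequent)
        # Entered groups are deleted so shorter passes only see the remainder.
        for g in frequent:
            chunks = [p for c in chunks for p in remove_group(c, g)]
    return keywords
```

For example, with the hypothetical separator list `{"the", "and", "of", "a"}`, the text "the neural network and the neural network of a deep neural network" splits into three chunks, and the two-word group "neural network" is the only one whose count exceeds the threshold. Searching long groups first keeps a frequent phrase from being broken up into its individual words.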
Publication No.: US2004054677(A1) Publication date: 2004.03.18
Application No.: US20030416966 Filing date: 2003.10.09
Applicant: MUELLER HANS-GEORG;KOPCSA ALEXANDER;SCHIEBEL EDGAR;WIDHALM CLEMENS Inventor: MUELLER HANS-GEORG;KOPCSA ALEXANDER;SCHIEBEL EDGAR;WIDHALM CLEMENS
Classification: G06F17/30;(IPC1-7):G06F7/00 Main classification: G06F17/30
Agency: Agent:
Main claim:
Address: