摘要 |
A method for processing text in a computer unit (1) and a computer unit (1) are proposed which enable key word lists to be efficiently generated. In the process, a first list (5) of key words is generated, a first text (10) being partitioned into a plurality of text chunks which are separated from one another by predefined text components of a text component list (20) stored in a memory (15) assigned to the computer unit (1). At least one portion of a text chunk is entered into the first list (5) of key words when its frequency of occurrence in the first text (10) exceeds a first predefined value. In a first step, all word groups in the remaining text chunks are sought which include a first predefined number of directly adjacent words. Of these word groups in the text chunks, those are subsequently deleted whose frequency of occurrence in the first text exceeds the first predefined value and which, therefore, are entered into the first list (5) of key words. In a second step, all word groups in the remaining text chunks are sought which include a second predefined number of directly adjacent words, the second predefined number of words being smaller than the first predefined number of words.
|