摘要 |
Method for extracting information from a data file comprising a first step wherein the data are transmitted to a device (3.1) or "tokenizer" adapted to convert them in the course of a first step into elementary units or "tokens", the elementary units being transmitted to a second step of searching in the dictionaries (3.2) and a third step (3.3) of searching in grammars, characterized in that, for the conversion step, a sliding window of given size is used, the data are converted into "tokens" as and when they arrive in the tokenizer and the tokens are transmitted as and when they are formed to the step of searching in dictionaries (3.2), then to the step of searching in the grammars (3.3).
|