发明名称 METHOD AND DEVICE FOR RETRIEVING DATA AND TRANSFORMING SAME INTO QUALITATIVE DATA OF A TEXT-BASED DOCUMENT
摘要 Method for extracting information from a data file comprising a first step wherein the data are transmitted to a device (3.1) or "tokenizer" adapted to convert them in the course of a first step into elementary units or "tokens", the elementary units being transmitted to a second step of searching in the dictionaries (3.2) and a third step (3.3) of searching in grammars, characterized in that, for the conversion step, a sliding window of given size is used, the data are converted into "tokens" as and when they arrive in the tokenizer and the tokens are transmitted as and when they are formed to the step of searching in dictionaries (3.2), then to the step of searching in the grammars (3.3).
申请公布号 US2010023318(A1) 申请公布日期 2010.01.28
申请号 US20070161600 申请日期 2007.01.19
申请人 LEMOINE JULIEN 发明人 LEMOINE JULIEN
分类号 G06F17/27;G06F17/21 主分类号 G06F17/27
代理机构 代理人
主权项
地址