发明名称 METHOD AND APPARATUS OF PROCESSING SEMISTRUCTURED TEXTUAL DATA
摘要 A method of processing semistructured data, in particular semistructured textual data, to output data which is in accordance with a predetermined structure, wherein said semistructured data is structured into one or more elements according to a given syntax, the actual content of the syntax elements being variable and being called a token, said method comprising: extracting by means of an extractor ("parser") from said semistructured data one or more tokens, said parser being capable of returning at least one toke n in response to a respective specific command identifying the requested token by a token identifier, wherein said method further comprises: providing a sequence of commands and an associated data structure definition, both together being called a loader, said loader comprising the commands necessar y to cause said parser to return the one or more tokens to be extracted; causi ng by said sequence of commands of said loader said parser to extract said one or more tokens from said semistructured data and further converting said extracted tokens into said predetermined data structure defined by said associated structure definition.
申请公布号 CA2357048(A1) 申请公布日期 2000.07.13
申请号 CA19992357048 申请日期 1999.12.23
申请人 LION BIOSCIENCE AG 发明人 ETZOLD, THURE;COUPAYE, THIERRY
分类号 G06F17/22;G06F17/27;G06F17/30;(IPC1-7):G06F17/22 主分类号 G06F17/22
代理机构 代理人
主权项
地址