Title of invention System and method for mining data
Abstract A system and method for extracting data, hereinafter referred to as MitoMine(TM), that produces a strongly typed, ontology-defined collection referencing (and cross-referencing) all extracted records. The input to the mining process can be any data source, such as a text file delimited into a set of possibly dissimilar records. MitoMine contains parser routines and post-processing functions, known as 'munchers'. The parser routines can be accessed either via a batch mining process or as part of a running server process connected to a live source. Munchers can be registered on a per-data-source basis in order to process the records produced, possibly writing them to an external database and/or a set of servers. The present invention also embeds an interpreted, ontology-based language within a compiler/interpreter (for the source format) such that the statements of the embedded language are executed as a result of the source compiler 'recognizing' a given construct within the source and extracting the corresponding source content. In this way, the statements of the embedded program execute in a sequence that is dictated wholly by the source content. This system and method therefore make it possible to bulk-extract free-form data from sources such as CD-ROMs and the web, and to have the resultant structured data loaded into an ontology-based system.
Publication number US2003172053(A1) Publication date 2003.09.11
Application number US20030357290 Filing date 2003.02.03
Applicant FAIRWEATHER JOHN Inventor FAIRWEATHER JOHN
Classification G06F;G06F7/00;G06F9/00;G06F9/44;G06F9/45;G06F12/00;G06F12/06;G06F13/00;G06F15/16;G06F15/173;G06F17/00;G06F17/21;G06F17/27;G06F17/28;G06F17/30;G06K9/72;G06N5/00;G06N5/02;H04L;(IPC1-7):G06F17/30 Main classification G06F
Agency Agent
Main claim
Address
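The mechanism described in the abstract can be illustrated with a small sketch. It is not the MitoMine implementation: the names `mine`, `register_muncher`, `MUNCHERS`, `RULES`, and `tag_source`, the blank-line record delimiter, and the sample data are all assumptions made for illustration. Regular expressions stand in for the source-format parser, and plain Python callables stand in for the statements of the embedded language. The point it tries to show is that an action runs only when its construct is recognized, so the order of execution follows the source content, and that per-source munchers post-process each finished record.

```python
import re
from collections import defaultdict

# Hypothetical per-data-source registry of post-processing functions ("munchers").
MUNCHERS = defaultdict(list)


def register_muncher(source_name, func):
    """Register a post-processing function for one named data source."""
    MUNCHERS[source_name].append(func)


def mine(source_name, text, rules, delimiter="\n\n"):
    """Split text into records and run the action attached to a rule each time
    that rule's pattern is recognized, then pass the record to the munchers."""
    collection = []
    for chunk in text.split(delimiter):
        record = {}
        for line in chunk.splitlines():
            for pattern, action in rules:
                match = pattern.match(line)
                if match:
                    # The action fires because the construct was recognized,
                    # so execution order is dictated by the source content.
                    action(record, match)
                    break
        if record:
            for muncher in MUNCHERS[source_name]:
                record = muncher(record)
            collection.append(record)
    return collection


# Each rule pairs a recognizable construct with the statement to execute.
RULES = [
    (re.compile(r"Name:\s*(.+)"),
     lambda rec, m: rec.update(name=m.group(1).strip())),
    (re.compile(r"Born:\s*(\d{4})"),
     lambda rec, m: rec.update(birth_year=int(m.group(1)))),
]


def tag_source(record):
    """Example muncher: annotate each record with the source it came from."""
    return {**record, "source": "demo"}


register_muncher("demo", tag_source)

# Two dissimilar records: the fields appear in a different order in each.
SAMPLE = "Name: Ada Lovelace\nBorn: 1815\n\nBorn: 1912\nName: Alan Turing"
RECORDS = mine("demo", SAMPLE, RULES)
```

In this sketch the second record is filled in the order `birth_year`, then `name`, because that is the order in which the constructs appear in the source. A muncher that wrote to an external database could be registered in the same way as `tag_source`.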