发明名称 A SYSTEM AND METHOD OF EXTRACTING RAW DATA
摘要 A system and corresponding method to extract raw data from an electronic text document in a predetermined format (particularly XML format), the document comprising raw data in the form of tag data held in a hierarchical framework formed from a plurality of tag sets, wherein the system comprises: a tag analysis module operable to analyse the formatted document to identify: a plurality of tags in the hierarchical framework; and the tag data in the hierarchical framework;a grouping module operable to group at least two of the plurality of tags, wherein the at least two tags are grouped into a tag set, each tag set having a tag index specifying the location of the tag set in the hierarchical framework of the document; and an output module operable to associate a tag set with the tag data of that tag set.
申请公布号 WO2013178993(A1) 申请公布日期 2013.12.05
申请号 WO2013GB51341 申请日期 2013.05.22
申请人 VAN MOLENDORFF, STEFFAN OCKERT 发明人 VAN MOLENDORFF, STEFFAN OCKERT
分类号 G06F17/22;G06F17/30 主分类号 G06F17/22
代理机构 代理人
主权项
地址