Title of invention |
A SYSTEM AND METHOD OF EXTRACTING RAW DATA |
Abstract |
A system and corresponding method for extracting raw data from an electronic text document in a predetermined format (particularly XML format), the document comprising raw data in the form of tag data held in a hierarchical framework formed from a plurality of tag sets, wherein the system comprises: a tag analysis module operable to analyse the formatted document to identify a plurality of tags in the hierarchical framework and the tag data in the hierarchical framework; a grouping module operable to group at least two of the plurality of tags into a tag set, each tag set having a tag index specifying the location of the tag set in the hierarchical framework of the document; and an output module operable to associate a tag set with the tag data of that tag set. |
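The three modules named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It rests on two assumptions that the abstract does not confirm: a "tag set" is taken to be the chain of tag names from the root down to an element, and a "tag index" is taken to be the tuple of child positions along that chain. The function name `extract_raw_data`, the record keys, and the sample document are all hypothetical.

```python
import xml.etree.ElementTree as ET


def extract_raw_data(xml_text):
    """Return one record per element that carries text (the "tag data")."""
    root = ET.fromstring(xml_text)
    records = []

    def walk(element, tag_path, index_path):
        # Grouping step (assumed reading): the tag set is the chain of
        # tag names from the root to this element.
        tag_set = tag_path + (element.tag,)
        text = (element.text or "").strip()
        if text:
            # Output step: associate the tag set with its tag data.
            records.append(
                {"tag_set": tag_set, "tag_index": index_path, "data": text}
            )
        # Analysis step: visit every child, extending the positional index
        # (assumed reading of "tag index") by the child's position.
        for position, child in enumerate(element):
            walk(child, tag_set, index_path + (position,))

    walk(root, (), (0,))
    return records


# Hypothetical sample document, used only for illustration.
SAMPLE = (
    "<order>"
    "<customer><name>Ada</name></customer>"
    "<item><sku>X1</sku><qty>2</qty></item>"
    "</order>"
)

records = extract_raw_data(SAMPLE)
for record in records:
    print(record["tag_index"], "/".join(record["tag_set"]), "=", record["data"])
```

Under these assumptions, each extracted value stays traceable to its position in the hierarchy, so two elements with the same tag names but different positions receive different indexes.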
Publication number |
WO2013178993(A1) |
Publication date |
2013.12.05 |
Application number |
WO2013GB51341 |
Filing date |
2013.05.22 |
Applicant |
VAN MOLENDORFF, STEFFAN OCKERT |
Inventor |
VAN MOLENDORFF, STEFFAN OCKERT |
Classification |
G06F17/22;G06F17/30 |
Main classification |
G06F17/22 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|