发明名称 Data cell cluster identification and table transformation
摘要 Various embodiments may operate to access individual lines of information included in a file stored in an electronic storage medium, to detect the existence of data clusters in the file based on neighboring cell content in a horizontal direction (corresponding to the individual lines), and in a vertical direction (orthogonal to the horizontal direction), to identify at least some of the data clusters as being associated with predefined table types (comprising vertical tables, horizontal tables, or cross tables), to merge some of the data clusters into section tables having common properties, and to transform the tables resulting from the merging activity, as well as remaining un-merged data clusters, into a single flat table. The stored file may comprise a spreadsheet file.
申请公布号 US9311371(B2) 申请公布日期 2016.04.12
申请号 US201313870252 申请日期 2013.04.25
申请人 Business Objects Software 发明人 Mohan Mayur Belur;Diwan Saurabh;Gargoum Louay
分类号 G06F17/30;G06F17/24 主分类号 G06F17/30
代理机构 Schwegman Lundberg & Woessner, P.A. 代理人 Schwegman Lundberg & Woessner, P.A.
主权项 1. An apparatus, comprising: a processing node, implemented by one or more hardware processors, comprising a data reception module to sequentially access individual lines of information included in a file stored in an electronic storage medium; and a transformation module, implemented by the one or more hardware processors, to couple to the data reception module, the transformation module configured to detect existence of data clusters in the file, the detecting comprising associating a cell with a new data cluster or an existing data cluster, based on comparing the cell with adjacent cells in a horizontal direction corresponding to the individual lines,to generate metadata including characteristics associated with each of the data clusters,to identify at least some of the data clusters as being associated with predefined table types comprising vertical tables, horizontal tables, or cross tables,to merge one or more of the data clusters having common cluster table schemas into section tables, based on the metadata, andto transform the vertical tables, the horizontal tables, the cross tables, the section tables, and remaining un-merged data clusters into a single flat table having a vertical or horizontal orientation, based on the metadata.
地址 Dublin IE