Title of invention Subject-matter analysis of tabular data
Abstract A system and computer program product for subject-matter analysis of tabular data are provided in the illustrative embodiments. A first document including the tabular data is received. A library of functional signatures for a first subject-matter domain is selected. A determination is made whether a threshold number of functional signatures from the selected library are applicable to the tabular data, wherein a functional signature is applicable to the tabular data when values in the tabular data correspond to an operation and a table structure specified in the functional signature. Responsive to the threshold number of functional signatures from the selected library being applicable to the tabular data, a processor and a memory process the first document according to a process for the first subject matter domain selected from a plurality of processes for respective subject matter domains.
Publication number US9607039(B2) Publication date 2017.03.28
Application number US201313945259 Filing date 2013.07.18
Applicant INTERNATIONAL BUSINESS MACHINES CORPORATION Inventors Byron Donna Karen; Gerard Scott N.; Pikovsky Alexander; Sanchez Matthew B.
Classification G06F17/22; G06F17/30; G06F17/24 Main classification G06F17/22
Agency Garg Law Firm, PLLC Agents Garg Law Firm, PLLC; Garg Rakesh; Sarbakhsh Reza
Main claim 1. A computer usable program product comprising a non-transitory computer usable storage device including computer usable code for subject-matter analysis of tabular data, the computer usable code comprising:
computer usable code for receiving a first document including the tabular data;
computer usable code for normalizing information specific to the tabular data in the document using references specific to a first subject-matter domain of the tabular data;
computer usable code for selecting a library of functional signatures for the first subject-matter domain, wherein a functional signature in the library of functional signatures for the first subject-matter domain comprises an expression, wherein the expression represents (i) a functional relationship of a first cell and a second cell and (ii) a semantic relationship of the first cell and the second cell, wherein the first cell is a cell in the tabular data, wherein the functional relationship describes a computation between a first value in the first cell and a second value in the second cell, and wherein the semantic relationship describes an organizational relationship between a first identifier associated with the first cell and a second identifier associated with the second cell;
computer usable code for determining whether a threshold number of functional signatures from the selected library are applicable to the tabular data, wherein a functional signature is applicable to the tabular data when values in the tabular data correspond to an operation and a table structure specified in the functional signature; and
computer usable code for processing, according to a process for the first subject matter domain selected from a plurality of processes for respective subject matter domains, using a processor and a memory, the first document responsive to the threshold number of functional signatures from the selected library being applicable to the tabular data,
wherein the first identifier associated with the first cell is a header and the second identifier associated with the second cell is an indentation in a placement of the second cell in the tabular data.
Address Armonk, NY, US
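The threshold test described in the abstract and main claim can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: the `Row` and `FunctionalSignature` types, the two example signatures (indented child rows summing to their parent row, and a "Total" row summing the preceding rows at the same indentation), and the `select_process` helper. It assumes a table has already been normalized into rows carrying a header label, an indentation level, and numeric values of equal length.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Sequence


@dataclass
class Row:
    label: str           # row header text (hypothetical normalized form)
    indent: int          # indentation level of the row header
    values: List[float]  # numeric cells, same length for every row


@dataclass
class FunctionalSignature:
    name: str
    applies: Callable[[Sequence[Row]], bool]  # True if the table matches


def _sums_match(parts: Sequence[Row], whole: Row, tol: float = 1e-6) -> bool:
    """Column-wise check that the parts add up to the whole."""
    return all(
        abs(sum(p.values[k] for p in parts) - whole.values[k]) <= tol
        for k in range(len(whole.values))
    )


def children_sum_to_parent(rows: Sequence[Row]) -> bool:
    """Some row equals the sum of the rows indented one level beneath it."""
    for i, parent in enumerate(rows):
        children = []
        for row in rows[i + 1:]:
            if row.indent <= parent.indent:
                break  # left the parent's indented block
            if row.indent == parent.indent + 1:
                children.append(row)
        if len(children) >= 2 and _sums_match(children, parent):
            return True
    return False


def total_row_sums_preceding(rows: Sequence[Row]) -> bool:
    """A row labeled 'Total...' equals the sum of earlier rows at its indent."""
    for i, row in enumerate(rows):
        if row.label.lower().startswith("total"):
            peers = [r for r in rows[:i] if r.indent == row.indent]
            if len(peers) >= 2 and _sums_match(peers, row):
                return True
    return False


def count_applicable(library: Sequence[FunctionalSignature],
                     rows: Sequence[Row]) -> int:
    """Number of signatures in the library that apply to the table."""
    return sum(1 for sig in library if sig.applies(rows))


def select_process(rows: Sequence[Row],
                   libraries: Dict[str, List[FunctionalSignature]],
                   threshold: int) -> Optional[str]:
    """Return the first domain whose library meets the threshold, else None."""
    for domain, library in libraries.items():
        if count_applicable(library, rows) >= threshold:
            return domain
    return None


# Hypothetical library for an illustrative "financial" domain.
LIBRARIES = {
    "financial": [
        FunctionalSignature("children_sum_to_parent", children_sum_to_parent),
        FunctionalSignature("total_row_sums_preceding", total_row_sums_preceding),
    ]
}

balance_sheet = [
    Row("Current assets", 0, [300.0, 330.0]),
    Row("Cash", 1, [200.0, 220.0]),
    Row("Receivables", 1, [100.0, 110.0]),
    Row("Fixed assets", 0, [500.0, 520.0]),
    Row("Total assets", 0, [800.0, 850.0]),
]
roster = [
    Row("Alice", 0, [34.0, 7.0]),
    Row("Bob", 0, [29.0, 3.0]),
    Row("Carol", 0, [41.0, 12.0]),
]

domain = select_process(balance_sheet, LIBRARIES, threshold=2)
```

In this sketch each signature pairs a computation (a sum) with a structural cue (indentation or a header label), loosely mirroring the functional and semantic relationships in the claim. The returned domain name stands in for choosing a domain-specific process; a table such as `roster`, where no signature applies, yields `None`.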