发明名称 AUTOMATED APPROACH FOR EXTRACTING INTELLIGENCE, ENRICHING AND TRANSFORMING CONTENT
摘要 The present invention relates to a system and method for enriching and transforming unstructured data to obtain structured data by intelligence extraction, enrichment, categorization and hierarchy creation. The invention discloses an automated approach for transformation of unstructured documents, which involves an analysis, a transformation and a quality assessment of the input unstructured documents, to obtain the output structured documents in fewer time frames and without the need of skilled labors.
申请公布号 US2016299878(A9) 申请公布日期 2016.10.13
申请号 US201313945315 申请日期 2013.07.18
申请人 Infosys Limited 发明人 Subramaniam Jagathpathy;Madanagopal Thirumugam;Santhana Venkatasubramanian;Mishra Rahul;Chandramouli Biswanath;Raghunath Saroja;Sundaram Padmavathi;Gopalakrishnan Karthick;Narayan Anilkumar Pambalayam;Murali Sriram Krishnan
分类号 G06F17/22 主分类号 G06F17/22
代理机构 代理人
主权项 1. A content management computing device, comprising: a memory; and a processor operatively coupled to the memory, the processor configured to perform the steps comprising:; receiving at least one unstructured document and providing at least one unstructured XML file; identifying a complexity level of the at least one unstructured XML file, and listing a plurality of applicable and configurable transformation rules for the at least one unstructured XML file; transforming the at least one unstructured XML file to a structured XML file based on the plurality of applicable and configurable transformation rules and a predefined set of target schema; and validating the structured XML file against the plurality of applicable and configurable transformation rules to report incorrect transformation in the XML structured file.
地址 Bangalore IN