发明名称 System for extracting data from an electronic document
摘要 The system comprises a data acquisition engine and processing engine, wherein the data acquisition engine is adapted to extract data from the electronic document which is preferably a PDF document such as an invoice. The data comprises glyphs in the form of words and at least one property spatially associated with the glyphs, the property being location, fonts, background or shading. An anchor, point is defined according to rules describing the relationship between extracted data and the spatial property. The processing engine analyses extracted data by applying the rules to determine the anchor point and the probability of whether the spatial property and extracted data meet the requirements of the rules to determine the best if the extracted data to the format of the data output required.
申请公布号 GB2487600(A) 申请公布日期 2012.08.01
申请号 GB20110001624 申请日期 2011.01.31
申请人 KEYWORDLOGIC LIMITED 发明人 RICHARD DEVELYN
分类号 G06F17/21;G06F17/27 主分类号 G06F17/21
代理机构 代理人
主权项
地址