METHOD AND SYSTEM FOR EXTRACTING AND MANAGING INFORMATION CONTAINED IN ELECTRONIC DOCUMENTS,申请号WO2011BR00047-传众专利搜索

首页产品黄页商标征信

会员服务注册登录

法人/股东/高管

发明名称	METHOD AND SYSTEM FOR EXTRACTING AND MANAGING INFORMATION CONTAINED IN ELECTRONIC DOCUMENTS
摘要	This invention relates to a method and system that use metadata to facilitate the extraction and enable the management of information contained in electronic documents. This metadata describes the content of the documents based on the composition of their structure and the manner in which the information in question is arranged in that structure. In addition to providing a description that makes it possible to automatically manage the models used for extraction, this metadata also defines a logical schema for managing the information extracted. The method begins with a preparation step (10) in which said metadata (1) and document samples (2) are collected and stored in the system. The training step (20) is then performed, in which the system uses said metadata (1) and respective document samples (2) to build and train the models (3) used for extraction. Finally, in the extraction step (30), the system receives a collection of electronic documents (4) and uses the trained models (3) to extract the information of interest. This information, once extracted, is stored (5) by the system in accordance with the logical schema defined using the metadata, enabling it to be managed immediately. The system enables the method to be applied even if the information is dispersed throughout large documents. In one preferred embodiment, the metadata is defined using an XSD (XML Schema Definition), and the document samples are labelled in an XML format, allowing them to be validated by that XSD.
申请公布号	WO2011100814(A1)	申请公布日期	2011.08.25
申请号	WO2011BR00047	申请日期	2011.02.16
申请人	BERTOLI MARTINS, ALEXANDRE JONATAN	发明人	BERTOLI MARTINS, ALEXANDRE JONATAN
分类号	G06F17/30	主分类号	G06F17/30
代理机构		代理人
主权项
地址

您可能感兴趣的专利

由操纵柄控制的机动车液压泵

一种从植物茴香果实中分离的驱虫剂

METHOD FOR THE PRODUCTION FLOORS OR PARTING LAYERS

Packet transmission control apparatus, mobile node, control node, packet communication method, and packet communication system

APPARATUS AND METHOD FOR TRANSMITTING MPEG-4 DATA SYNCHRONIZED WITH MPEG-2 DATA

BIARYL-ACETIC ACID DERIVATIVES AND THEIR USE AS COX-2 INHIBITORS

PROTECTIVE CAP FOR MEDICAL HF INSTRUMENTS

Connection device for an electric distribution installation

Printing of variable data with the aid of variants

Premixing fuel injector and method of operation

Process for the preparation of end-products based on vinylaromatic polymers with a predominantly syndiotactic structure

Connector system for cordless appliances

Locking mechanism

Nozzle, especially for cleaning cylindrical gas filter cartridge by pressure pulsing

Cleaning arrangement for turbocharger turbines

DISPLAY ELEMENT FOR AN ELECTRONIC BACK-UP ASSIST SYSTEM

AGENT LEARNING APPARATUS, METHOD, AND PROGRAM

DAMPENING SYSTEM AND A METHOD FOR ALTERNATELY DISPENSING DAMPENING MEANS OR CLEANING LIQUID

Process for making flowable pearlescent and opacicying concentrates