Data Extraction Method, Computer Program Product and System,申请号US200913258480-传众专利搜索

首页产品黄页商标征信

会员服务注册登录

法人/股东/高管

发明名称	Data Extraction Method, Computer Program Product and System
摘要	Disclosed is a method of automatically extracting data from a target web page, comprising selecting (302) data in a source web page; determining (304) the respective DOM (document object model) trees of the source and target web page, and identifying the one or more nodes comprising the selected data in the source web page DOM tree; determining (306) matching paths in the respective DOM trees; for selected data in a node of an unmatched branch of the source web page DOM tree, identifying (308) the nearest matched path in the source web page; identifying (310) the unmatched branch nearest to the corresponding matched path in the target web page; determining (312) if said identified unmatched branch in the target web page DOM tree comprises a target node matching the selected data node; and if so: extracting (322) data from the target node if the mismatch between the respective unmatched branches does not exceed a predefined threshold. A computer program product and system implementing this method are also disclosed.
申请公布号	US2012059859(A1)	申请公布日期	2012.03.08
申请号	US200913258480	申请日期	2009.11.25
申请人	JIAO LI-MEI;XIONG YUHONG	发明人	JIAO LI-MEI;XIONG YUHONG
分类号	G06F17/30	主分类号	G06F17/30
代理机构		代理人
主权项
地址

您可能感兴趣的专利

调温开关(1)

PRESSURE SENSOR DIAGNOSTICS IN A PROCESS TRANSMITTER

PROCESS FOR PRODUCING POLYMERS OF ALKENES BY SUSPENSION POLYMERISATION

COLORED GLASS COMPOSITIONS

MULTI-STAGE SPEECH CODER WITH TRANSFORM CODING OF PREDICTION RESIDUAL SIGNALS WITH QUANTIZATION BY AUDITORY MODELS

PISTE MILLING DEVICE

MATERIALS TESTING APPARATUS

TENSION FORCE METER

HEART VALVE ACTIVATION SYSTEM AND ACTIVATED HEART VALVE

CAN END WITH EMBOSS AND DEBOSS SCORE PANEL STIFFENING BEADS

MAGNETIC RESONANCE BLOOD POOL AGENTS BOUND TO HUMAN SERUM ALBUMIN

USE OF GLIAL NEUROTROPHIC FACTOR (GDNF) FOR TREATMENT OF HEARING DISORDERS

COMPOSITION AND APPARATUS FOR SURFACE CLEANING

System und Verfahren zur Geschwindigkeitsregelung von elektrischen Motoren in extrem niedrigen Geschwindigkeitsbereichen unter Verwendung eines rotierenden Pulskodierers

COMPONENT SUCTION METHOD

KEY REPLACEMENT IN A PUBLIC KEY CRYPTOSYSTEM

COMPUTERIZED QUOTATION SYSTEM AND METHOD

MEMORY CARD CONNECTOR WITH A MEMORY CARD EJECTOR MECHANISM

POWER FACTOR CONTROLLER CIRCUIT