发明名称 SYSTEM AND METHOD OF IDENTIFYING WEB PAGE SEMANTIC STRUCTURES
摘要 The disclosure presents a method, system and computer-readable medium related to automatically analyzing structure for a web page. The method embodiment comprises building a training corpus comprising a broad stylistic coverage of web pages, segmenting a web page into information blocks, identifying semantic categories of the information blocks using the training corpus and applying the identical semantic categories in a web-based tool.
申请公布号 US2010312728(A1) 申请公布日期 2010.12.09
申请号 US20100858818 申请日期 2010.08.18
申请人 AT&T INTELLECTUAL PROPERTY II, L.P. VIA TRANSFER FROM AT&T CORP. 发明人 FENG JUNLAN;HOLLISTER BARBARA B.
分类号 G06F15/18 主分类号 G06F15/18
代理机构 代理人
主权项
地址