Title of Invention |
SYSTEM AND METHOD OF IDENTIFYING WEB PAGE SEMANTIC STRUCTURES |
Abstract |
The disclosure presents a method, system, and computer-readable medium for automatically analyzing the structure of a web page. The method embodiment comprises building a training corpus with broad stylistic coverage of web pages, segmenting a web page into information blocks, identifying semantic categories of the information blocks using the training corpus, and applying the identified semantic categories in a web-based tool.
|
Publication Number |
US2010312728(A1) |
Publication Date |
2010.12.09 |
Application Number |
US20100858818 |
Filing Date |
2010.08.18 |
Applicant |
AT&T INTELLECTUAL PROPERTY II, L.P. VIA TRANSFER FROM AT&T CORP. |
Inventors |
FENG JUNLAN; HOLLISTER BARBARA B. |
Classification |
G06F15/18 |
Main Classification |
G06F15/18 |
Patent Agency |
|
Patent Agent |
|
Principal Claim |
|
Address |
|