发明名称 FLEXIBLE AND SCALABLE STRUCTURED WEB DATA EXTRACTION
摘要 This document describes techniques that label text nodes of a seed site for each of a plurality of verticals. Once a seed site is labeled for a given vertical, the techniques extract features from the labeled text nodes of the seed site. The techniques learn vertical knowledge for the seed site based on the human labels and the extracted features, and adapt the learned vertical knowledge to a new web site to automatically and accurately identify attributes and extract attribute values targeted within a given vertical for structured web data extraction.
申请公布号 US2013073514(A1) 申请公布日期 2013.03.21
申请号 US201113237142 申请日期 2011.09.20
申请人 CAI RUI;ZHANG LEI;HAO QIANG;MICROSOFT CORPORATION 发明人 CAI RUI;ZHANG LEI;HAO QIANG
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址