发明名称 Automatic Extraction of Structured Web Content
摘要 Described is extracting structured information from web pages for use in directly answering queries with data items from the structured data. Users' post-search browsing behaviors (search trails) are treated as implicit labels as to the relevance between web content and user queries, and are used to determine wrappers for extracting structured information. In one implementation, a system identifies websites from web search logs, builds wrappers from users' search trails, filters out bad wrappers (from inconsistent user clicks), and combines structured information from different web sites, e.g., for each query.
申请公布号 US2011307479(A1) 申请公布日期 2011.12.15
申请号 US20100797614 申请日期 2010.06.10
申请人 YIN XIAOXIN;TAN WENZHAO;LI XIAO;TU YI-CHIN;SUZUE YUTAKA;APACIBLE JOHNSON T.;MICROSOFT CORPORATION 发明人 YIN XIAOXIN;TAN WENZHAO;LI XIAO;TU YI-CHIN;SUZUE YUTAKA;APACIBLE JOHNSON T.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址