Automatic Extraction of Structured Web Content,申请号US20100797614-传众专利搜索

发明名称	Automatic Extraction of Structured Web Content
摘要	Described is extracting structured information from web pages for use in directly answering queries with data items from the structured data. Users' post-search browsing behaviors (search trails) are treated as implicit labels as to the relevance between web content and user queries, and are used to determine wrappers for extracting structured information. In one implementation, a system identifies websites from web search logs, builds wrappers from users' search trails, filters out bad wrappers (from inconsistent user clicks), and combines structured information from different web sites, e.g., for each query.
申请公布号	US2011307479(A1)	申请公布日期	2011.12.15
申请号	US20100797614	申请日期	2010.06.10
申请人	YIN XIAOXIN;TAN WENZHAO;LI XIAO;TU YI-CHIN;SUZUE YUTAKA;APACIBLE JOHNSON T.;MICROSOFT CORPORATION	发明人	YIN XIAOXIN;TAN WENZHAO;LI XIAO;TU YI-CHIN;SUZUE YUTAKA;APACIBLE JOHNSON T.
分类号	G06F17/30	主分类号	G06F17/30
代理机构		代理人
主权项
地址