Title: Segment sensitive query matching
Abstract: Exemplary techniques are provided which may be implemented using various methods, apparatuses, and/or articles of manufacture to provide or otherwise support segment sensitive query matching based on segmented portions of web pages and/or providing related information for use in information extraction and/or information retrieval systems. In certain example implementations, techniques may be provided for determining whether a query match exists between a document and obtained query terms based, at least in part, on labeled portion information associated with a plurality of segmented portions of a document.
Publication number: US9465872(B2) Publication date: 2016.10.11
Application number: US200912538711 Filing date: 2009.08.10
Applicant: YAHOO! Inc. Inventors: Vadrevu Srinivas; Velipasaoglu Emre
Classification: G06F17/30 Main classification: G06F17/30
Agency: Berkeley Law & Technology Group, LLP Agent: Berkeley Law & Technology Group, LLP
Main claim: 1. A method comprising: with one or more special purpose computing devices: processing one or more search query terms submitted to a search engine via a user interface; processing labeled portions indicative of a plurality of content quality scores for a plurality of segmented portions of a web page, wherein at least one of the plurality of content quality scores is based, at least in part, upon a classification of a corresponding segmented portion of the plurality of segmented portions according to a type of content of the corresponding segmented portion and without regard to subject matter topic of the content of the corresponding segmented portion; calculating at least one weighted content quality score for the at least one of the plurality of content quality scores based, at least in part, on at least one measure of frequency of at least one term in the corresponding segmented portion matching the one or more search query terms and at least one measure of a length in words of the corresponding segmented portion; determining whether a query match exists between the web page and the one or more search query terms based, at least in part, on the labeled portions including the at least one weighted content quality score, and the one or more search query terms; and initiating transmission to the user interface of at least a portion of a result of the determination.
Address: Sunnyvale, CA, US
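
As a rough illustration of the weighting step recited in claim 1, the following Python sketch scales each segment's content quality score by how often query terms occur in the segment relative to its length in words. The claim does not fix a formula, so the ratio used here, the `Segment` class, the example labels, the `threshold` value, and the choice to test the best-scoring segment are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Segment:
    """One segmented portion of a web page (hypothetical structure)."""
    label: str      # content-type label, e.g. "main_content" or "navigation" (assumed names)
    quality: float  # content quality score attached to that content type (assumed scale 0..1)
    text: str


def weighted_quality(segment: Segment, query_terms: Iterable[str]) -> float:
    """Scale the segment's quality score by query-term frequency over length in words.

    Assumption: weight = (number of words matching a query term) / (total words).
    """
    words = segment.text.lower().split()
    if not words:
        return 0.0
    terms = {t.lower() for t in query_terms}
    matches = sum(1 for w in words if w in terms)
    return segment.quality * matches / len(words)


def is_query_match(segments: List[Segment], query_terms: Iterable[str],
                   threshold: float = 0.05) -> bool:
    """Report a match when the best weighted score reaches an assumed threshold."""
    terms = list(query_terms)
    best = max((weighted_quality(s, terms) for s in segments), default=0.0)
    return best >= threshold


if __name__ == "__main__":
    page = [
        Segment("main_content", 1.0, "cheap flights to paris and cheap hotels"),
        Segment("navigation", 0.1, "home flights hotels contact"),
    ]
    print(is_query_match(page, ["cheap", "flights"]))
```

Under these assumptions, query terms that appear only in a low-quality navigation segment fall below the threshold, while the same terms in a main-content segment produce a match.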