发明名称 Method and apparatus for extracting queries from webpages
摘要 Methods for extracting queries from webpages is provided. Methods may include receiving a plurality of queries. Each query in the plurality may be input into a search box located on a public-facing webpage. Methods may include receiving content associated with each of the public-facing webpages. Methods may include receiving identifying information associated with an inputter of each query. Methods may include selecting at least one query from the plurality of queries based at least in part on factors. The factors may include query length. The factors may include query rank. The query rank may be based in part on the magnitude of predetermined terminology included in the at least one query. The predetermined terminology may be stored on a computer-readable memory. The factors may include the grammatical relationship between query terms. The factors may include the identifying information about the inputter associated with the at least one query.
申请公布号 US9165057(B1) 申请公布日期 2015.10.20
申请号 US201514642998 申请日期 2015.03.10
申请人 Bank of America Corporation 发明人 Bostian Michael;Yeager Stephen L.;Yannam Ramakrishna R.;Kothuvatiparambil Viju
分类号 G06F17/30;G06F7/00 主分类号 G06F17/30
代理机构 Weiss & Arons LLP 代理人 Weiss & Arons LLP ;Springs, Esq. Michael A.
主权项 1. An apparatus for extracting queries from webpages, the apparatus comprising: a receiver configured to receive: a plurality of queries, wherein each query, included in the plurality, is input into a distinct search box located on a distinct public-facing webpage;content associated with each public-facing webpage;identifying information associated with each inputter of each distinct query; a processor configured to: analyze each query in the plurality of queries based on a first parameter, the first parameter being query length;discard a query, from the plurality of queries, said query which falls below a predetermined threshold with respect to fulfillment of the first parameter;analyze each query in the plurality of queries based on a second parameter, the second parameter being the magnitude of predetermined terminology included in each query's language, said predetermined terminology stored on a computer-readable memory;discard a query, from the plurality of queries, said query which falls below a predetermined threshold with respect to fulfillment of the second parameter;analyze each query in the plurality of queries based on a third parameter, the third parameter being a grammatical relationship of query terms to one another;discard a query, from the plurality of queries, said query which falls below a predetermined threshold with respect to fulfillment of the third parameter;analyze each query in the plurality of queries based on a fourth parameter, the fourth parameter being identifying information of an inputter of each query;discard a query, from the plurality of queries, said query which falls below a predetermined threshold with respect to fulfillment of the fourth parameter;analyze each query in the plurality of queries based on a fifth parameter, the fifth parameter being the content of the public-facing webpage associated with each query;discard a query, from the plurality of queries, said query which falls below a predetermined threshold with respect to fulfillment of the fifth parameter.
地址 Charlotte NC US