Title: SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR DISCOVERING A TEXT QUERY FROM EXAMPLE DOCUMENTS
Abstract: Discovering a keyword query corresponding to an input collection of documents taken from a candidate pool includes selecting a document from a working set, which initially holds the input collection, and extracting a list of snippets from the selected document. For each snippet, a set of proximity queries based on selected terms in that snippet is executed, and every proximity query that returns fewer than N results from the candidate pool is retained. From the retained proximity queries, the query that returns the greatest number of working-set documents and the smallest number of documents not in the working set is selected. Documents returned by the selected query are removed from the working set, and the above steps are repeated until no documents remain in the working set. The disjunction of the selected queries is returned as the discovered query.
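The loop described in the abstract can be read as a greedy covering procedure. The following Python sketch illustrates that reading only and is not the patented implementation. Several details are assumptions the abstract does not fix: a snippet is taken to be one sentence, "selected terms" are distinct tokens longer than three characters, a proximity query is a pair of terms that must occur within `window` tokens of each other, and the search engine is replaced by a linear scan. The names `discover_query`, `run_query`, `format_query` and the `NEAR`/`OR` rendering are hypothetical.

```python
import re
from itertools import combinations


def tokens(text):
    """Lowercased alphanumeric tokens of a text."""
    return re.findall(r"[a-z0-9]+", text.lower())


def snippets(text):
    """Assumption: a snippet is one sentence."""
    return [s for s in re.split(r"[.!?]+", text) if s.strip()]


def matches(query, text, window):
    """True if all query terms fall inside some run of `window` tokens."""
    toks = tokens(text)
    return any(query <= set(toks[i:i + window]) for i in range(len(toks)))


def run_query(query, pool, window):
    """Linear-scan stand-in for a search engine: ids of matching documents."""
    return {doc_id for doc_id, text in pool.items() if matches(query, text, window)}


def discover_query(pool, input_ids, max_results, window=8, terms_per_query=2):
    """Return (selected queries, input documents that no query could cover)."""
    working = set(input_ids)
    selected, uncovered = [], []
    while working:
        seed = min(working)  # deterministic pick; the abstract leaves the choice open
        best, best_key, best_hits = None, None, set()
        for snip in snippets(pool[seed]):
            # Assumption: "selected terms" are distinct tokens longer than 3 characters.
            terms = sorted({t for t in tokens(snip) if len(t) > 3})
            for combo in combinations(terms, terms_per_query):
                query = frozenset(combo)
                hits = run_query(query, pool, window)
                # Keep only queries returning fewer than N results that make progress.
                if len(hits) >= max_results or not hits & working:
                    continue
                # Most working-set documents first, then fewest documents outside it.
                key = (len(hits & working), -len(hits - working))
                if best_key is None or key > best_key:
                    best, best_key, best_hits = query, key, hits
        if best is None:
            # No qualifying query; drop the seed so the loop still terminates.
            uncovered.append(seed)
            working.discard(seed)
        else:
            selected.append(best)
            working -= best_hits
    return selected, uncovered


def format_query(queries):
    """Render the disjunction of the selected queries (hypothetical syntax)."""
    return " OR ".join("(" + " NEAR ".join(sorted(q)) + ")" for q in queries)
```

Here `pool` maps document ids to text, `input_ids` is the input collection, and `max_results` plays the role of N. Dropping a seed document for which no query qualifies is a safeguard added in this sketch so that the loop always terminates; the abstract does not describe that case.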
Publication number: US2013132418(A1)  Publication date: 2013.05.23
Application number: US201113300431  Filing date: 2011.11.18
Applicant: SPANGLER WILLIAM S.; INTERNATIONAL BUSINESS MACHINES CORPORATION  Inventor: SPANGLER WILLIAM S.
Classification: G06F17/30  Main classification: G06F17/30
Agency:  Agent:
Principal claim:
Address: