发明名称 MECHANISM FOR EFFICIENTLY SEARCHING XML DOCUMENT COLLECTIONS
摘要 The techniques presented herein are directed towards providing a user-directed keyword-based search on a large collection of XML documents, and displaying a summary of results to the user. Prior to receiving search requests from a user, an offline analysis of a large collection of XML documents is performed to construct an inverted index of keywords. For each keyword, the index stores a set of location indicators that identify all the instances of the keyword found in the collection of documents. A location indicator may comprise a document identifier, an indication of the position of the node in the hierarchy of nodes within the XML document containing the keyword, and an indication of the pathname of the node containing the keyword. Once the index is constructed, keyword searching can be done efficiently by a keyword lookup in the index. Various display strategies enable the user to see the specific portion of a large XML document containing the keyword and/or path frequency information allowing the user to easily refine the search to specific paths within the collection of documents.
申请公布号 US2010228734(A1) 申请公布日期 2010.09.09
申请号 US20090391818 申请日期 2009.02.24
申请人 ORACLE INTERNATIONAL CORPORATION 发明人 MURTHY RAVI
分类号 G06F7/06;G06F17/30 主分类号 G06F7/06
代理机构 代理人
主权项
地址