Title of invention: EFFICIENT IN-MEMORY DB QUERY PROCESSING OVER ANY SEMI-STRUCTURED DATA FORMATS
Abstract: Techniques are described herein for maintaining two copies of the same semi-structured data, where each copy is organized in a different format. One copy is in a first-format that may be convenient for storage, but inefficient for query processing. For example, the first-format may be a textual format that needs to be parsed every time a query needs to access individual data items within a semi-structured object. The database system intelligently loads semi-structured first-format data into volatile memory and, while doing so, converts the semi-structured first-format data to a second-format. Because the data in volatile memory is in the second-format, processing queries against the second-format data both avoids disk I/O and increases the efficiency of the queries themselves. For example, the parsing that may be necessary to run a query against a cached copy of the first-format data is avoided.
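The difference between querying the first-format and the second-format can be illustrated with a minimal Python sketch. It is not taken from the patent: JSON text stands in for the textual first-format, a list of parsed dictionaries stands in for the in-memory second-format, and the names `FIRST_FORMAT_ROWS`, `load_second_format`, `query_first_format`, and `query_second_format` are hypothetical.

```python
import json

# Assumption: JSON text stands in for the first-format data on persistent storage.
FIRST_FORMAT_ROWS = [
    '{"id": 1, "customer": "Ann", "total": 40}',
    '{"id": 2, "customer": "Bob", "total": 75}',
    '{"id": 3, "customer": "Ann", "total": 10}',
]


def query_first_format(rows, customer):
    """Sum totals for one customer, parsing every document on every query."""
    result = 0
    for text in rows:
        doc = json.loads(text)  # parsing cost is paid again for each query
        if doc["customer"] == customer:
            result += doc["total"]
    return result


def load_second_format(rows):
    """Parse each document once while loading it into memory."""
    return [json.loads(text) for text in rows]


def query_second_format(mirror, customer):
    """Answer the same query from the pre-parsed in-memory copy."""
    return sum(doc["total"] for doc in mirror if doc["customer"] == customer)


# The conversion happens once, at load time; later queries reuse the result.
mirror = load_second_format(FIRST_FORMAT_ROWS)
```

Both query functions return the same answer; the second one simply skips the repeated parsing, which is the saving the abstract describes.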
Publication number: US2017060973(A1) Publication date: 2017.03.02
Application number: US201615162235 Filing date: 2016.05.23
Applicant: Oracle International Corporation Inventors: Liu Zhen Hua;Chavan Shasank Kisan;Mishra Aurosish
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Main claim: 1. A method comprising: maintaining, by a database server, a set of first-format data; wherein the database server maintains the set of first-format data on persistent storage; wherein the set of first-format data: is accessible to the database server, represents a set of semi-structured documents, and is stored in a first-format; converting, by the database server, the set of first-format data to a second-format to produce a set of second-format data; wherein the second-format is one of: a row-based mirror format; a set-based mirror format; or a master-detail mirror format; wherein the database server maintains the set of second-format data in volatile memory; wherein the second-format is different from and independent of the first-format; wherein the set of second-format data is accessible to the database server; the database server maintaining the set of second-format data transactionally consistent with the set of first-format data as the database server executes database operations that make changes to the set of first-format data; receiving, by the database server, a request to execute a database command that requires particular information from the set of semi-structured documents; and in response to receiving the request, the database server executing the database command; wherein, during execution of the database command, the database server obtains at least some of the particular information from the set of second-format data.
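The consistency requirement in the claim can be pictured with a deliberately simplified Python sketch. It is an illustration under stated assumptions, not the claimed method: the class `MirroredStore` and its methods are hypothetical, a dictionary of JSON text stands in for persistent storage, a dictionary of parsed objects stands in for the second-format data in volatile memory, and the row-based, set-based, and master-detail mirror formats are not modeled. Real transactional consistency (concurrency, rollback, recovery) is far beyond what is shown.

```python
import json


class MirroredStore:
    """Hypothetical store that keeps a parsed mirror in step with text data."""

    def __init__(self):
        # Assumption: stands in for first-format data on persistent storage.
        self.first_format = {}
        # Assumption: stands in for second-format data in volatile memory.
        self.second_format = {}

    def put(self, doc_id, text):
        """Apply a change to both copies, or to neither if conversion fails."""
        parsed = json.loads(text)  # raises ValueError before any copy changes
        self.first_format[doc_id] = text
        self.second_format[doc_id] = parsed

    def get_field(self, doc_id, field):
        """Serve the lookup from the in-memory mirror, with no parsing."""
        return self.second_format[doc_id][field]


store = MirroredStore()
store.put(1, '{"name": "Ann", "city": "Oslo"}')
store.put(1, '{"name": "Ann", "city": "Rome"}')  # the update reaches both copies
```

Because every change goes through `put`, a lookup served from the mirror always reflects the latest first-format text in this single-threaded sketch.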
Address: Redwood Shores, CA, US