发明名称 Processing structured documents stored in a database
摘要 A method, computer readable medium, and system for processing a structured document are disclosed. The method, computer readable medium, and system include identifying a plurality of strings processed by a database, assigning an identifier to each of the plurality of strings, storing each of the plurality of strings and its assigned identifier in a table in the database, and utilizing the table during document processing.
申请公布号 US9600597(B2) 申请公布日期 2017.03.21
申请号 US201113239695 申请日期 2011.09.22
申请人 International Business Machines Corporation 发明人 Loeser Henrik;Van der Linden Robbert C.;Vickery Brian S.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人 Walder, Jr. Stephen J.;Kashef Mohammed
主权项 1. A method for processing a structured document, the method comprising: identifying a plurality of strings processed by a database; assigning an identifier to each of the plurality of strings; storing each of the plurality of strings and its assigned identifier in a table in the database, wherein at least one string of the plurality of strings in the table in the database is derived from a source other than the structured documents stored in the database, and wherein at least another string of the plurality of strings in the table in the database is derived from the one or more structured documents stored in the database; and utilizing the table during document processing at least by: receiving a request to store a structured document in the database, wherein the structured document comprises a plurality of element, attribute, and namespace strings; transforming the structured document by replacing the plurality of element, attribute, and namespace strings with their respective identifiers at least by: requesting an identifier for one of the plurality of element, attribute, and namespace strings in the structured document;performing a lookup on the table for the one string;returning the identifier assigned to the one string when the one string is located in the table; andassigning a new identifier to the one string, storing the one string and the new identifier in the table, designating a status of the one string as permanent, and returning the new identifier when the one string is not located in the table; and storing the transformed structured document in the database.
地址 Armonk NY US