主权项 |
1. A method of data processing for query execution, the method being performed by a query engine instance running on each data node of a plurality of data nodes which together form a Hadoop™ distributed computing cluster, wherein a query is processed by whichever data node that receives the query, the method comprising:
storing initial data in an original format at a data node in the plurality of data nodes forming a peer-to-peer network for the query, each data node functioning as a peer in the peer-to-peer network and being capable of interacting with components of the Hadoop™ cluster, each peer having an instance of a query engine running in memory; converting, at the data node, the initial data to be in a target format that is optimized for relational database processing according to a predetermined schedule; and storing the converted data together with the initial data. |