摘要 |
A method for parallelization of an algorithm executing on a parallel processing system. An extension element is generated for each of the sections of the algorithm, where the sections comprise: distribution of data to multiple processing elements, transfer of data from outside of the algorithm to inside of the algorithm, global cross-communication of data between processing elements, moving data to a subset of the processing elements, and transfer of data from inside of the algorithm to outside of the algorithm. Each extension element functions to provide parallelization at a respective place in the algorithm where parallelization of the algorithm may occur. |