摘要 |
A thread dependency scheme may significantly reduce the dependency penalty and improve the parallelism efficiency in some embodiments in video compression techniques with relatively high dependencies, such as VP9. One fundamental feature is to split an individual large kernel into multiple, less dependent, smaller kernels, thereby significantly increasing the number of software threads that can potentially run in parallel. Another feature is to define the larger number of thread dependencies (superset of all the dependency candidates for each thread), with the specific thread's spatial position and associated context, and mask out some of the unnecessary thread dependencies. |