Nested Parallelism
by MichaelS from High-Performance Computing News Analysis | insideHPC on (#FXT5)
The benefits of nested parallelism on highly threaded applications can be determined and quantified. With the number of cores in both the host CPU (Intel Xeon) and the coprocessor (Intel Xeon Phi) continues to increase, much thought must be given to minimizing the thread overhead when many threads need to be synchronized, as well as the memory access for each processor (core). Tasks that can be spread across an entire system to exploit the algorithm's parallelism, should be mapped to the NUMA node to make them more efficient.
The post Nested Parallelism appeared first on insideHPC.