Early performance evaluation of lattice QCD on POWER+GPU cluster
As supercomputers are shifting from peta-scale to exa-scale, computers with accelerators such as GPUS, MICs and FPGAS have become one of the big trends of supercomputer because of their low energy consumption and high density. Now IBM's POWER processor has quite new power, NVIDIA's Tesla GPU brings huge computational capability. It is important for us to understand how this new POWER+GPU environment brings power to the actual applications in the early stage. We implemented Wilson-Dirac operatorand BiCGStab solver using CUDA7.0 on the POWER+GPU cluster and evaluated the performance.