Yuechao Pan, Xuewu Xu, and Xinan Liang, "Fast distributed large-pixel-count hologram computation using a GPU cluster," Appl. Opt. 52, 6562-6571 (2013)
Abstract:
Large-pixel-count holograms are one essential part for big size holographic 3D display, but the generation of such holograms is computationally demanding. In order to address this issue, we have built a GPU cluster with 32.5 Tflop/s computing power and implemented distributed hologram computation on it with speed improvement techniques such as shared memory on GPU, GPU level adaptive load balancing and node level load distribution. Using these speed improvement techniques on the GPU cluster, we have achieved 71.4 times computation speed increase for 186M-pixel holograms. Furthermore, we have used the approaches of diffraction limits and subdivision of holograms to overcome the GPU memory limit in computing large-pixel-count holograms. 745M-pixel and 1.80G-pixel holograms were computed in 343 and 3326 seconds respectively, for more than 2 million object points with RGB colors. Color 3D objects with 591k points were successfully reconstructed from 186M-pixel hologram computed in 9.05 seconds with all the above three speed improvement techniques. It is shown that distributed hologram computation using a GPU cluster is a promising approach to increase the computation speed of large-pixel-count holograms for large size holographic display.