TY - GEN
T1 - Accelerating astronomical image subtraction on heterogeneous processors
AU - Zhao, Yan
AU - Luo, Qiong
AU - Wang, Senhong
AU - Wu, Chao
PY - 2013
Y1 - 2013
N2 - Image subtraction is an effective method used in astronomy to search transient objects or identify objects that have time-varying brightness. The state-of-the-art astronomical image subtraction methods work by taking two aligned images of the same observation area, calculating a space-varying convolution kernel for the two images, and finally obtaining the difference image using the convolution kernel. With the need for fast image subtraction in astronomy projects, we study the parallelization of HOTPANTS, a popular astronomical image subtraction package by Andrew Becker, on multicore CPUs and GPUs. Specifically, we identify the components in HOTPANTS that are data parallel and parallelize these components on the GPU and multicore CPU. We divide the work between the CPU and the GPU to minimize the overall time. In the GPU-based components, we investigate the suitable setup of the GPU thread structure for the computation, and optimize data access on the GPU memory hierarchy. Consequently, P-HOTPANTS (our parallelized HOTPANTS), achieves a 4-times speedup over the original HOTPANTS running on a desktop with an Intel i7 CPU and an NVIDIA GTX580 GPU.
AB - Image subtraction is an effective method used in astronomy to search transient objects or identify objects that have time-varying brightness. The state-of-the-art astronomical image subtraction methods work by taking two aligned images of the same observation area, calculating a space-varying convolution kernel for the two images, and finally obtaining the difference image using the convolution kernel. With the need for fast image subtraction in astronomy projects, we study the parallelization of HOTPANTS, a popular astronomical image subtraction package by Andrew Becker, on multicore CPUs and GPUs. Specifically, we identify the components in HOTPANTS that are data parallel and parallelize these components on the GPU and multicore CPU. We divide the work between the CPU and the GPU to minimize the overall time. In the GPU-based components, we investigate the suitable setup of the GPU thread structure for the computation, and optimize data access on the GPU memory hierarchy. Consequently, P-HOTPANTS (our parallelized HOTPANTS), achieves a 4-times speedup over the original HOTPANTS running on a desktop with an Intel i7 CPU and an NVIDIA GTX580 GPU.
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000330195500009
UR - https://openalex.org/W2160421408
UR - https://www.scopus.com/pages/publications/84893461093
U2 - 10.1109/eScience.2013.23
DO - 10.1109/eScience.2013.23
M3 - Conference Paper published in a book
SN - 9780768550831
T3 - Proceedings - IEEE 9th International Conference on e-Science, e-Science 2013
SP - 70
EP - 77
BT - Proceedings - IEEE 9th International Conference on e-Science, e-Science 2013
PB - IEEE Computer Society
T2 - 9th IEEE International Conference on e-Science, e-Science 2013
Y2 - 22 October 2013 through 25 October 2013
ER -