Scheduled System Maintenance:
On Wednesday, July 29th, IEEE Xplore will undergo scheduled maintenance from 7:00-9:00 AM ET (11:00-13:00 UTC). During this time there may be intermittent impact on performance. We apologize for any inconvenience.
By Topic

Strategies for accelerating forward and backprojection in list-mode OSEM PET reconstruction using GPUs

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Dieckmann, W. ; PET Dept., Nat. Institutes Health, Bethesda, MD, USA ; Thada, S. ; Barker, W.C.

Image reconstruction for the ECAT HRRT PET scanner with MOLAR is computationally demanding and requires a computer cluster for reasonable run times. Parallel computing using GPUs and CUDA offers a means to accelerate MOLAR. However, forward and backprojection operations present unique challenges that must be overcome to achieve acceptable speedup. In this study we implement GPU-accelerated versions of MOLAR's forward projection, backprojection and algorithm update modules and compare their performance to CPU-only versions. During this implementation we optimized the GPU thread configurations for each of these modules separately, along with a hybrid forward-backprojection module that is used for algorithm updates. We also numerically evaluated the reconstruction results to assess the impact of floating-point to integer conversions dictated by the GPU architecture. We found forward projection to be 41 times faster than the CPU-only code, while backprojection was 20 times faster. We found the optimal thread configurations always assigned 64 threads to a thread block, but with different distributions across the nested indexing loops within each module. These results show that MOLAR's forward and backprojection modules can be adequately accelerated to make the MOLAR reconstruction package much more efficient.

Published in:

Nuclear Science Symposium Conference Record (NSS/MIC), 2009 IEEE

Date of Conference:

Oct. 24 2009-Nov. 1 2009