Parallelizing a multi-frame blind deconvolution algorithm on clusters of multicore processors

4 Author(s)
Richard Linderman ; Air Force Research Laboratory, Information Directorate, AFRL/RI, 525 Brooks Road, Rome, NY 13441, USA ; Scott Spetka ; Susan Emeny ; Dennis Fitzgerald

The parallelization strategy of the Physically-Constrained Iterative Deconvolution (PCID) algorithm is being altered and optimized to enhance performance on emerging multi-core architectures. This paper reports results from porting PCID to multi-core architectures including the JAWS supercomputer at the Maui HPC Center (60 TFLOPS of dual-dual Xeon® nodes) and the Cell Cluster at AFRL in Rome, NY (52 TFLOPS of PlayStation 3® nodes with IBM Cell Broadband Engine® multi-cores and 14 dual-quad Xeon headnodes). For 512×512 image sizes, FFT performance exceeding 60 GFLOPS has been observed on dual-quad Xeon nodes. Multi-core architectures programmed with multiple threads delivered significantly better performance for parallelization of the low-level image convolution operations compared to earlier parallelization across cluster nodes with MPI. Another focus of the PCID multi-core effort was to move from MPI message passing to a publish-subscribe-query approach to information management. The publish, subscribe and query infrastructure was optimized for large-scale machines, such as JAWS, and features a “loose coupling” of publishers to subscribers through intervening brokers. This change makes runs on large HPCs with thousands of intercommunicating cores more flexible and more fault tolerant.
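
The loose coupling described in the abstract can be illustrated with a minimal sketch. It is not the PCID infrastructure: the `Broker` class, its method names, and the `"frames"` topic are all hypothetical, and everything runs in a single process. The sketch only shows the idea that publishers and subscribers know the broker rather than each other, and that a query lets a late joiner retrieve what was published earlier.

```python
from collections import defaultdict
from typing import Any, Callable, DefaultDict, List


class Broker:
    """Hypothetical in-process broker (illustration only, not the PCID code)."""

    def __init__(self) -> None:
        # topic -> callbacks registered by subscribers
        self._subscribers: DefaultDict[str, List[Callable[[Any], None]]] = defaultdict(list)
        # topic -> every message published so far, kept to answer queries
        self._history: DefaultDict[str, List[Any]] = defaultdict(list)

    def subscribe(self, topic: str, callback: Callable[[Any], None]) -> None:
        """Register a callback for future messages on a topic."""
        self._subscribers[topic].append(callback)

    def publish(self, topic: str, message: Any) -> None:
        """Store the message, then deliver it to the current subscribers."""
        self._history[topic].append(message)
        for callback in list(self._subscribers[topic]):
            callback(message)

    def query(self, topic: str) -> List[Any]:
        """Return a copy of everything published on a topic so far."""
        return list(self._history[topic])


broker = Broker()
received: List[str] = []

broker.publish("frames", "frame-0")          # published before anyone subscribes
broker.subscribe("frames", received.append)  # subscriber joins late
broker.publish("frames", "frame-1")          # delivered to the subscriber

late_view = broker.query("frames")           # query recovers the earlier message too
```

In this sketch the publisher never holds a reference to a subscriber, so either side can be added or removed without changing the other. That property is what the abstract credits for the added flexibility and fault tolerance.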

Published in:

2009 IEEE Aerospace Conference

Date of Conference:

7-14 March 2009