Gene sequence alignment on a public computing platform
Pellicer, S.; Ahmed, N.; Yi Pan; Yao Zheng
Parallel Processing, 2005. ICPP 2005 Workshops. International Conference Workshops on
Volume , Issue , 14-17 June 2005 Page(s): 95 - 102
Digital Object Identifier 10.1109/ICPPW.2005.35
Summary: Public computing can potentially supply not only computational power but also memory and short term storage resources to grid and cluster scale problems. Gene sequence alignment is a fundamental computational challenge in bioinformatics with attributes such as moderate computational requirements, extensive memory requirements, and highly interdependent tasks. This study examines the performance of calculating the alignment for two 100,000 base sequences on a public computing platform utilizing the BOINC framework. When compared to the theoretical, optimal sequential implementation, the parallel implementation achieves speedup by a factor of 1.4 and at the point of maximum parallelism and ends with a speedup of 1.2. This speedup factor is based on extrapolation of the sequential performance of a segment of the problem. This extrapolation would require a theoretical sequential machine with approximately 37.3 GB of working memory or suffer performance degradation from use of secondary storage during the calculation.
View citation and abstract |