Joint base-calling of two DNA sequences with factor graphs
Xiaomeng Shi; Lun, D.S.; Meldrim, J.; Kotter, R.; Medard, M.
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Volume , Issue , March 31 2008-April 4 2008 Page(s):2049 - 2052
Digital Object Identifier 10.1109/ICASSP.2008.4518043
Summary:To improve the utility of existing technologies based on Sanger sequencing, this paper examines the possibility of base-calling two superposed DNA sequences jointly. This approach allows a single electrophoresis experiment to process two sequences, using the same quantity of reagents and machine hours as for a single sequence. A practical heuristic is proposed to first estimate the peak parameters, then separate them into two sequences (major/minor) by passing messages on a factor graph. Base-calling on the major alone yields accuracy commensurate with single sequence approaches, and joint base-calling provides results for the minor which, while being of lesser quality, incurs no additional cost and can be ultimately used in the genome assembly process.
View citation and abstract |