ESTmapper: Efficiently Aligning DNA Sequences to Genomes
Wu, X.
Lee W-J
Tseng, C.-W.
Dept. of Comput. Sci., Maryland Univ., College Park, MD, USA
This paper appears in: Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International Publication Date: 04-08 April 2005
On page(s):
196a
- 196a
ISBN: 0-7695-2312-9
Digital Object Identifier: 10.1109/IPDPS.2005.204
Current Version Published: 2005-04-18
Abstract
With improvements in technology, scientists are able to sequence the full DNA (genome) of an increasing number of organisms. One way biologists can take advantage of this genomic sequence data is to use it in conjunction with expressed sequence tag (EST) information to find genes and their splice sites. We describe how ESTmapper uses an eager write-only top-down (WOTD) suffix tree to efficiently align DNA sequences against genomes, and compare its precision and performance against popular techniques for DNA alignment (BLAT, sim4, Spidey, BLAST, megaBLAST) and EST clustering (TGICL and PaCE). Experimental results show that ESTmapper is 3 to 1000 times faster than current techniques for aligning and clustering DNA sequences, and produces alignments of comparable or better quality.
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.
You are not
logged in.
Guests
may access Abstract records free of charge.