Abstract:
Classical DNA sequence compression algorithms consider only intra-sequence similarity, i.e., similar subsequences within the DNA sequence are found and encoded together. ...Show MoreMetadata
Abstract:
Classical DNA sequence compression algorithms consider only intra-sequence similarity, i.e., similar subsequences within the DNA sequence are found and encoded together. In this work, in addition to the intra-sequence similarity, we exploit the inter-sequence similarities in that similar subsequences are found within the DNA sequence as well as from other reference sequences. Hence, highly similar sequences from the same population or partially similar chromosome sequences of the same species can be compressed together to reduce the storage space. Experimental results show that the proposed scheme achieves good compressibility for both partially similar chromosome sequences and highly similar population sequences.
Published in: 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
Date of Conference: 16-19 December 2015
Date Added to IEEE Xplore: 25 February 2016
Electronic ISBN:978-9-8814-7680-7