Skip to Main Content
Many previous results in genomic sequence analysis have been derived based on the representation of genomic structures as numerical sequences. Various mapping strategies have been proposed for the representation of genomic and proteomic sequences. However, little is understood about the effect of specific choices of numerical mappings on the final analysis results. In fact, inconsistent numerical mappings could have led to contradictory results in genomic sequence analysis. In this paper, we propose a mathematical framework for analysis of the consistency in representation and transformation of numerical mappings of genomic sequences. We introduce strong and weak correlation metrics to characterize consistency measures among distinct numerical mappings. We derive sufficient conditions to ensure consistency among different numerical mappings. We present an important class of equivalent transforms under the proposed consistency conditions. We also derive a class of operators which is shown to be equivalent under rotation of numerical mappings. Finally, we conduct computer simulation experiments on DNA sequences which demonstrate the theoretical results.
Date of Conference: 17-21 May 2009