At present the genomes of many organisms have been sequenced, meaning that the nucleotide structure is known but the location of genes, and most importantly, the coding regions, are unknown. Locating the coding regions is vital as they code for the proteins which control the functioning of the organism, such as its resistance to disease. We propose a new algorithm to score genomic sequences. The algorithm is based on discriminant analysis and can be incorporated into existing programs to analyse DNA sequences
Published in:
Statistical Signal Processing, 2005 IEEE/SP 13th Workshop on
Date of Conference: 17-20 July 2005