By Topic

Large-Scale Orthology Predictions for Inferring Gene Functions across Multiple Species

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Chenggang Yu ; Telemedicine & Adv. Technol. Res. Center, US Army Med. Res. & Materiel Command (MRMC), Fort Detrick, MD, USA ; Desai, V. ; Zavaljevski, N. ; Reifman, J.

An effective approach to infer the functions of genes is to use the concept of gene orthology. Because orthologous genes are likely to share similar functions, the functions of genes in an unstudied species can be inferred through the functions of their orthologs in a studied model species. To infer gene functions for a multitude of species, we developed a high-throughput orthology prediction method, termed PhyloTrace. PhyloTrace is both highly accurate and computationally efficient for large-scale applications, having the ability to infer orthologous genes across thousands of species. This is accomplished through three major steps: 1) all-against-all gene comparisons for every pair of genes, 2) pair-wise orthology predictions for every two genomes, and 3) the generation of orthologous clusters that contain orthologous genes across multiple genomes. We employed the previously developed Pipe man parallelization program to break down a set of millions of input sequences into small chunks and then processed them in parallel. We successfully predicted orthologs for over 900 bacterial genomes, achieving a false-positive prediction rate of 2.0%, which was a significant improvement compared with the widely used bidirectional best-hit method, which yielded a false-positive rate of 5.5%.

Published in:

High Performance Computing Modernization Program Users Group Conference (HPCMP-UGC), 2010 DoD

Date of Conference:

14-17 June 2010