By Topic

Ontology-based functional classification of genes: Evaluation with reference sets and overlap analysis

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

5 Author(s)
Sidahmed Benabderrahmane ; LORIA UMR7503 (CNRS, INRIA, Nancy-Université), Équipe Orpailleur, Bâtiment B, Campus scientifique, BP239, 54506, Vandoeuvre-lès-Nancy Cedex, France ; Marie Dominique Devignes ; Malika Smail Tabbone ; Amedeo Napoli
more authors

Functional classification involves grouping genes according to their molecular functions or the biological processes they participate in. This unsupervised classification task is essential for interpreting gene datasets produced by post-genomic experiments. As the functional annotation of genes is mostly based on the Gene Ontology (GO), many similarity measures using the GO have been described, but few of them have been used for clustering. In this paper we evaluate functional classification of genes using our previously described IntelliGO semantic similarity measure with the help of reference sets. These sets consist of genes taken from human and yeast KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways and Pfam clans. Hierarchical clustering and heatmap visualization are used to illustrate the advantages of IntelliGO over several other measures. Because genes often belong to more than one reference set, the fuzzy C-means clustering algorithm is then applied to the datasets using IntelliGO. The F-score method is used to estimate the quality of clustering and the optimal number of clusters. The results are compared with those obtained from the state-of-the-art DAVID (Database for Annotation Visualization and Integrated Discovery) functional classification method. Overlap analysis allows to study the matching between clusters and reference sets, and leads us to propose a set-difference method for discovering missing information. The IntelliGO similarity measure, the clustering tool and the reference sets used for evaluation are available at:

Published in:

Bioinformatics and Biomedicine Workshops (BIBMW), 2011 IEEE International Conference on

Date of Conference:

12-15 Nov. 2011