By Topic

Finding protein domain boundaries: an automated, non-homology-based method

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)

A sequence-based methodology identifies the boundaries of structural domains in proteins. The method doesn't depend on knowledge of a protein's structure or on sequence homologs. We developed a Bayesian approach based on the statistical analysis of word content used in other fields. Our method first catalogs "pattern" frequencies - occurrences of groups of amino acids - in a nonredundant database of known protein domains and then uses the distributions of these patterns to identify regions of protein sequence that appear to signal the beginnings and ends of domains. The domain-delineating signals we've produced using amino acid patterns show great promise in providing further insight into both the biochemistry and structural biology of proteins.

Published in:

IEEE Intelligent Systems  (Volume:20 ,  Issue: 6 )