By Topic

Chromatin signature analysis and prediction of genome-wide novel promoters using finite mixture model

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Cenny Taslim ; Department of Statistics, Columbus, OH 43210, USA ; Shili Lin ; Kun Huang ; Tim Huang

Regulation of gene expression has been shown to involve not only binding of transcription factor in target gene promoters but also characterization of histone around which DNA is wrapped around. Some histone modification, for example di-methylated histone H3 at lysine 4 (H3K4me2), has been shown to be associated with gene activation. However, no clear pattern has been shown to predict human promoters. This paper proposed a novel quantitative approach to characterize chromatin signature and patterns of promoters, which are then used to predict novel (alternative) promoters. In this paper, chromatin immunoprecipitation methods followed by massive parallel sequencing (ChIP-seq) data against RNA Polymerase II (Pol II) and H3K4me2 are used to identify common patterns of promoter regions. These patterns were then used to search for similar patterns over the entire genome to find novel promoters. Common patterns of promoter regions are modeled using a mixture model involving double-exponential and uniform distributions. Regions with high correlations with the common patterns are identified as putative novel promoters. We used this proposed algorithm and RNA-seq data to identify novel promoters in the MCF7 cell line. We found 4,392 high-confidence regions that display the identified promoter patterns (referred to as putative novel promoters). Of these, 875 regions (20%) overlap with RNA transcripts. Around 70% of these putative novel promoters have overlapped with RNA transcripts, EST and/or non-coding RNA suggesting that these putative novel promoters might be promoters which are currently undiscovered.

Published in:

2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)

Date of Conference:

4-6 Dec. 2011