Loading web-font TeX/Main/Regular
Predicting MicroRNA Sequence Using CNN and LSTM Stacked in Seq2Seq Architecture | IEEE Journals & Magazine | IEEE Xplore

Predicting MicroRNA Sequence Using CNN and LSTM Stacked in Seq2Seq Architecture

CodeAvailable

Abstract:

CNN and LSTM have proven their ability in feature extraction and natural language processing, respectively. So, we tried to use their ability to process the language of R...Show More

Abstract:

CNN and LSTM have proven their ability in feature extraction and natural language processing, respectively. So, we tried to use their ability to process the language of RNAs, i.e., predicting sequence of microRNAs using the sequence of mRNA. The idea is to extract the features from sequence of mRNA using CNN and use LSTM network for prediction of miRNA. The model has learned the basic features such as seed match at first 2–8 nucleotides starting at the 5^\prime end and counting toward the 3^\prime end. Also, it was able to predict G-U wobble base pair in seed region. While validating on experimentally validated data, the model was able to predict on average 72 percent of miRNAs for specific mRNA and shows highest positive expression fold change of predicted targets on a microarray data generated using anti 25 miRNAs compare to other predicted tools. Codes are available at https://github.com/rajkumar1501/sequence-prediction-using-CNN-and-LSTMs.
Published in: IEEE/ACM Transactions on Computational Biology and Bioinformatics ( Volume: 17, Issue: 6, 01 Nov.-Dec. 2020)
Page(s): 2183 - 2188
Date of Publication: 20 August 2019

ISSN Information:

PubMed ID: 31443043

Funding Agency:


This article includes code hosted on Code Ocean, a computational reproducibility platform that allows users to view, modify, run, and download code included with IEEE Xplore articles. NOTE: A Code Ocean user account is required to access functionality in the capsule below.
Contact IEEE to Subscribe

References

References is not available for this document.