Conferences >2024 14th International Confe...

Fast and Accurate Motif Discovery in Protein Sequences Using Parallel Processing with OpenMP

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

In today's rapidly advancing field of biomedical research, the demand for swift and accurate identification of mutations within biological sequences, including proteins a...Show More

Metadata

Abstract:

In today's rapidly advancing field of biomedical research, the demand for swift and accurate identification of mutations within biological sequences, including proteins and genomes, is essential for effective disease diagnosis and treatment. Within the realm of protein sequence analysis, recurring patterns known as motifs play a crucial role. These motifs, whether of fixed or variable lengths, often signify essential structural or functional features such as transcription factor binding sites or protein-protein interaction interfaces. Over time, several methods have emerged for detecting motifs within protein datasets. Among these, our previous work introduced the Tree-based Fast Exact Motif (TFEM) algorithm. Unlike some contemporary techniques like Sensitive Thorough Rapid Enriched Motif Elicitation (STREME), Multiple EM for Motif Elicitation (MEME), and Discriminative Regular Expression Motif Elicitation (DREME), TFEM demonstrated superior efficiency in accurately identifying motifs. However, the computational complexity of TFEM presents challenges. With a time complexity of O (n20^k), where ‘n’ denotes the number of sequences in the input set and ‘k’ signifies the length of the motif under investigation, the algorithm's performance is heavily influenced by the size of the input set. To address this challenge, we propose leveraging CPU parallelization techniques, specifically Open-MP programming, to optimize the execution time of the TFEM algorithm. The evaluation results showed that parallelization in large datasets can reduce execution time up to approximately half compared to the serial algorithm.

Published in: 2024 14th International Conference on Computer and Knowledge Engineering (ICCKE)

Date of Conference: 19-20 November 2024

Date Added to IEEE Xplore: 18 February 2025

ISBN Information:

ISSN Information:

DOI: 10.1109/ICCKE65377.2024.10874452

Conference Location: Mashhad, Iran, Islamic Republic of

Contents

References is not available for this document.

Fast and Accurate Motif Discovery in Protein Sequences Using Parallel Processing with OpenMP

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Fast and Accurate Motif Discovery in Protein Sequences Using Parallel Processing with OpenMP

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Authors

Figures

References

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?