Abstract:
NoSQL database management system is introduced to tackle different sorts of challenges, including performing operations on unstructured, semi-structured, and structured d...Show MoreMetadata
Abstract:
NoSQL database management system is introduced to tackle different sorts of challenges, including performing operations on unstructured, semi-structured, and structured data. NoSQL databases gained popularity because of the improved performance than the SQL databases. We aim to investigate the NoSQL system's performance, namely MongoDB and Cassandra and SQL database, namely MySQL for DNA sequences data from the COVID-19 dataset. Studies of the DNA sequences are essential for medical diagnosis and biotechnology. However, it is quite challenging to store these genomics data in a traditional RDMS because of their unstructured nature. NoSQL is an efficient solution for textual characters like genomics data. We used around 3GB of human genome data from the COVID-19 dataset provided by NCBI. The original data was in the FASTA format, and we process these data into JSON format. Also, we have analyzed the different query syntax, data load time, and query performance time for the genomics data.
Published in: 2021 2nd International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST)
Date of Conference: 05-07 January 2021
Date Added to IEEE Xplore: 01 February 2021
ISBN Information: