Abstract:
Multiclass classification is an important technique for many complex bioinformatics problems. However, its performance is limited by the available computing power. Based on the Apache Hadoop design framework, this study proposes a two-layer architecture that exploits the inherent parallelism of GA-SVM classification to speed up computation. In performance evaluations on an mRNA benchmark cancer dataset, the system reduced the number of features by 86.55% and raised accuracy from 97.53% to 98.03%. With a user-friendly web interface, the system gives researchers an easy way to investigate what remains undiscovered in the fast-growing repository of bioinformatics data.
Published in: 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)
Date of Conference: 03-07 July 2013
Date Added to IEEE Xplore: 26 September 2013
Electronic ISBN: 978-1-4577-0216-7
PubMed ID: 24109986
- IEEE Keywords
- Index Terms
- Cloud Computing
- Cloud Computing Platform
- Multi-label
- User-friendly Web Interface
- Test Samples
- Classification Accuracy
- Support Vector Machine
- Important Characteristics
- Search Algorithm
- Standard Classification
- Support Vector Machine Model
- Network Bandwidth
- File System
- Computational Demands
- Individuals In Generation
- Huge Data
- MapReduce
- Important Role In Analysis
- Fisher Score
- Server Node
- Hadoop Distributed File System
- Standard Support Vector Machine
- Disk Storage
- Role In Data Analysis
- Training Time
- Open-source Software
- High-dimensional
- Support Vector Machine Classifier
- Power Bandwidth