Skip to Main Content
This paper reports on the development of the Cloud Oriented Data Analytics (CODA) framework which has functions for composing, managing, and processing workflows for data analytics in cloud computing. The framework provides a number of reusable software components for data analytics to users which can be composed as workflows through well-known workflow composers, e.g., RapidMiner, Taverna, and JOpera. In particular, workflow scheduling, workflow recommendation, resource provisioning, resource monitoring, data locality, and security for the workflow computation are addressed by the framework. By using the framework, we demonstrate that workflows can be easily composed and processed in cloud computing. By coordinating the submitted workflows, we can obtain a significant improvement in performance.