A Web services-based toolkit for supporting distributed data mining is presented. The toolkit includes a workflow engine that enables a user to compose Web services to implement particular point solutions. Three types of Web services are provided to implement data mining functions: (1) classifiers; (2) clustering algorithms; and (3) association rule mining. Output can be visualised through GNUPlot and Mathematica. Data sets may be read from the local filespace or streamed from a remote location, provided the algorithm being used supports streaming. A study is presented to illustrate the use of the toolkit.
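As an illustration only, and not the toolkit's actual interface, the idea of composing a data source with a mining service can be sketched in Python. Every name below (`read_local`, `stream_remote`, `NearestCentroid`, `run_workflow`) is hypothetical, the remote stream is simulated with a generator, and the nearest-centroid classifier is chosen simply because it can be updated one record at a time, which is the property an algorithm needs in order to accept streamed input.

```python
from typing import Dict, Iterable, Iterator, List, Tuple

Record = Tuple[List[float], str]  # (feature vector, class label)


def read_local(rows: List[Record]) -> Iterator[Record]:
    """Hypothetical stand-in for reading a whole data set from local filespace."""
    return iter(rows)


def stream_remote(rows: List[Record]) -> Iterator[Record]:
    """Hypothetical stand-in for a remote stream: yields one record at a time."""
    for row in rows:
        yield row


class NearestCentroid:
    """Hypothetical classifier service with streaming support.

    It keeps only per-class running sums and counts, never the full data set.
    """

    def __init__(self) -> None:
        self.sums: Dict[str, List[float]] = {}
        self.counts: Dict[str, int] = {}

    def update(self, features: List[float], label: str) -> None:
        # Incrementally fold one record into the running sum for its class.
        if label not in self.sums:
            self.sums[label] = [0.0] * len(features)
            self.counts[label] = 0
        self.sums[label] = [s + x for s, x in zip(self.sums[label], features)]
        self.counts[label] += 1

    def predict(self, features: List[float]) -> str:
        # Return the class whose centroid (sum / count) is closest.
        def sq_dist(label: str) -> float:
            n = self.counts[label]
            return sum((x - s / n) ** 2
                       for x, s in zip(features, self.sums[label]))
        return min(self.sums, key=sq_dist)


def run_workflow(source: Iterable[Record], model: NearestCentroid) -> NearestCentroid:
    """Minimal two-stage workflow: data source -> classifier training."""
    for features, label in source:
        model.update(features, label)
    return model


# Made-up toy data, used only to exercise the sketch.
data: List[Record] = [
    ([1.0, 1.0], "a"), ([1.2, 0.8], "a"),
    ([5.0, 5.0], "b"), ([4.8, 5.2], "b"),
]
local_model = run_workflow(read_local(data), NearestCentroid())
streamed_model = run_workflow(stream_remote(data), NearestCentroid())
print(local_model.predict([0.9, 1.1]), streamed_model.predict([5.1, 4.9]))
```

Because the workflow stage only iterates over its source, the same classifier can be fed from either the local or the streamed source without modification. An algorithm that needs the complete data set in memory could not be attached to the streamed source in this way.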