Data Mining:Concepts, Models, Methods, and Algorithms

Cover Image Copyright Year: 2003
Author(s): Mehmed Kantardzic
Book Type: Wiley-IEEE Press
Content Type : Books
Topics: Computing & Processing
  • Print

Abstract

A comprehensive introduction to the exploding field of data mining

We are surrounded by data, numerical and otherwise, which must be analyzed and processed to convert it into information that informs, instructs, answers, or otherwise aids understanding and decision-making. Due to the ever-increasing complexity and size of today's data sets, a new term, data mining, was created to describe the indirect, automatic data analysis techniques that utilize more complex and sophisticated tools than those which analysts used in the past to do mere data analysis.

Data Mining: Concepts, Models, Methods, and Algorithms discusses data mining principles and then describes representative state-of-the-art methods and algorithms originating from different disciplines such as statistics, machine learning, neural networks, fuzzy logic, and evolutionary computation. Detailed algorithms are provided with necessary explanations and illustrative examples.

This text offers guidance: how a d when to use a particular software tool (with their companion data sets) from among the hundreds offered when faced with a data set to mine. This allows analysts to create and perform their own data mining experiments using their knowledge of the methodologies and techniques provided.

This book emphasizes the selection of appropriate methodologies and data analysis software, as well as parameter tuning. These critically important, qualitative decisions can only be made with the deeper understanding of parameter meaning and its role in the technique that is offered here. Data mining is an exploding field and this book offers much-needed guidance to selecting among the numerous analysis programs that are available.

  •   Click to expandTable of Contents

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Frontmatter

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      The prelims comprise:

      • Half Title

      • IEEE Press Board Page

      • Title

      • Copyright

      • Dedication

      • Contents

      • Preface

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      DataMining Concepts

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Introduction

      • Data-Mining Roots

      • Data-Mining Process

      • Large Data Sets

      • Data Warehouses

      • Organization of This Book

      • Review questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Preparing the Data

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Representation of Raw Data

      • Characteristics of Raw Data

      • Transformation of Raw Data

      • Missing Data

      • Time-Dependent Data

      • Outlier Analysis

      • Review Questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Data Reduction

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Dimensions of Large Data Sets

      • Features Reduction

      • Entropy Measure for Ranking Features

      • Principal Component Analysis

      • Values Reduction

      • Feature Discretization: Chimerge Technique

      • Cases Reduction

      • Review Questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Learning from Data

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Learning Machine

      • Statistical Learning Theory

      • Types of Learning Methods

      • Common Learning Tasks

      • Model Estimation

      • Review Questions and Problems

      • References for further study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Statistical Methods

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Statistical Inference

      • Assessing Differences in Data Sets

      • Bayesian Inference

      • Predictive Regression

      • Analysis of Variance

      • Logistic Regression

      • Log-Linear Models

      • Linear Discriminant Analysis

      • Review Questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Cluster Analysis

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Clustering Concepts

      • Similarity Measures

      • Agglomerative Hierarchical Clustering

      • Partitional Clustering

      • Incremental Clustering

      • Review Questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Decision Trees and Decision Rules

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Decision Trees

      • C4.5 Algorithm: Generating a Decision Tree

      • Unknown Attribute Values

      • Pruning Decision Tree

      • C4.5 Algorithm: Generating Decision Rules

      • Limitations of Decision Trees and Decision rules

      • Associative-Classification Method

      • Review Questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Association Rules

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Market-Basket Analysis

      • Algorithm Apriori

      • From Frequent Itemsets to Association Rules

      • Improving the Efficiency of the Apriori Algorithm

      • Frequent Pattern-Growth Method (FP-Growth Method)

      • Multidimensional Association-Rules Mining

      • Web Mining

      • HITS and LOGSOM Algorithms

      • Mining Path-Traversal Patterns

      • Text Mining

      • Review Questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Artificial Neural Networks

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Model of an Artificial Neuron

      • Architectures of Artificial Neural Networks

      • Learning Process

      • Learning Tasks

      • Multilayer Perceptrons

      • Competitive Networks and Competitive Learning

      • Review Questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Genetic Algorithms

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Fundamentals of Genetic Algorithms

      • Optimization Using Genetic Algorithms

      • A Simple Illustration of a Genetic Algorithm

      • Schemata

      • Traveling Salesman Problem

      • Machine Learning Using Genetic Algorithms

      • Review Questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Fuzzy Sets and Fuzzy Logic

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Fuzzy Sets

      • Fuzzy Set Operations

      • Extension Principle and Fuzzy Relations

      • Fuzzy Logic and Fuzzy Inference Systems

      • Multifactorial Evaluation

      • Extracting Fuzzy Models from Data

      • Review Questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Visualization Methods

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This chapter contains sections titled:

      • Perception and Visualization

      • Scientific Visualization and Information Visualization

      • Parallel Coordinates

      • Radial Visualization

      • Kohonen Self-Organized Maps

      • Visualization Systems for Data Mining

      • Review Questions and Problems

      • References for Further Study

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      References

      Page(s): 297 - 307
      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      A comprehensive introduction to the exploding field of data mining

      We are surrounded by data, numerical and otherwise, which must be analyzed and processed to convert it into information that informs, instructs, answers, or otherwise aids understanding and decision-making. Due to the ever-increasing complexity and size of today's data sets, a new term, data mining, was created to describe the indirect, automatic data analysis techniques that utilize more complex and sophisticated tools than those which analysts used in the past to do mere data analysis.

      Data Mining: Concepts, Models, Methods, and Algorithms discusses data mining principles and then describes representative state-of-the-art methods and algorithms originating from different disciplines such as statistics, machine learning, neural networks, fuzzy logic, and evolutionary computation. Detailed algorithms are provided with necessary explanations and illustrative examples.

      This text offers guidance: how a d when to use a particular software tool (with their companion data sets) from among the hundreds offered when faced with a data set to mine. This allows analysts to create and perform their own data mining experiments using their knowledge of the methodologies and techniques provided.

      This book emphasizes the selection of appropriate methodologies and data analysis software, as well as parameter tuning. These critically important, qualitative decisions can only be made with the deeper understanding of parameter meaning and its role in the technique that is offered here. Data mining is an exploding field and this book offers much-needed guidance to selecting among the numerous analysis programs that are available. View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Appendix A: DataMining Tools

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This appendix contains sections titled:

      • Commercially and publicly available tools

      • Web Site Links

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Appendix B: DataMining Applications

      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      This appendix contains sections titled:

      • Data Mining for Financial Data Analysis

      • Data Mining for the Telecommunications Industry

      • Data Mining for the Retail Industry

      • Data Mining in Healthcare and Biomedical Research

      • Data Mining in Science and Engineering

      • Pitfalls of Data Mining

      View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      Index

      Page(s): 339 - 343
      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      A comprehensive introduction to the exploding field of data mining

      We are surrounded by data, numerical and otherwise, which must be analyzed and processed to convert it into information that informs, instructs, answers, or otherwise aids understanding and decision-making. Due to the ever-increasing complexity and size of today's data sets, a new term, data mining, was created to describe the indirect, automatic data analysis techniques that utilize more complex and sophisticated tools than those which analysts used in the past to do mere data analysis.

      Data Mining: Concepts, Models, Methods, and Algorithms discusses data mining principles and then describes representative state-of-the-art methods and algorithms originating from different disciplines such as statistics, machine learning, neural networks, fuzzy logic, and evolutionary computation. Detailed algorithms are provided with necessary explanations and illustrative examples.

      This text offers guidance: how a d when to use a particular software tool (with their companion data sets) from among the hundreds offered when faced with a data set to mine. This allows analysts to create and perform their own data mining experiments using their knowledge of the methodologies and techniques provided.

      This book emphasizes the selection of appropriate methodologies and data analysis software, as well as parameter tuning. These critically important, qualitative decisions can only be made with the deeper understanding of parameter meaning and its role in the technique that is offered here. Data mining is an exploding field and this book offers much-needed guidance to selecting among the numerous analysis programs that are available. View full abstract»

    • Full text access may be available. Click article title to sign in or learn about subscription options.

      About the Author

      Page(s): 345
      Copyright Year: 2003

      Wiley-IEEE Press eBook Chapters

      A comprehensive introduction to the exploding field of data mining

      We are surrounded by data, numerical and otherwise, which must be analyzed and processed to convert it into information that informs, instructs, answers, or otherwise aids understanding and decision-making. Due to the ever-increasing complexity and size of today's data sets, a new term, data mining, was created to describe the indirect, automatic data analysis techniques that utilize more complex and sophisticated tools than those which analysts used in the past to do mere data analysis.

      Data Mining: Concepts, Models, Methods, and Algorithms discusses data mining principles and then describes representative state-of-the-art methods and algorithms originating from different disciplines such as statistics, machine learning, neural networks, fuzzy logic, and evolutionary computation. Detailed algorithms are provided with necessary explanations and illustrative examples.

      This text offers guidance: how a d when to use a particular software tool (with their companion data sets) from among the hundreds offered when faced with a data set to mine. This allows analysts to create and perform their own data mining experiments using their knowledge of the methodologies and techniques provided.

      This book emphasizes the selection of appropriate methodologies and data analysis software, as well as parameter tuning. These critically important, qualitative decisions can only be made with the deeper understanding of parameter meaning and its role in the technique that is offered here. Data mining is an exploding field and this book offers much-needed guidance to selecting among the numerous analysis programs that are available. View full abstract»