By Topic

Sixth International Conference on Data Mining (ICDM'06)

Date 18-22 Dec. 2006

Filter Results

Displaying Results 1 - 25 of 169
  • Sixth International Conference on Data Mining [Cover]

    Publication Year: 2006
    Request permission for commercial reuse | PDF file iconPDF (174 KB)
    Freely Available from IEEE
  • Sixth International Conference on Data Mining-Title

    Publication Year: 2006, Page(s):i - iii
    Request permission for commercial reuse | PDF file iconPDF (35 KB)
    Freely Available from IEEE
  • Sixth International Conference on Data Mining-Copyright

    Publication Year: 2006, Page(s): iv
    Request permission for commercial reuse | PDF file iconPDF (42 KB)
    Freely Available from IEEE
  • Sixth International Conference on Data Mining - TOC

    Publication Year: 2006, Page(s):v - xiii
    Request permission for commercial reuse | PDF file iconPDF (78 KB)
    Freely Available from IEEE
  • Welcome from Conference Chairs

    Publication Year: 2006, Page(s): xiv
    Request permission for commercial reuse | PDF file iconPDF (30 KB) | HTML iconHTML
    Freely Available from IEEE
  • Preface

    Publication Year: 2006, Page(s):xv - xvi
    Request permission for commercial reuse | PDF file iconPDF (36 KB) | HTML iconHTML
    Freely Available from IEEE
  • Conference organization

    Publication Year: 2006, Page(s): xvii
    Request permission for commercial reuse | PDF file iconPDF (33 KB)
    Freely Available from IEEE
  • Program Committee

    Publication Year: 2006, Page(s): xviii
    Request permission for commercial reuse | PDF file iconPDF (29 KB)
    Freely Available from IEEE
  • Program Committee

    Publication Year: 2006, Page(s):xix - xxii
    Request permission for commercial reuse | PDF file iconPDF (49 KB)
    Freely Available from IEEE
  • Non-PC reviewers

    Publication Year: 2006, Page(s):xxiii - xxv
    Request permission for commercial reuse | PDF file iconPDF (33 KB)
    Freely Available from IEEE
  • Corporate Sponsors

    Publication Year: 2006, Page(s): xxvi
    Request permission for commercial reuse | PDF file iconPDF (52 KB)
    Freely Available from IEEE
  • Invited speakers

    Publication Year: 2006
    Request permission for commercial reuse | PDF file iconPDF (27 KB)
    Freely Available from IEEE
  • Tutorials

    Publication Year: 2006, Page(s): xxviii
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (29 KB) | HTML iconHTML

    Provides an abstract for each of the presentations and a brief professional biography of each presenter. The complete presentations were not made available for publication as part of the conference proceedings. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Neuroscience: New Insights for AI?

    Publication Year: 2006, Page(s):3 - 5
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (208 KB) | HTML iconHTML

    Understanding the processing of information in our cortex is a significant part of understanding how the brain works and of understanding intelligence itself, arguably one of the greatest problems in science today. In particular, our visual abilities are computationally amazing and we are still far from imitating them with computers. Thus, visual cortex may well be a good proxy for the rest of the... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Exploratory Mining in Cube Space

    Publication Year: 2006, Page(s): 6
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (70 KB)

    Data Mining has evolved as a new discipline at the intersection of several existing areas, including Database Systems, Machine Learning, Optimization, and Statistics. An important question is whether the field has matured to the point where it has originated substantial new problems and techniques that distinguish it from its parent disciplines. In this paper, we discuss a class of new problems an... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Data Mining Methods for Modeling Gene Expression Regulation and Their Applications

    Publication Year: 2006, Page(s): 7
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (96 KB) | HTML iconHTML

    This paper demonstrates machine learning and data mining methods that can be developed and applied to analyzing large quantities of genomic information and gene expression data for characterizing and modeling gene expression regulation. In particular, there will be a discussion on some of the methods that have been developed for modeling gene expression regulation underlying abiotic stress (e.g., ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An Information Theoretic Approach to Detection of Minority Subsets in Database

    Publication Year: 2006, Page(s):11 - 20
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (663 KB) | HTML iconHTML

    Detection of rare and exceptional occurrences in large- scale databases have become an important practice in the field of knowledge discovery and information retrieval. Many databases include large amount of noise or irrelevant data, whose distribution often overlaps with the subsets of exceptional data containing useful knowledge. This paper addresses the problem of finding a small subset of "min... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Bayesian State Space Modeling Approach for Measuring the Effectiveness of Marketing Activities and Baseline Sales from POS Data

    Publication Year: 2006, Page(s):21 - 32
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (239 KB) | HTML iconHTML

    Analysis of point of sales (POS) data is an important research area of marketing science and knowledge discovery, which may enable marketing managers to attain the effective marketing activities. To measure the effectiveness of marketing activities and baseline sales, we develop the multivariate time series modeling method in the framework of a general state space model. A multivariate Poisson mod... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Learning to Use a Learned Model: A Two-Stage Approach to Classification

    Publication Year: 2006, Page(s):33 - 42
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (217 KB) | HTML iconHTML

    Association rule-based classifiers have recently emerged as competitive classification systems. However, there are still deficiencies that hinder their performance. One deficiency is the use of rules in the classification stage. Current systems assign classes to new objects based on the best rule applied or on some predefined scoring of multiple rules. In this paper we propose a new technique wher... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Hierarchical Classification by Expected Utility Maximization

    Publication Year: 2006, Page(s):43 - 52
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (243 KB) | HTML iconHTML

    Hierarchical classification refers to an extension of the standard classification problem, in which labels must be chosen from a class hierarchy. In this paper, we look at hierarchical classification from an information retrieval point of view. More specifically, we consider a scenario in which a user searches a document in a topic hierarchy. This scenario gives rise to the problem of predicting a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • COALA: A Novel Approach for the Extraction of an Alternate Clustering of High Quality and High Dissimilarity

    Publication Year: 2006, Page(s):53 - 62
    Cited by:  Papers (12)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (197 KB) | HTML iconHTML

    Cluster analysis has long been a fundamental task in data mining and machine learning. However, traditional clustering methods concentrate on producing a single solution, even though multiple alternative clusterings may exist. It is thus difficult for the user to validate whether the given solution is in fact appropriate, particularly for large and complex datasets. In this paper we explore the cr... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Cluster Ranking with an Application to Mining Mailbox Networks

    Publication Year: 2006, Page(s):63 - 74
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (309 KB) | HTML iconHTML

    We initiate the study of a new clustering framework, called cluster ranking. Rather than simply partitioning a network into clusters, a cluster ranking algorithm also orders the clusters by their strength. To this end, we introduce a novel strength measure for clusters - the integrated cohesion - which is applicable to arbitrary weighted networks. We then present C-Rank: a new cluster ranking algo... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Large Scale Detection of Irregularities in Accounting Data

    Publication Year: 2006, Page(s):75 - 86
    Cited by:  Papers (6)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (277 KB) | HTML iconHTML

    In recent years, there have been several large accounting frauds where a company's financial results have been intentionally misrepresented by billions of dollars. In response, regulatory bodies have mandated that auditors perform analytics on detailed financial data with the intent of discovering such misstatements. For a large auditing firm, this may mean analyzing millions of records from thous... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Adaptive Blocking: Learning to Scale Up Record Linkage

    Publication Year: 2006, Page(s):87 - 96
    Cited by:  Papers (13)  |  Patents (7)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (154 KB) | HTML iconHTML

    Many data mining tasks require computing similarity between pairs of objects. Pairwise similarity computations are particularly important in record linkage systems, as well as in clustering and schema mapping algorithms. Because the number of object pairs grows quadratically with the size of the dataset, computing similarity between all pairs is impractical and becomes prohibitive for large datase... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Adaptive Parallel Graph Mining for CMP Architectures

    Publication Year: 2006, Page(s):97 - 106
    Cited by:  Papers (10)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (249 KB) | HTML iconHTML

    Mining graph data is an increasingly popular challenge, which has practical applications in many areas, including molecular substructure discovery, Web link analysis, fraud detection, and social network analysis. The problem statement is to enumerate all subgraphs occurring in at least sigma graphs of a database, where sigma is a user specified parameter. Chip multiprocessors (CMPs) provide true p... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.