Statistical Themes and Lessons for Data Mining
Journal contribution posted on 01.01.1997 by Clark Glymour, David Madigan, Daryl Pregibon, Padhraic Smyth
Data mining is on the interface of Computer Science and Statistics, utilizing advances in both disciplines to make progress in extracting information from large databases. It is an emerging field that has attracted much attention in a very short period of time. This article highlights some statistical themes and lessons that are directly relevant to data mining and attempts to identify opportunities where close cooperation between the statistical and computational communities might reasonably provide synergy for further progress in data analysis.