next up previous contents
Next: Data Mining Techniques Up: Data Mining and Document Previous: Data Mining and Document

Introduction

The amount of electronically available data has grown rapidly because of increase in use of electronic data gathering devices, e.g., point-of-sale, remote sensing devices etc., and because data storage has become easier and cheaper with increasing computing power and disk storage capacity.

Data base management systems (DBMSs) have given access to the data stored but they give no analysis of the data. Analysis is required to reveal the hidden relationships within the data, for instance, for decision support. Size of databases has increased and therefore there is a strong need for automated techniques for automated analysis. The solution is data mining that has been defined as:

Data mining has many synonyms and related areas of research. One of the most popular alternatives for naming the area is Knowledge Discovery in Databases (KDD). In the list of frequently asked questions [gif] KDD is characterized as follows:

The structure of this chapter is partly based on a WWW course material prepared in the Queen's University of Belfast [gif]. For introductory texts on the topic see, e.g., [gif,gif,gif].


next up previous contents
Next: Data Mining Techniques Up: Data Mining and Document Previous: Data Mining and Document

Heikki Hy|tyniemi
Tue Aug 5 14:39:14 EET DST 1997