Article

Asena Atilla Saunders
Asena Atilla Saunders 21 February 2017
Categories Technology

The History Of Data Mining

You might think the history of Data Mining started very recently as it is commonly considered with new technology. However data mining is a discipline with a long history.

You might think the history of Data Mining started very recently as it is commonly considered with new technology. However data mining is a discipline with a long history. It starts with the early Data Mining methods Bayes’ Theorem (1700`s) and Regression analysis (1800`s) which were mostly identifying patterns in data. In this article, we won`t start with `Once upon a time…`, instead we will focus on the recent history and studies. However you can briefly see the major milestones of data mining history on this chronological table below:

 

http://visual.ly/

Data mining is the process of analyzing large data sets (Big Data) from different perspectives and uncovering correlations and patterns to summarize them into useful information. Nowadays it is blended with many techniques such as artificial intelligence, statistics, data science, database theory and machine learning.

Recent history

Increasing power of technology and complexity of data sets has lead Data Mining to evolve from static data delivery to more dynamic and proactive information deliveries; from tapes and disks to advanced algorithms and massive databases (see the table below). In the late 80`s Data Mining term began to be known and used within the research community by statisticians, data analysts, and the management information systems (MIS) communities.

 

Source: http://www.thearling.com/text/dmwhite/dmwhite.htm

By the early 1990`s, data mining was recognized as a sub-process or a step within a larger process called Knowledge Discovery in Databases (KDD) – which gave rise to actually making it ‘the popular guy’. The most commonly used definition of KDD is “The nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data” (Fayyad, 1996).

The sub-processes that form part of the KDD process are;

  1. Understanding of the application and identifying the goal of the KDD process
  2. Creating a target data set
  3. Data cleaning and pre-processing
  4. Matching the goals of the KDD process (step 1) to a particular data-mining method.
  5. Research analysis and hypothesis selection
  6. Data mining: Searching for patterns of interest in a particular form , including classification rules, regression, and clustering
  7. Interpreting mined patterns
  8. Acting on the discovered analysis

The popularity of data mining escalated notably in the 1990`s, with the help of dedicated conferences, in addition to the fast increase in technology, data storage capabilities and computers` processing speeds. It was also possible for organizations to keep data in computer readable form and processing of large volumes of data using desk top machines were not far from reality.

By the end of 1990`s, data mining was already a well-known technique used by the organizations after the introduction of customer loyalty cards. This opened a big door allowing organizations to record customer purchases and data, the resulting data could be mined to identify customer purchasing patterns. The popularity of data mining has continued to grow rapidly over the last decade.

The evaluation of data mining applications

The main focus of data mining was tabular data; however with the evolving technology and different needs new sources were formed to be mined!

  • Text Mining:  Still a popular data mining activity, it categorizes or clusters large document collections such as news articles or web pages.  Another application is opinion mining where the techniques are applied to obtain useful information from the questionnaire style data.
  • Image Mining: In image mining, mining techniques are applied to images (2D and 3D)
  • Graph Mining: It is formed from frequent pattern mining, which is focused on frequently occurring sub-graphs. A popular extension of graph mining is social network mining.

Data mining has become very popular over the last two decades as a discipline in its own.  Data mining applications are used in every field of business, government, and science just to name a few. Starting from text mining, it has evolved a lot and it will be very interesting to watch with the usage of different data (e.g spatial data, different sources of multimedia data) in the future.

Original Content: https://www.exastax.com/big-data/the-history-of-data-mining/ 

Please login or register to add a comment.

Contribute Now!

Loving our articles? Do you have an insightful post that you want to shout about? Well, you've come to the right place! We are always looking for fresh Doughnuts to be a part of our community.

Popular Articles

See all
How to Review a Website — A Guide for Beginners

How to Review a Website — A Guide for Beginners

A company website is crucial for any business's digital marketing strategy. To keep up with the changing trends and customer buying behaviors, it's important to review and make necessary changes regularly...

Digital Doughnut Contributor
Digital Doughnut Contributor 25 March 2024
Read more
The Impact of New Technology on Marketing

The Impact of New Technology on Marketing

Technology has impacted every part of our lives. From household chores to business disciplines and etiquette, there's a gadget or app for it. Marketing has changed dramatically over the years, but what is the...

Alex Lysak
Alex Lysak 3 April 2024
Read more
The IoT-Connected Car of Today—Case Studies

The IoT-Connected Car of Today—Case Studies

Imagine a world where your car not only drives itself, but also says intelligent things. This would look like an impossibility about five years ago, but today the IoT is already breaking fresh ground for tech...

Ronald van Loon
Ronald van Loon 10 March 2017
Read more
10 Factors that Influence Customer Buying Behaviour Online

10 Factors that Influence Customer Buying Behaviour Online

Now is an era where customers take the center stags influencing business strategies across industries. No business can afford to overlook factors that could either break the customer experience or even pose a risk of...

Edward Roesch
Edward Roesch 4 June 2018
Read more
AI-driven Personalisation Dominates the Future of Travel and Hospitality Marketing

AI-driven Personalisation Dominates the Future of Travel and Hospitality Marketing

Travel and hospitality marketing is on the verge of dramatic transformation. Personalisation of marketing communications is moving from dream to necessity, and the change is putting marketers in the driving seat.

Michael Nutley
Michael Nutley 3 December 2024
Read more