Article

Asena Atilla Saunders
Asena Atilla Saunders 21 February 2017
Categories Technology

The History Of Data Mining

You might think the history of Data Mining started very recently as it is commonly considered with new technology. However data mining is a discipline with a long history.

You might think the history of Data Mining started very recently as it is commonly considered with new technology. However data mining is a discipline with a long history. It starts with the early Data Mining methods Bayes’ Theorem (1700`s) and Regression analysis (1800`s) which were mostly identifying patterns in data. In this article, we won`t start with `Once upon a time…`, instead we will focus on the recent history and studies. However you can briefly see the major milestones of data mining history on this chronological table below:

 

http://visual.ly/

Data mining is the process of analyzing large data sets (Big Data) from different perspectives and uncovering correlations and patterns to summarize them into useful information. Nowadays it is blended with many techniques such as artificial intelligence, statistics, data science, database theory and machine learning.

Recent history

Increasing power of technology and complexity of data sets has lead Data Mining to evolve from static data delivery to more dynamic and proactive information deliveries; from tapes and disks to advanced algorithms and massive databases (see the table below). In the late 80`s Data Mining term began to be known and used within the research community by statisticians, data analysts, and the management information systems (MIS) communities.

 

Source: http://www.thearling.com/text/dmwhite/dmwhite.htm

By the early 1990`s, data mining was recognized as a sub-process or a step within a larger process called Knowledge Discovery in Databases (KDD) – which gave rise to actually making it ‘the popular guy’. The most commonly used definition of KDD is “The nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data” (Fayyad, 1996).

The sub-processes that form part of the KDD process are;

  1. Understanding of the application and identifying the goal of the KDD process
  2. Creating a target data set
  3. Data cleaning and pre-processing
  4. Matching the goals of the KDD process (step 1) to a particular data-mining method.
  5. Research analysis and hypothesis selection
  6. Data mining: Searching for patterns of interest in a particular form , including classification rules, regression, and clustering
  7. Interpreting mined patterns
  8. Acting on the discovered analysis

The popularity of data mining escalated notably in the 1990`s, with the help of dedicated conferences, in addition to the fast increase in technology, data storage capabilities and computers` processing speeds. It was also possible for organizations to keep data in computer readable form and processing of large volumes of data using desk top machines were not far from reality.

By the end of 1990`s, data mining was already a well-known technique used by the organizations after the introduction of customer loyalty cards. This opened a big door allowing organizations to record customer purchases and data, the resulting data could be mined to identify customer purchasing patterns. The popularity of data mining has continued to grow rapidly over the last decade.

The evaluation of data mining applications

The main focus of data mining was tabular data; however with the evolving technology and different needs new sources were formed to be mined!

  • Text Mining:  Still a popular data mining activity, it categorizes or clusters large document collections such as news articles or web pages.  Another application is opinion mining where the techniques are applied to obtain useful information from the questionnaire style data.
  • Image Mining: In image mining, mining techniques are applied to images (2D and 3D)
  • Graph Mining: It is formed from frequent pattern mining, which is focused on frequently occurring sub-graphs. A popular extension of graph mining is social network mining.

Data mining has become very popular over the last two decades as a discipline in its own.  Data mining applications are used in every field of business, government, and science just to name a few. Starting from text mining, it has evolved a lot and it will be very interesting to watch with the usage of different data (e.g spatial data, different sources of multimedia data) in the future.

Original Content: https://www.exastax.com/big-data/the-history-of-data-mining/ 

Please login or register to add a comment.

Contribute Now!

Loving our articles? Do you have an insightful post that you want to shout about? Well, you've come to the right place! We are always looking for fresh Doughnuts to be a part of our community.

Popular Articles

See all
Digital Marketing Vs. Traditional Marketing: Which One Is Better?

Digital Marketing Vs. Traditional Marketing: Which One Is Better?

What's the difference between digital marketing and traditional marketing, and why does it matter? The answers may surprise you.

Julie Cave
Julie Cave 14 July 2016
Read more
4 Important Digital Marketing Channels You Should Know About

4 Important Digital Marketing Channels You Should Know About

It goes without saying that a company can't do without digital marketing in today's world.

Digital Doughnut Contributor
Digital Doughnut Contributor 5 November 2014
Read more
Are virtual reality and augmented reality the next step for CMS?

Are virtual reality and augmented reality the next step for CMS?

In a fitting conclusion to a year many hailed as the “year of virtual reality” (VR), in December 2016, WordPress made it possible for users to create and publish VR content on any WordPress site. The change meant that users could publish 360-degree photos on their sites, and regular photos and panoramas could be viewed in VR.

Leonie Mercedes
Leonie Mercedes 16 October 2017
Read more
Collection Of The Best Email Testing Tools Online

Collection Of The Best Email Testing Tools Online

Don’t be afraid of email testing. There are many free or freemium tools online that can help you with testing your SPAM score, deliverability and even the rendering of your email. We feature 30 email testing tools in this article. Check out the complete list!

Roland Pokornyik
Roland Pokornyik 31 October 2016
Read more
Smash your customer acquisition targets with these AB tests

Smash your customer acquisition targets with these AB tests

Don't second guess what will work best to drive new customer acquisition via referral. Plan a program of AB tests to maximise results and you could benefit from a 5-7x improvement in results!

Angela Southall
Angela Southall 17 October 2017
Read more