Article

Asena Atilla Saunders
Asena Atilla Saunders 21 February 2017
Categories Technology

The History Of Data Mining

You might think the history of Data Mining started very recently as it is commonly considered with new technology. However data mining is a discipline with a long history.

You might think the history of Data Mining started very recently as it is commonly considered with new technology. However data mining is a discipline with a long history. It starts with the early Data Mining methods Bayes’ Theorem (1700`s) and Regression analysis (1800`s) which were mostly identifying patterns in data. In this article, we won`t start with `Once upon a time…`, instead we will focus on the recent history and studies. However you can briefly see the major milestones of data mining history on this chronological table below:

 

http://visual.ly/

Data mining is the process of analyzing large data sets (Big Data) from different perspectives and uncovering correlations and patterns to summarize them into useful information. Nowadays it is blended with many techniques such as artificial intelligence, statistics, data science, database theory and machine learning.

Recent history

Increasing power of technology and complexity of data sets has lead Data Mining to evolve from static data delivery to more dynamic and proactive information deliveries; from tapes and disks to advanced algorithms and massive databases (see the table below). In the late 80`s Data Mining term began to be known and used within the research community by statisticians, data analysts, and the management information systems (MIS) communities.

 

Source: http://www.thearling.com/text/dmwhite/dmwhite.htm

By the early 1990`s, data mining was recognized as a sub-process or a step within a larger process called Knowledge Discovery in Databases (KDD) – which gave rise to actually making it ‘the popular guy’. The most commonly used definition of KDD is “The nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data” (Fayyad, 1996).

The sub-processes that form part of the KDD process are;

  1. Understanding of the application and identifying the goal of the KDD process
  2. Creating a target data set
  3. Data cleaning and pre-processing
  4. Matching the goals of the KDD process (step 1) to a particular data-mining method.
  5. Research analysis and hypothesis selection
  6. Data mining: Searching for patterns of interest in a particular form , including classification rules, regression, and clustering
  7. Interpreting mined patterns
  8. Acting on the discovered analysis

The popularity of data mining escalated notably in the 1990`s, with the help of dedicated conferences, in addition to the fast increase in technology, data storage capabilities and computers` processing speeds. It was also possible for organizations to keep data in computer readable form and processing of large volumes of data using desk top machines were not far from reality.

By the end of 1990`s, data mining was already a well-known technique used by the organizations after the introduction of customer loyalty cards. This opened a big door allowing organizations to record customer purchases and data, the resulting data could be mined to identify customer purchasing patterns. The popularity of data mining has continued to grow rapidly over the last decade.

The evaluation of data mining applications

The main focus of data mining was tabular data; however with the evolving technology and different needs new sources were formed to be mined!

  • Text Mining:  Still a popular data mining activity, it categorizes or clusters large document collections such as news articles or web pages.  Another application is opinion mining where the techniques are applied to obtain useful information from the questionnaire style data.
  • Image Mining: In image mining, mining techniques are applied to images (2D and 3D)
  • Graph Mining: It is formed from frequent pattern mining, which is focused on frequently occurring sub-graphs. A popular extension of graph mining is social network mining.

Data mining has become very popular over the last two decades as a discipline in its own.  Data mining applications are used in every field of business, government, and science just to name a few. Starting from text mining, it has evolved a lot and it will be very interesting to watch with the usage of different data (e.g spatial data, different sources of multimedia data) in the future.

Original Content: https://www.exastax.com/big-data/the-history-of-data-mining/ 

Please login or register to add a comment.

Contribute Now!

Loving our articles? Do you have an insightful post that you want to shout about? Well, you've come to the right place! We are always looking for fresh Doughnuts to be a part of our community.

Popular Articles

See all
7 reasons why social media marketing is important for your business

7 reasons why social media marketing is important for your business

Social media is quickly becoming one of the most important aspects of digital marketing, which provides incredible benefits that help reach millions of customers worldwide. And if you are not applying this profitable...

Sharron Nelson
Sharron Nelson 6 February 2018
Read more
Digital Marketing Vs. Traditional Marketing: Which One Is Better?

Digital Marketing Vs. Traditional Marketing: Which One Is Better?

What's the difference between digital marketing and traditional marketing, and why does it matter? The answers may surprise you.

Julie Cave
Julie Cave 14 July 2016
Read more
Top 10 B2B Platforms to Help your Business Grow Worldwide

Top 10 B2B Platforms to Help your Business Grow Worldwide

Although the trend of a Business to Business portal is not new but the evolution of technology has indeed changed the way they function. Additional digital trading features and branding has taken the place of...

Salman Sharif
Salman Sharif 7 July 2017
Read more
What Marketing Content Do Different Age Groups like to Consume?

What Marketing Content Do Different Age Groups like to Consume?

Today marketers have a wide choice of different content types to create; from video to blogs, from memes to whitepapers. But which types of content are most suitable for different age groups?

Lisa Curry
Lisa Curry 21 October 2016
Read more
Collection Of The Best Email Testing Tools Online

Collection Of The Best Email Testing Tools Online

Don’t be afraid of email testing. There are many free or freemium tools online that can help you with testing your SPAM score, deliverability and even the rendering of your email. We feature 30 email testing tools in...

Roland Pokornyik
Roland Pokornyik 31 October 2016
Read more