SlideShare a Scribd company logo
INTRODUCTION TO DATA MINING
TECHNIQUE
By – Pawneshwar Datt Rai
WHAT IS DATA MINING?
 Data mining is also called knowledge discovery and data
mining (KDD)
 Data mining is
 extraction of useful patterns from data sources, e.g.,
databases, texts, web, image.
 Patterns must be:
 valid, novel, potentially useful, understandable
This PPT presented By - Pawneshwar Datt Rai
EXAMPLE OF DISCOVERED PATTERNS
 Association rules:
“80% of customers who buy cheese and milk also buy
bread, and 5% of customers buy all of them together”
Cheese, Milk Bread [sup =5%, confid=80%]
This PPT presented By - Pawneshwar Datt Rai
MAIN DATA MINING TASKS
 Classification:
mining patterns that can classify future data into known
classes.
 Association rule mining
mining any rule of the form X  Y, where X and Y are
sets of data items.
 Clustering
identifying a set of similarity groups in the data
This PPT presented By - Pawneshwar Datt Rai
MAIN DATA MINING TASKS
 Sequential pattern mining:
A sequential rule: A B, says that event A will be
immediately followed by event B with a certain confidence
 Deviation detection:
discovering the most significant changes in data
 Data visualization: using graphical methods to show
patterns in data.
This PPT presented By - Pawneshwar Datt Rai
WHY IS DATA MINING IMPORTANT?
 Rapid computerization of businesses produce huge
amount of data
 How to make best use of data?
 A growing realization: knowledge discovered from
data can be used for competitive advantage.
This PPT presented By - Pawneshwar Datt Rai
WHY IS DATA MINING NECESSARY?
 Make use of your data assets
 There is a big gap from stored data to knowledge; and
the transition won’t occur automatically.
 Many interesting things you want to find cannot be found
using database queries
“find me people likely to buy my products”
“Who are likely to respond to my promotion”
This PPT presented By - Pawneshwar Datt Rai
WHY DATA MINING NOW?
 The data is abundant.
 The data is being warehoused.
 The computing power is affordable.
 The competitive pressure is strong.
 Data mining tools have become available
This PPT presented By - Pawneshwar Datt Rai
RELATED FIELDS
 Data mining is an emerging multi-disciplinary field:
Statistics
Machine learning
Databases
Information retrieval
Visualization
etc.
This PPT presented By - Pawneshwar Datt Rai
DATA MINING (KDD) PROCESS
 Understand the application domain
 Identify data sources and select target data
 Pre-process: cleaning, attribute selection
 Data mining to extract patterns or models
 Post-process: identifying interesting or useful patterns
 Incorporate patterns in real world tasks
This PPT presented By - Pawneshwar Datt Rai
DATA MINING APPLICATIONS
 Marketing, customer profiling and retention,
identifying potential customers, market
segmentation.
 Fraud detection
identifying credit card fraud, intrusion detection
 Scientific data analysis
 Text and web mining
 Any application that involves a large amount of data.
This PPT presented By - Pawneshwar Datt Rai
WEB DATA EXTRACTION
Data
region1
Data
region2
A data
record
A data
record
This PPT presented By - Pawneshwar Datt Rai
OPINION ANALYSIS
 Word-of-mouth on the Web
 The Web has dramatically changed the way that
consumers express their opinions.
 One can post reviews of products at merchant
sites, Web forums, discussion groups, blogs
 Techniques are being developed to exploit these
sources.
 Benefits of Review Analysis
 Potential Customer: No need to read many reviews
 Product manufacturer: market intelligence, product
benchmarking
This PPT presented By - Pawneshwar Datt Rai
FEATURE BASED ANALYSIS &
SUMMARIZATION
 Extracting product features (called Opinion Features) that
have been commented on by customers.
 Identifying opinion sentences in each review and
deciding whether each opinion sentence is positive or
negative.
 Summarizing and comparing results.
This PPT presented By - Pawneshwar Datt Rai
A Happy and Prosperous day to all friends.
This PPT presented By – Pawneshwar Datt Rai
ThisPPTpresentedBy-PawneshwarDattRai
Ad

More Related Content

What's hot (20)

The Data Warehouse Lifecycle
The Data Warehouse LifecycleThe Data Warehouse Lifecycle
The Data Warehouse Lifecycle
bartlowe
 
01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.
Institute of Technology Telkom
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
DataminingTools Inc
 
3. mining frequent patterns
3. mining frequent patterns3. mining frequent patterns
3. mining frequent patterns
Azad public school
 
Data warehouse
Data warehouseData warehouse
Data warehouse
krishna kumar singh
 
Data preparation
Data preparationData preparation
Data preparation
Tony Nguyen
 
Lecture6 introduction to data streams
Lecture6 introduction to data streamsLecture6 introduction to data streams
Lecture6 introduction to data streams
hktripathy
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
janani thirupathi
 
introduction to data mining tutorial
introduction to data mining tutorial introduction to data mining tutorial
introduction to data mining tutorial
Salah Amean
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
hktripathy
 
Data mining
Data miningData mining
Data mining
Birju Tank
 
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DI&A Slides: Descriptive, Prescriptive, and Predictive AnalyticsDI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DATAVERSITY
 
Clustering in Data Mining
Clustering in Data MiningClustering in Data Mining
Clustering in Data Mining
Archana Swaminathan
 
DATA PREPROCESSING AND DATA CLEANSING
DATA PREPROCESSING AND DATA CLEANSINGDATA PREPROCESSING AND DATA CLEANSING
DATA PREPROCESSING AND DATA CLEANSING
Ahtesham Ullah khan
 
Data Engineering Basics
Data Engineering BasicsData Engineering Basics
Data Engineering Basics
Catherine Kimani
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
Vikram Nandini
 
A Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining PresentationA Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining Presentation
millerca2
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
Phi Jack
 
Association rule mining and Apriori algorithm
Association rule mining and Apriori algorithmAssociation rule mining and Apriori algorithm
Association rule mining and Apriori algorithm
hina firdaus
 
Introduction to Data mining
Introduction to Data miningIntroduction to Data mining
Introduction to Data mining
Hadi Fadlallah
 
The Data Warehouse Lifecycle
The Data Warehouse LifecycleThe Data Warehouse Lifecycle
The Data Warehouse Lifecycle
bartlowe
 
Data preparation
Data preparationData preparation
Data preparation
Tony Nguyen
 
Lecture6 introduction to data streams
Lecture6 introduction to data streamsLecture6 introduction to data streams
Lecture6 introduction to data streams
hktripathy
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
janani thirupathi
 
introduction to data mining tutorial
introduction to data mining tutorial introduction to data mining tutorial
introduction to data mining tutorial
Salah Amean
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
hktripathy
 
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DI&A Slides: Descriptive, Prescriptive, and Predictive AnalyticsDI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DATAVERSITY
 
DATA PREPROCESSING AND DATA CLEANSING
DATA PREPROCESSING AND DATA CLEANSINGDATA PREPROCESSING AND DATA CLEANSING
DATA PREPROCESSING AND DATA CLEANSING
Ahtesham Ullah khan
 
A Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining PresentationA Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining Presentation
millerca2
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
Phi Jack
 
Association rule mining and Apriori algorithm
Association rule mining and Apriori algorithmAssociation rule mining and Apriori algorithm
Association rule mining and Apriori algorithm
hina firdaus
 
Introduction to Data mining
Introduction to Data miningIntroduction to Data mining
Introduction to Data mining
Hadi Fadlallah
 

Viewers also liked (17)

Data mining
Data miningData mining
Data mining
Akannsha Totewar
 
Transactional Data Mining
Transactional Data MiningTransactional Data Mining
Transactional Data Mining
Ted Dunning
 
Data mining slides
Data mining slidesData mining slides
Data mining slides
smj
 
Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesData mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
Saif Ullah
 
Chapter 08 Data Mining Techniques
Chapter 08 Data Mining Techniques Chapter 08 Data Mining Techniques
Chapter 08 Data Mining Techniques
Houw Liong The
 
Chapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & KamberChapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
error007
 
Data mining and its applications!
Data mining and its applications!Data mining and its applications!
Data mining and its applications!
COSTARCH Analytical Consulting (P) Ltd.
 
Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.
SlideTeam.net
 
Significance of Data Mining
Significance of Data MiningSignificance of Data Mining
Significance of Data Mining
8trackweb
 
Lecture 01 Data Mining
Lecture 01 Data MiningLecture 01 Data Mining
Lecture 01 Data Mining
Pier Luca Lanzi
 
Data Mining Overview
Data Mining OverviewData Mining Overview
Data Mining Overview
Golda Margret Sheeba J
 
Decision trees
Decision treesDecision trees
Decision trees
Jagjit Wilku
 
DATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGEDATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGE
Neeraj Goswami
 
Decision tree example problem
Decision tree example problemDecision tree example problem
Decision tree example problem
SATYABRATA PRADHAN
 
Decision Trees
Decision TreesDecision Trees
Decision Trees
International School of Engineering
 
Ch 1 Intro to Data Mining
Ch 1 Intro to Data MiningCh 1 Intro to Data Mining
Ch 1 Intro to Data Mining
Sushil Kulkarni
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining Concepts
Dung Nguyen
 
Transactional Data Mining
Transactional Data MiningTransactional Data Mining
Transactional Data Mining
Ted Dunning
 
Data mining slides
Data mining slidesData mining slides
Data mining slides
smj
 
Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesData mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
Saif Ullah
 
Chapter 08 Data Mining Techniques
Chapter 08 Data Mining Techniques Chapter 08 Data Mining Techniques
Chapter 08 Data Mining Techniques
Houw Liong The
 
Chapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & KamberChapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
error007
 
Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.
SlideTeam.net
 
Significance of Data Mining
Significance of Data MiningSignificance of Data Mining
Significance of Data Mining
8trackweb
 
DATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGEDATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGE
Neeraj Goswami
 
Ch 1 Intro to Data Mining
Ch 1 Intro to Data MiningCh 1 Intro to Data Mining
Ch 1 Intro to Data Mining
Sushil Kulkarni
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining Concepts
Dung Nguyen
 
Ad

Similar to Introduction to data mining technique (20)

Delivering Value Through Business Analytics
Delivering Value Through Business AnalyticsDelivering Value Through Business Analytics
Delivering Value Through Business Analytics
Social Media Today
 
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
roniczogia
 
Bootstrap Big Data Webinar
Bootstrap Big Data WebinarBootstrap Big Data Webinar
Bootstrap Big Data Webinar
Jane Truch
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
Siwawong Wuttipongprasert
 
H2O World - What you need before doing predictive analysis - Keen.io
H2O World - What you need before doing predictive analysis - Keen.ioH2O World - What you need before doing predictive analysis - Keen.io
H2O World - What you need before doing predictive analysis - Keen.io
Sri Ambati
 
Data Science for Marketing
Data Science for MarketingData Science for Marketing
Data Science for Marketing
Komes Chandavimol
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Ann Venkataraman
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
Bala Iyer
 
Operationalizing Customer Analytics with Azure and Power BI
Operationalizing Customer Analytics with Azure and Power BIOperationalizing Customer Analytics with Azure and Power BI
Operationalizing Customer Analytics with Azure and Power BI
CCG
 
Social Media Analytics
Social Media AnalyticsSocial Media Analytics
Social Media Analytics
Aaum Research and Analytics Private Limited
 
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
skubeozane3j
 
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdfChapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
hamsalubekana
 
Smart Data Module 2 d drive_own data
Smart Data Module 2 d drive_own dataSmart Data Module 2 d drive_own data
Smart Data Module 2 d drive_own data
caniceconsulting
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
SPAN Infotech (India) Pvt Ltd
 
Data Mining
Data MiningData Mining
Data Mining
vihangshah12
 
Riding the wave of analytics revolution
Riding the wave of analytics revolutionRiding the wave of analytics revolution
Riding the wave of analytics revolution
Tanuj Poddar
 
data analytics lecture2.pptx
data analytics lecture2.pptxdata analytics lecture2.pptx
data analytics lecture2.pptx
NamrataBhatt8
 
Smarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationSmarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with Automation
Inside Analysis
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
Seerat Malik
 
Explainability for Natural Language Processing
Explainability for Natural Language ProcessingExplainability for Natural Language Processing
Explainability for Natural Language Processing
Yunyao Li
 
Delivering Value Through Business Analytics
Delivering Value Through Business AnalyticsDelivering Value Through Business Analytics
Delivering Value Through Business Analytics
Social Media Today
 
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
roniczogia
 
Bootstrap Big Data Webinar
Bootstrap Big Data WebinarBootstrap Big Data Webinar
Bootstrap Big Data Webinar
Jane Truch
 
H2O World - What you need before doing predictive analysis - Keen.io
H2O World - What you need before doing predictive analysis - Keen.ioH2O World - What you need before doing predictive analysis - Keen.io
H2O World - What you need before doing predictive analysis - Keen.io
Sri Ambati
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Ann Venkataraman
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
Bala Iyer
 
Operationalizing Customer Analytics with Azure and Power BI
Operationalizing Customer Analytics with Azure and Power BIOperationalizing Customer Analytics with Azure and Power BI
Operationalizing Customer Analytics with Azure and Power BI
CCG
 
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
skubeozane3j
 
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdfChapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
hamsalubekana
 
Smart Data Module 2 d drive_own data
Smart Data Module 2 d drive_own dataSmart Data Module 2 d drive_own data
Smart Data Module 2 d drive_own data
caniceconsulting
 
Riding the wave of analytics revolution
Riding the wave of analytics revolutionRiding the wave of analytics revolution
Riding the wave of analytics revolution
Tanuj Poddar
 
data analytics lecture2.pptx
data analytics lecture2.pptxdata analytics lecture2.pptx
data analytics lecture2.pptx
NamrataBhatt8
 
Smarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationSmarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with Automation
Inside Analysis
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
Seerat Malik
 
Explainability for Natural Language Processing
Explainability for Natural Language ProcessingExplainability for Natural Language Processing
Explainability for Natural Language Processing
Yunyao Li
 
Ad

More from Pawneshwar Datt Rai (15)

Venture capital in india
Venture capital in indiaVenture capital in india
Venture capital in india
Pawneshwar Datt Rai
 
Service support process ppt
Service support process pptService support process ppt
Service support process ppt
Pawneshwar Datt Rai
 
Online payment
Online paymentOnline payment
Online payment
Pawneshwar Datt Rai
 
Quick Learning or Effective Learning
Quick Learning or Effective LearningQuick Learning or Effective Learning
Quick Learning or Effective Learning
Pawneshwar Datt Rai
 
Latter Writting
Latter WrittingLatter Writting
Latter Writting
Pawneshwar Datt Rai
 
Geometric series
Geometric seriesGeometric series
Geometric series
Pawneshwar Datt Rai
 
Facebook marketing
Facebook marketingFacebook marketing
Facebook marketing
Pawneshwar Datt Rai
 
Big data
Big dataBig data
Big data
Pawneshwar Datt Rai
 
Welcome All Of You
Welcome  All  Of  YouWelcome  All  Of  You
Welcome All Of You
Pawneshwar Datt Rai
 
Service Support Process PPT
Service Support Process PPTService Support Process PPT
Service Support Process PPT
Pawneshwar Datt Rai
 
Geometric Series
Geometric SeriesGeometric Series
Geometric Series
Pawneshwar Datt Rai
 
Big Data
Big DataBig Data
Big Data
Pawneshwar Datt Rai
 
Presentation on a Book
Presentation  on a BookPresentation  on a Book
Presentation on a Book
Pawneshwar Datt Rai
 
Learning
LearningLearning
Learning
Pawneshwar Datt Rai
 
Online Payment
Online PaymentOnline Payment
Online Payment
Pawneshwar Datt Rai
 

Recently uploaded (20)

Process Mining as Enabler for Digital Transformations
Process Mining as Enabler for Digital TransformationsProcess Mining as Enabler for Digital Transformations
Process Mining as Enabler for Digital Transformations
Process mining Evangelist
 
abebaw power point presentation esis october.ppt
abebaw power point presentation esis october.pptabebaw power point presentation esis october.ppt
abebaw power point presentation esis october.ppt
mihretwodage
 
Mixed Methods Research.pptx education 201
Mixed Methods Research.pptx education 201Mixed Methods Research.pptx education 201
Mixed Methods Research.pptx education 201
GraceSolaa1
 
presentacion.slideshare.informáticaJuridica..pptx
presentacion.slideshare.informáticaJuridica..pptxpresentacion.slideshare.informáticaJuridica..pptx
presentacion.slideshare.informáticaJuridica..pptx
GersonVillatoro4
 
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Jayantilal Bhanushali
 
Process Mining at Deutsche Bank - Journey
Process Mining at Deutsche Bank - JourneyProcess Mining at Deutsche Bank - Journey
Process Mining at Deutsche Bank - Journey
Process mining Evangelist
 
MLOps_with_SageMaker_Template_EN idioma inglés
MLOps_with_SageMaker_Template_EN idioma inglésMLOps_with_SageMaker_Template_EN idioma inglés
MLOps_with_SageMaker_Template_EN idioma inglés
FabianPierrePeaJacob
 
Ann Naser Nabil- Data Scientist Portfolio.pdf
Ann Naser Nabil- Data Scientist Portfolio.pdfAnn Naser Nabil- Data Scientist Portfolio.pdf
Ann Naser Nabil- Data Scientist Portfolio.pdf
আন্ নাসের নাবিল
 
How to Set Up Process Mining in a Decentralized Organization?
How to Set Up Process Mining in a Decentralized Organization?How to Set Up Process Mining in a Decentralized Organization?
How to Set Up Process Mining in a Decentralized Organization?
Process mining Evangelist
 
Controlling Financial Processes at a Municipality
Controlling Financial Processes at a MunicipalityControlling Financial Processes at a Municipality
Controlling Financial Processes at a Municipality
Process mining Evangelist
 
Transforming health care with ai powered
Transforming health care with ai poweredTransforming health care with ai powered
Transforming health care with ai powered
gowthamarvj
 
Storage Devices and the Mechanism of Data Storage in Audio and Visual Form
Storage Devices and the Mechanism of Data Storage in Audio and Visual FormStorage Devices and the Mechanism of Data Storage in Audio and Visual Form
Storage Devices and the Mechanism of Data Storage in Audio and Visual Form
Professional Content Writing's
 
AWS Certified Machine Learning Slides.pdf
AWS Certified Machine Learning Slides.pdfAWS Certified Machine Learning Slides.pdf
AWS Certified Machine Learning Slides.pdf
philsparkshome
 
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
muhammed84essa
 
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm     mmmmmfftro.pptxlecture_13 tree in mmmmmmmm     mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
sarajafffri058
 
Sets theories and applications that can used to imporve knowledge
Sets theories and applications that can used to imporve knowledgeSets theories and applications that can used to imporve knowledge
Sets theories and applications that can used to imporve knowledge
saumyasl2020
 
Introduction to systems thinking tools_Eng.pdf
Introduction to systems thinking tools_Eng.pdfIntroduction to systems thinking tools_Eng.pdf
Introduction to systems thinking tools_Eng.pdf
AbdurahmanAbd
 
What is ETL? Difference between ETL and ELT?.pdf
What is ETL? Difference between ETL and ELT?.pdfWhat is ETL? Difference between ETL and ELT?.pdf
What is ETL? Difference between ETL and ELT?.pdf
SaikatBasu37
 
Lesson 6-Interviewing in SHRM_updated.pdf
Lesson 6-Interviewing in SHRM_updated.pdfLesson 6-Interviewing in SHRM_updated.pdf
Lesson 6-Interviewing in SHRM_updated.pdf
hemelali11
 
Lagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdfLagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdf
benuju2016
 
Process Mining as Enabler for Digital Transformations
Process Mining as Enabler for Digital TransformationsProcess Mining as Enabler for Digital Transformations
Process Mining as Enabler for Digital Transformations
Process mining Evangelist
 
abebaw power point presentation esis october.ppt
abebaw power point presentation esis october.pptabebaw power point presentation esis october.ppt
abebaw power point presentation esis october.ppt
mihretwodage
 
Mixed Methods Research.pptx education 201
Mixed Methods Research.pptx education 201Mixed Methods Research.pptx education 201
Mixed Methods Research.pptx education 201
GraceSolaa1
 
presentacion.slideshare.informáticaJuridica..pptx
presentacion.slideshare.informáticaJuridica..pptxpresentacion.slideshare.informáticaJuridica..pptx
presentacion.slideshare.informáticaJuridica..pptx
GersonVillatoro4
 
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Jayantilal Bhanushali
 
MLOps_with_SageMaker_Template_EN idioma inglés
MLOps_with_SageMaker_Template_EN idioma inglésMLOps_with_SageMaker_Template_EN idioma inglés
MLOps_with_SageMaker_Template_EN idioma inglés
FabianPierrePeaJacob
 
How to Set Up Process Mining in a Decentralized Organization?
How to Set Up Process Mining in a Decentralized Organization?How to Set Up Process Mining in a Decentralized Organization?
How to Set Up Process Mining in a Decentralized Organization?
Process mining Evangelist
 
Controlling Financial Processes at a Municipality
Controlling Financial Processes at a MunicipalityControlling Financial Processes at a Municipality
Controlling Financial Processes at a Municipality
Process mining Evangelist
 
Transforming health care with ai powered
Transforming health care with ai poweredTransforming health care with ai powered
Transforming health care with ai powered
gowthamarvj
 
Storage Devices and the Mechanism of Data Storage in Audio and Visual Form
Storage Devices and the Mechanism of Data Storage in Audio and Visual FormStorage Devices and the Mechanism of Data Storage in Audio and Visual Form
Storage Devices and the Mechanism of Data Storage in Audio and Visual Form
Professional Content Writing's
 
AWS Certified Machine Learning Slides.pdf
AWS Certified Machine Learning Slides.pdfAWS Certified Machine Learning Slides.pdf
AWS Certified Machine Learning Slides.pdf
philsparkshome
 
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
muhammed84essa
 
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm     mmmmmfftro.pptxlecture_13 tree in mmmmmmmm     mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
sarajafffri058
 
Sets theories and applications that can used to imporve knowledge
Sets theories and applications that can used to imporve knowledgeSets theories and applications that can used to imporve knowledge
Sets theories and applications that can used to imporve knowledge
saumyasl2020
 
Introduction to systems thinking tools_Eng.pdf
Introduction to systems thinking tools_Eng.pdfIntroduction to systems thinking tools_Eng.pdf
Introduction to systems thinking tools_Eng.pdf
AbdurahmanAbd
 
What is ETL? Difference between ETL and ELT?.pdf
What is ETL? Difference between ETL and ELT?.pdfWhat is ETL? Difference between ETL and ELT?.pdf
What is ETL? Difference between ETL and ELT?.pdf
SaikatBasu37
 
Lesson 6-Interviewing in SHRM_updated.pdf
Lesson 6-Interviewing in SHRM_updated.pdfLesson 6-Interviewing in SHRM_updated.pdf
Lesson 6-Interviewing in SHRM_updated.pdf
hemelali11
 
Lagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdfLagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdf
benuju2016
 

Introduction to data mining technique

  • 1. INTRODUCTION TO DATA MINING TECHNIQUE By – Pawneshwar Datt Rai
  • 2. WHAT IS DATA MINING?  Data mining is also called knowledge discovery and data mining (KDD)  Data mining is  extraction of useful patterns from data sources, e.g., databases, texts, web, image.  Patterns must be:  valid, novel, potentially useful, understandable This PPT presented By - Pawneshwar Datt Rai
  • 3. EXAMPLE OF DISCOVERED PATTERNS  Association rules: “80% of customers who buy cheese and milk also buy bread, and 5% of customers buy all of them together” Cheese, Milk Bread [sup =5%, confid=80%] This PPT presented By - Pawneshwar Datt Rai
  • 4. MAIN DATA MINING TASKS  Classification: mining patterns that can classify future data into known classes.  Association rule mining mining any rule of the form X  Y, where X and Y are sets of data items.  Clustering identifying a set of similarity groups in the data This PPT presented By - Pawneshwar Datt Rai
  • 5. MAIN DATA MINING TASKS  Sequential pattern mining: A sequential rule: A B, says that event A will be immediately followed by event B with a certain confidence  Deviation detection: discovering the most significant changes in data  Data visualization: using graphical methods to show patterns in data. This PPT presented By - Pawneshwar Datt Rai
  • 6. WHY IS DATA MINING IMPORTANT?  Rapid computerization of businesses produce huge amount of data  How to make best use of data?  A growing realization: knowledge discovered from data can be used for competitive advantage. This PPT presented By - Pawneshwar Datt Rai
  • 7. WHY IS DATA MINING NECESSARY?  Make use of your data assets  There is a big gap from stored data to knowledge; and the transition won’t occur automatically.  Many interesting things you want to find cannot be found using database queries “find me people likely to buy my products” “Who are likely to respond to my promotion” This PPT presented By - Pawneshwar Datt Rai
  • 8. WHY DATA MINING NOW?  The data is abundant.  The data is being warehoused.  The computing power is affordable.  The competitive pressure is strong.  Data mining tools have become available This PPT presented By - Pawneshwar Datt Rai
  • 9. RELATED FIELDS  Data mining is an emerging multi-disciplinary field: Statistics Machine learning Databases Information retrieval Visualization etc. This PPT presented By - Pawneshwar Datt Rai
  • 10. DATA MINING (KDD) PROCESS  Understand the application domain  Identify data sources and select target data  Pre-process: cleaning, attribute selection  Data mining to extract patterns or models  Post-process: identifying interesting or useful patterns  Incorporate patterns in real world tasks This PPT presented By - Pawneshwar Datt Rai
  • 11. DATA MINING APPLICATIONS  Marketing, customer profiling and retention, identifying potential customers, market segmentation.  Fraud detection identifying credit card fraud, intrusion detection  Scientific data analysis  Text and web mining  Any application that involves a large amount of data. This PPT presented By - Pawneshwar Datt Rai
  • 12. WEB DATA EXTRACTION Data region1 Data region2 A data record A data record This PPT presented By - Pawneshwar Datt Rai
  • 13. OPINION ANALYSIS  Word-of-mouth on the Web  The Web has dramatically changed the way that consumers express their opinions.  One can post reviews of products at merchant sites, Web forums, discussion groups, blogs  Techniques are being developed to exploit these sources.  Benefits of Review Analysis  Potential Customer: No need to read many reviews  Product manufacturer: market intelligence, product benchmarking This PPT presented By - Pawneshwar Datt Rai
  • 14. FEATURE BASED ANALYSIS & SUMMARIZATION  Extracting product features (called Opinion Features) that have been commented on by customers.  Identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative.  Summarizing and comparing results. This PPT presented By - Pawneshwar Datt Rai
  • 15. A Happy and Prosperous day to all friends. This PPT presented By – Pawneshwar Datt Rai ThisPPTpresentedBy-PawneshwarDattRai
  翻译: