SlideShare a Scribd company logo
Introduction to Data Mining
• What is Data Mining?
• Related technologies
• Data Mining techniques
• Data Mining Goals
• Stages of data mining process
• Knowledge representation methods
• Applications
What is Data Mining?
• The process of extracting information to identify patterns, trends,
and useful data that would allow the business to take the data-
driven decision from huge sets of data is called Data Mining.
• Data mining is the act of automatically searching for large stores
of information to find trends and patterns that go beyond simple
analysis procedures.
• Data Mining is a process used by organizations to extract
specific data from huge databases to solve business problems.
It primarily turns raw data into useful information.
• Data mining utilizes complex mathematical algorithms for data
segments and evaluates the probability of future events. Data
Mining is also called Knowledge Discovery of Data (KDD).
Related Technologies
Data mining is related to many concepts. We briefly
introduce each concept and indicate how it is related to
data mining.
• Machine Learning
• DBMS
• OLAP
• Statistics
Machine Learning
• Machine learning is the area of AI that examines how to write programs that
can learn.
• In data mining, machine learning is often used for prediction or classification.
• Applications that typically use machine learning techniques include speech
recognition, training moving robots, classification of astronomical structures,
and game playing.
• When machine learning is applied to data mining tasks, a model is used to
represent the data (such as a graphical structure like a neural network or a
decision tree).
• During the learning process, a sample of the database is used to train the
system to properly perform the desired task.
• Then the system is applied to the general database to actually perform the
task.
Machine Learning
• Machine learning algorithms are divided into two types:
1. Unsupervised Learning
2. Supervised Learning
1. Unsupervised Machine Learning:
Unsupervised learning does not depend on trained data sets to predict the
results, but it utilizes direct techniques such as clustering and association in
order to predict the results.
2. Supervised Machine Learning:
Supervised learning is a learning process in which we teach or train the
machine using data which is well leveled implies that some data is already
marked with the correct responses. After that, the machine is provided with
the new sets of data so that the supervised learning algorithm analyzes the
training data and gives an accurate result.
OLAP
• OLAP stands for On-Line Analytic Processing.
• OLAP systems are targeted to provide more complex query
results than traditional OLTP or database systems.
• OLAP is performed on data warehouses or data marts. The
primary goal of OLAP is to support ad hoc querying needed to
support DSS.
• The multidimensional view of data is fundamental to OLAP
applications.
• OLAP tools can be classified as ROLAP or MOLAP.
• ROLAP- Relational OLAP
• MOLAP- Multidimensional OLAP
OLAP operations
OLAP operations
There are several types of OLAP operations supported by OLAP tools:
• A simple query may look at a single cell within the cube [Figure (a)] .
• Slice: Look at a subcube to get more specific information. This is performed
by selecting on one dimension. As seen in Figure (c), this is looking at a
portion of the cube.
• Dice: Look at a subcube by selecting on two or more dimensions. This can be
performed by a slice on one dimension and then rotating the cube to select
on a second dimension. In Figure (d)
• Roll up (dimension reduction, aggregation): Roll up allows the user to ask
questions that move up an aggregation hierarchy. Figure (b) represents a roll
up from (a).
• Drill down: Figure (a) represents a drill down from (b). These functions allow a
user to get more detailed fact information by navigating lower in the
aggregation hierarchy.
• Visualization: Visualization allows the OLAP users to actually "see" results of
an operation.
DBMS
• A database is a collection of data usually associated with some
organization or enterprise.
• Schema
– e.g. (ID,Name,Address,Salary,JobNo) may be the schema for a
personnel database.
• A database management system (DBMS) is the software used to access a
database.
• Data model is used to describe the data, attributes, and relationships
among them.
– ER Model.
DBMS
• Transaction
• Query:
SELECT Name
FROM T
WHERE Salary > 100000
• A major difference between data mining queries and those of database
systems is the output .
• Basic database queries always output either a subset of the database or
aggregates of the data. A data mining query outputs a KDD object.
Statistics
• Simple statistical concepts as determining a data distribution and calculating
a mean and a variance can be viewed as data mining techniques.
• Statistical inference: Generalizing a model created from a sample of the
data to the entire dataset.
• Exploratory Data Analysis:
– Data can actually drive the creation of the model
– Opposite of traditional statistical view.
• Statistics research has produced many of the proposed data mining
algorithms.
• The difference between the data mining and statistics is data mining is
targeted to business users not to the statistician.
Goals of Data Mining?
• Data mining is one of the most useful techniques that help
entrepreneurs, researchers, and individuals to extract valuable
information from huge sets of data.
• Data mining Store and manage the data in a multidimensional
database system.
• Data mining Provide data access to business analysts and
information technology professionals.
• Data mining Analyze the data by application software.
• Data mining Present the data in a useful format, such as a
graph or table.
Ad

More Related Content

Similar to Lecture2 (1).ppt (20)

Data warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniquesData warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniques
Vaibhav Khanna
 
M. FLORENCE DAYANA/DATABASE MANAGEMENT SYSYTEM
M. FLORENCE DAYANA/DATABASE MANAGEMENT SYSYTEMM. FLORENCE DAYANA/DATABASE MANAGEMENT SYSYTEM
M. FLORENCE DAYANA/DATABASE MANAGEMENT SYSYTEM
Dr.Florence Dayana
 
Data mining techniques unit 1
Data mining techniques  unit 1Data mining techniques  unit 1
Data mining techniques unit 1
malathieswaran29
 
Unit 1.A.Introduction to Knowledge Discovery Data Mining (1).pptx
Unit 1.A.Introduction to Knowledge Discovery  Data Mining  (1).pptxUnit 1.A.Introduction to Knowledge Discovery  Data Mining  (1).pptx
Unit 1.A.Introduction to Knowledge Discovery Data Mining (1).pptx
sayalee7
 
Data Science Fundamentals and Practices.pptx
Data Science Fundamentals and Practices.pptxData Science Fundamentals and Practices.pptx
Data Science Fundamentals and Practices.pptx
justjoking99yt
 
Machinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdfMachinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdf
SaketBansal9
 
Complete unit ii notes
Complete unit ii notesComplete unit ii notes
Complete unit ii notes
Benazir Fathima
 
Unit 3 part i Data mining
Unit 3 part i Data miningUnit 3 part i Data mining
Unit 3 part i Data mining
Dhilsath Fathima
 
Data mining slide for data mining process
Data mining slide for data mining processData mining slide for data mining process
Data mining slide for data mining process
NivaTripathy1
 
MS SQL SERVER: Introduction To Datamining Suing Sql Server
MS SQL SERVER: Introduction To Datamining Suing Sql ServerMS SQL SERVER: Introduction To Datamining Suing Sql Server
MS SQL SERVER: Introduction To Datamining Suing Sql Server
sqlserver content
 
MS Sql Server: Introduction To Datamining Suing Sql Server
MS Sql Server: Introduction To Datamining Suing Sql ServerMS Sql Server: Introduction To Datamining Suing Sql Server
MS Sql Server: Introduction To Datamining Suing Sql Server
DataminingTools Inc
 
Ch_2.pdf
Ch_2.pdfCh_2.pdf
Ch_2.pdf
DawitBirhanu13
 
Sanjeev Kumar Dash D ata Mining-2023.ppt
Sanjeev Kumar Dash D ata Mining-2023.pptSanjeev Kumar Dash D ata Mining-2023.ppt
Sanjeev Kumar Dash D ata Mining-2023.ppt
gobeli2850
 
Data Mining-2023 (2).ppt
Data Mining-2023 (2).pptData Mining-2023 (2).ppt
Data Mining-2023 (2).ppt
SATYAJITJENABTECH
 
Unit 1_data mining and warehousing subject
Unit 1_data mining and warehousing subjectUnit 1_data mining and warehousing subject
Unit 1_data mining and warehousing subject
tomjerryguest
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning
Gopal Sakarkar
 
data_preprocessingknnnaiveandothera.pptx
data_preprocessingknnnaiveandothera.pptxdata_preprocessingknnnaiveandothera.pptx
data_preprocessingknnnaiveandothera.pptx
nikhilguptha06
 
CIS60efrgrgrhdhbehthrthrthrhrtgfhgbrthrthtrhth08.pptx
CIS60efrgrgrhdhbehthrthrthrhrtgfhgbrthrthtrhth08.pptxCIS60efrgrgrhdhbehthrthrthrhrtgfhgbrthrthtrhth08.pptx
CIS60efrgrgrhdhbehthrthrthrhrtgfhgbrthrthtrhth08.pptx
spracademyielts3
 
DATA MINING BASIC INTRODUCTION OF ALL THE STAGES
DATA MINING BASIC INTRODUCTION OF ALL THE STAGESDATA MINING BASIC INTRODUCTION OF ALL THE STAGES
DATA MINING BASIC INTRODUCTION OF ALL THE STAGES
JignaJadav1
 
natural language processing ktu syllabus Module 2
natural language processing ktu syllabus  Module 2natural language processing ktu syllabus  Module 2
natural language processing ktu syllabus Module 2
AbhijithMWarrier1
 
Data warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniquesData warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniques
Vaibhav Khanna
 
M. FLORENCE DAYANA/DATABASE MANAGEMENT SYSYTEM
M. FLORENCE DAYANA/DATABASE MANAGEMENT SYSYTEMM. FLORENCE DAYANA/DATABASE MANAGEMENT SYSYTEM
M. FLORENCE DAYANA/DATABASE MANAGEMENT SYSYTEM
Dr.Florence Dayana
 
Data mining techniques unit 1
Data mining techniques  unit 1Data mining techniques  unit 1
Data mining techniques unit 1
malathieswaran29
 
Unit 1.A.Introduction to Knowledge Discovery Data Mining (1).pptx
Unit 1.A.Introduction to Knowledge Discovery  Data Mining  (1).pptxUnit 1.A.Introduction to Knowledge Discovery  Data Mining  (1).pptx
Unit 1.A.Introduction to Knowledge Discovery Data Mining (1).pptx
sayalee7
 
Data Science Fundamentals and Practices.pptx
Data Science Fundamentals and Practices.pptxData Science Fundamentals and Practices.pptx
Data Science Fundamentals and Practices.pptx
justjoking99yt
 
Machinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdfMachinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdf
SaketBansal9
 
Data mining slide for data mining process
Data mining slide for data mining processData mining slide for data mining process
Data mining slide for data mining process
NivaTripathy1
 
MS SQL SERVER: Introduction To Datamining Suing Sql Server
MS SQL SERVER: Introduction To Datamining Suing Sql ServerMS SQL SERVER: Introduction To Datamining Suing Sql Server
MS SQL SERVER: Introduction To Datamining Suing Sql Server
sqlserver content
 
MS Sql Server: Introduction To Datamining Suing Sql Server
MS Sql Server: Introduction To Datamining Suing Sql ServerMS Sql Server: Introduction To Datamining Suing Sql Server
MS Sql Server: Introduction To Datamining Suing Sql Server
DataminingTools Inc
 
Sanjeev Kumar Dash D ata Mining-2023.ppt
Sanjeev Kumar Dash D ata Mining-2023.pptSanjeev Kumar Dash D ata Mining-2023.ppt
Sanjeev Kumar Dash D ata Mining-2023.ppt
gobeli2850
 
Unit 1_data mining and warehousing subject
Unit 1_data mining and warehousing subjectUnit 1_data mining and warehousing subject
Unit 1_data mining and warehousing subject
tomjerryguest
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning
Gopal Sakarkar
 
data_preprocessingknnnaiveandothera.pptx
data_preprocessingknnnaiveandothera.pptxdata_preprocessingknnnaiveandothera.pptx
data_preprocessingknnnaiveandothera.pptx
nikhilguptha06
 
CIS60efrgrgrhdhbehthrthrthrhrtgfhgbrthrthtrhth08.pptx
CIS60efrgrgrhdhbehthrthrthrhrtgfhgbrthrthtrhth08.pptxCIS60efrgrgrhdhbehthrthrthrhrtgfhgbrthrthtrhth08.pptx
CIS60efrgrgrhdhbehthrthrthrhrtgfhgbrthrthtrhth08.pptx
spracademyielts3
 
DATA MINING BASIC INTRODUCTION OF ALL THE STAGES
DATA MINING BASIC INTRODUCTION OF ALL THE STAGESDATA MINING BASIC INTRODUCTION OF ALL THE STAGES
DATA MINING BASIC INTRODUCTION OF ALL THE STAGES
JignaJadav1
 
natural language processing ktu syllabus Module 2
natural language processing ktu syllabus  Module 2natural language processing ktu syllabus  Module 2
natural language processing ktu syllabus Module 2
AbhijithMWarrier1
 

More from Minakshee Patil (16)

Introduction to Computational Complexity Theory pptx
Introduction to Computational Complexity Theory pptxIntroduction to Computational Complexity Theory pptx
Introduction to Computational Complexity Theory pptx
Minakshee Patil
 
Different Searching and Sorting Methods.pptx
Different Searching and Sorting Methods.pptxDifferent Searching and Sorting Methods.pptx
Different Searching and Sorting Methods.pptx
Minakshee Patil
 
Analysis of Algorithms (1).pptx, asymptotic
Analysis of Algorithms (1).pptx, asymptoticAnalysis of Algorithms (1).pptx, asymptotic
Analysis of Algorithms (1).pptx, asymptotic
Minakshee Patil
 
0-1_knapsack_using_DP, types of knapsack
0-1_knapsack_using_DP, types of knapsack0-1_knapsack_using_DP, types of knapsack
0-1_knapsack_using_DP, types of knapsack
Minakshee Patil
 
Linear Data Structures, array, stack, queue
Linear Data Structures, array, stack, queueLinear Data Structures, array, stack, queue
Linear Data Structures, array, stack, queue
Minakshee Patil
 
Unit 5-BACKTRACKING- n queens, sum of subset, graph coloring problems
Unit 5-BACKTRACKING- n queens, sum of subset, graph coloring problemsUnit 5-BACKTRACKING- n queens, sum of subset, graph coloring problems
Unit 5-BACKTRACKING- n queens, sum of subset, graph coloring problems
Minakshee Patil
 
stack, opeartions on stack, applications of stack
stack, opeartions on stack, applications of stackstack, opeartions on stack, applications of stack
stack, opeartions on stack, applications of stack
Minakshee Patil
 
Algorithm Design Techiques, divide and conquer
Algorithm Design Techiques, divide and conquerAlgorithm Design Techiques, divide and conquer
Algorithm Design Techiques, divide and conquer
Minakshee Patil
 
Analysis of Algorithms, recurrence relation, solving recurrences
Analysis of Algorithms, recurrence relation, solving recurrencesAnalysis of Algorithms, recurrence relation, solving recurrences
Analysis of Algorithms, recurrence relation, solving recurrences
Minakshee Patil
 
Lecture2 (9).ppt
Lecture2 (9).pptLecture2 (9).ppt
Lecture2 (9).ppt
Minakshee Patil
 
oracle.pptx
oracle.pptxoracle.pptx
oracle.pptx
Minakshee Patil
 
Lecture1.ppt
Lecture1.pptLecture1.ppt
Lecture1.ppt
Minakshee Patil
 
Unit 1.ppt
Unit 1.pptUnit 1.ppt
Unit 1.ppt
Minakshee Patil
 
Hierarchical clustering algorithm.pptx
Hierarchical clustering algorithm.pptxHierarchical clustering algorithm.pptx
Hierarchical clustering algorithm.pptx
Minakshee Patil
 
Lecture3 (3).ppt
Lecture3 (3).pptLecture3 (3).ppt
Lecture3 (3).ppt
Minakshee Patil
 
Lecture4.ppt
Lecture4.pptLecture4.ppt
Lecture4.ppt
Minakshee Patil
 
Introduction to Computational Complexity Theory pptx
Introduction to Computational Complexity Theory pptxIntroduction to Computational Complexity Theory pptx
Introduction to Computational Complexity Theory pptx
Minakshee Patil
 
Different Searching and Sorting Methods.pptx
Different Searching and Sorting Methods.pptxDifferent Searching and Sorting Methods.pptx
Different Searching and Sorting Methods.pptx
Minakshee Patil
 
Analysis of Algorithms (1).pptx, asymptotic
Analysis of Algorithms (1).pptx, asymptoticAnalysis of Algorithms (1).pptx, asymptotic
Analysis of Algorithms (1).pptx, asymptotic
Minakshee Patil
 
0-1_knapsack_using_DP, types of knapsack
0-1_knapsack_using_DP, types of knapsack0-1_knapsack_using_DP, types of knapsack
0-1_knapsack_using_DP, types of knapsack
Minakshee Patil
 
Linear Data Structures, array, stack, queue
Linear Data Structures, array, stack, queueLinear Data Structures, array, stack, queue
Linear Data Structures, array, stack, queue
Minakshee Patil
 
Unit 5-BACKTRACKING- n queens, sum of subset, graph coloring problems
Unit 5-BACKTRACKING- n queens, sum of subset, graph coloring problemsUnit 5-BACKTRACKING- n queens, sum of subset, graph coloring problems
Unit 5-BACKTRACKING- n queens, sum of subset, graph coloring problems
Minakshee Patil
 
stack, opeartions on stack, applications of stack
stack, opeartions on stack, applications of stackstack, opeartions on stack, applications of stack
stack, opeartions on stack, applications of stack
Minakshee Patil
 
Algorithm Design Techiques, divide and conquer
Algorithm Design Techiques, divide and conquerAlgorithm Design Techiques, divide and conquer
Algorithm Design Techiques, divide and conquer
Minakshee Patil
 
Analysis of Algorithms, recurrence relation, solving recurrences
Analysis of Algorithms, recurrence relation, solving recurrencesAnalysis of Algorithms, recurrence relation, solving recurrences
Analysis of Algorithms, recurrence relation, solving recurrences
Minakshee Patil
 
Hierarchical clustering algorithm.pptx
Hierarchical clustering algorithm.pptxHierarchical clustering algorithm.pptx
Hierarchical clustering algorithm.pptx
Minakshee Patil
 
Ad

Recently uploaded (20)

Slide share PPT of SOx control technologies.pptx
Slide share PPT of SOx control technologies.pptxSlide share PPT of SOx control technologies.pptx
Slide share PPT of SOx control technologies.pptx
vvsasane
 
Control Methods of Noise Pollutions.pptx
Control Methods of Noise Pollutions.pptxControl Methods of Noise Pollutions.pptx
Control Methods of Noise Pollutions.pptx
vvsasane
 
DED KOMINFO detail engginering design gedung
DED KOMINFO detail engginering design gedungDED KOMINFO detail engginering design gedung
DED KOMINFO detail engginering design gedung
nabilarizqifadhilah1
 
Lecture - 7 Canals of the topic of the civil engineering
Lecture - 7  Canals of the topic of the civil engineeringLecture - 7  Canals of the topic of the civil engineering
Lecture - 7 Canals of the topic of the civil engineering
MJawadkhan1
 
Generative AI & Large Language Models Agents
Generative AI & Large Language Models AgentsGenerative AI & Large Language Models Agents
Generative AI & Large Language Models Agents
aasgharbee22seecs
 
Interfacing PMW3901 Optical Flow Sensor with ESP32
Interfacing PMW3901 Optical Flow Sensor with ESP32Interfacing PMW3901 Optical Flow Sensor with ESP32
Interfacing PMW3901 Optical Flow Sensor with ESP32
CircuitDigest
 
Agents chapter of Artificial intelligence
Agents chapter of Artificial intelligenceAgents chapter of Artificial intelligence
Agents chapter of Artificial intelligence
DebdeepMukherjee9
 
Dynamics of Structures with Uncertain Properties.pptx
Dynamics of Structures with Uncertain Properties.pptxDynamics of Structures with Uncertain Properties.pptx
Dynamics of Structures with Uncertain Properties.pptx
University of Glasgow
 
A Survey of Personalized Large Language Models.pptx
A Survey of Personalized Large Language Models.pptxA Survey of Personalized Large Language Models.pptx
A Survey of Personalized Large Language Models.pptx
rutujabhaskarraopati
 
SICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introductionSICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introduction
fabienklr
 
JRR Tolkien’s Lord of the Rings: Was It Influenced by Nordic Mythology, Homer...
JRR Tolkien’s Lord of the Rings: Was It Influenced by Nordic Mythology, Homer...JRR Tolkien’s Lord of the Rings: Was It Influenced by Nordic Mythology, Homer...
JRR Tolkien’s Lord of the Rings: Was It Influenced by Nordic Mythology, Homer...
Reflections on Morality, Philosophy, and History
 
Evonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdfEvonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdf
szhang13
 
Frontend Architecture Diagram/Guide For Frontend Engineers
Frontend Architecture Diagram/Guide For Frontend EngineersFrontend Architecture Diagram/Guide For Frontend Engineers
Frontend Architecture Diagram/Guide For Frontend Engineers
Michael Hertzberg
 
Applications of Centroid in Structural Engineering
Applications of Centroid in Structural EngineeringApplications of Centroid in Structural Engineering
Applications of Centroid in Structural Engineering
suvrojyotihalder2006
 
seninarppt.pptx1bhjiikjhggghjykoirgjuyhhhjj
seninarppt.pptx1bhjiikjhggghjykoirgjuyhhhjjseninarppt.pptx1bhjiikjhggghjykoirgjuyhhhjj
seninarppt.pptx1bhjiikjhggghjykoirgjuyhhhjj
AjijahamadKhaji
 
introduction technology technology tec.pptx
introduction technology technology tec.pptxintroduction technology technology tec.pptx
introduction technology technology tec.pptx
Iftikhar70
 
Surveying through global positioning system
Surveying through global positioning systemSurveying through global positioning system
Surveying through global positioning system
opneptune5
 
PRIZ Academy - Functional Modeling In Action with PRIZ.pdf
PRIZ Academy - Functional Modeling In Action with PRIZ.pdfPRIZ Academy - Functional Modeling In Action with PRIZ.pdf
PRIZ Academy - Functional Modeling In Action with PRIZ.pdf
PRIZ Guru
 
Jacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia - Excels In Optimizing Software ApplicationsJacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia
 
Computer Security Fundamentals Chapter 1
Computer Security Fundamentals Chapter 1Computer Security Fundamentals Chapter 1
Computer Security Fundamentals Chapter 1
remoteaimms
 
Slide share PPT of SOx control technologies.pptx
Slide share PPT of SOx control technologies.pptxSlide share PPT of SOx control technologies.pptx
Slide share PPT of SOx control technologies.pptx
vvsasane
 
Control Methods of Noise Pollutions.pptx
Control Methods of Noise Pollutions.pptxControl Methods of Noise Pollutions.pptx
Control Methods of Noise Pollutions.pptx
vvsasane
 
DED KOMINFO detail engginering design gedung
DED KOMINFO detail engginering design gedungDED KOMINFO detail engginering design gedung
DED KOMINFO detail engginering design gedung
nabilarizqifadhilah1
 
Lecture - 7 Canals of the topic of the civil engineering
Lecture - 7  Canals of the topic of the civil engineeringLecture - 7  Canals of the topic of the civil engineering
Lecture - 7 Canals of the topic of the civil engineering
MJawadkhan1
 
Generative AI & Large Language Models Agents
Generative AI & Large Language Models AgentsGenerative AI & Large Language Models Agents
Generative AI & Large Language Models Agents
aasgharbee22seecs
 
Interfacing PMW3901 Optical Flow Sensor with ESP32
Interfacing PMW3901 Optical Flow Sensor with ESP32Interfacing PMW3901 Optical Flow Sensor with ESP32
Interfacing PMW3901 Optical Flow Sensor with ESP32
CircuitDigest
 
Agents chapter of Artificial intelligence
Agents chapter of Artificial intelligenceAgents chapter of Artificial intelligence
Agents chapter of Artificial intelligence
DebdeepMukherjee9
 
Dynamics of Structures with Uncertain Properties.pptx
Dynamics of Structures with Uncertain Properties.pptxDynamics of Structures with Uncertain Properties.pptx
Dynamics of Structures with Uncertain Properties.pptx
University of Glasgow
 
A Survey of Personalized Large Language Models.pptx
A Survey of Personalized Large Language Models.pptxA Survey of Personalized Large Language Models.pptx
A Survey of Personalized Large Language Models.pptx
rutujabhaskarraopati
 
SICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introductionSICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introduction
fabienklr
 
Evonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdfEvonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdf
szhang13
 
Frontend Architecture Diagram/Guide For Frontend Engineers
Frontend Architecture Diagram/Guide For Frontend EngineersFrontend Architecture Diagram/Guide For Frontend Engineers
Frontend Architecture Diagram/Guide For Frontend Engineers
Michael Hertzberg
 
Applications of Centroid in Structural Engineering
Applications of Centroid in Structural EngineeringApplications of Centroid in Structural Engineering
Applications of Centroid in Structural Engineering
suvrojyotihalder2006
 
seninarppt.pptx1bhjiikjhggghjykoirgjuyhhhjj
seninarppt.pptx1bhjiikjhggghjykoirgjuyhhhjjseninarppt.pptx1bhjiikjhggghjykoirgjuyhhhjj
seninarppt.pptx1bhjiikjhggghjykoirgjuyhhhjj
AjijahamadKhaji
 
introduction technology technology tec.pptx
introduction technology technology tec.pptxintroduction technology technology tec.pptx
introduction technology technology tec.pptx
Iftikhar70
 
Surveying through global positioning system
Surveying through global positioning systemSurveying through global positioning system
Surveying through global positioning system
opneptune5
 
PRIZ Academy - Functional Modeling In Action with PRIZ.pdf
PRIZ Academy - Functional Modeling In Action with PRIZ.pdfPRIZ Academy - Functional Modeling In Action with PRIZ.pdf
PRIZ Academy - Functional Modeling In Action with PRIZ.pdf
PRIZ Guru
 
Jacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia - Excels In Optimizing Software ApplicationsJacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia - Excels In Optimizing Software Applications
Jacob Murphy Australia
 
Computer Security Fundamentals Chapter 1
Computer Security Fundamentals Chapter 1Computer Security Fundamentals Chapter 1
Computer Security Fundamentals Chapter 1
remoteaimms
 
Ad

Lecture2 (1).ppt

  • 1. Introduction to Data Mining • What is Data Mining? • Related technologies • Data Mining techniques • Data Mining Goals • Stages of data mining process • Knowledge representation methods • Applications
  • 2. What is Data Mining? • The process of extracting information to identify patterns, trends, and useful data that would allow the business to take the data- driven decision from huge sets of data is called Data Mining. • Data mining is the act of automatically searching for large stores of information to find trends and patterns that go beyond simple analysis procedures. • Data Mining is a process used by organizations to extract specific data from huge databases to solve business problems. It primarily turns raw data into useful information. • Data mining utilizes complex mathematical algorithms for data segments and evaluates the probability of future events. Data Mining is also called Knowledge Discovery of Data (KDD).
  • 3. Related Technologies Data mining is related to many concepts. We briefly introduce each concept and indicate how it is related to data mining. • Machine Learning • DBMS • OLAP • Statistics
  • 4. Machine Learning • Machine learning is the area of AI that examines how to write programs that can learn. • In data mining, machine learning is often used for prediction or classification. • Applications that typically use machine learning techniques include speech recognition, training moving robots, classification of astronomical structures, and game playing. • When machine learning is applied to data mining tasks, a model is used to represent the data (such as a graphical structure like a neural network or a decision tree). • During the learning process, a sample of the database is used to train the system to properly perform the desired task. • Then the system is applied to the general database to actually perform the task.
  • 5. Machine Learning • Machine learning algorithms are divided into two types: 1. Unsupervised Learning 2. Supervised Learning 1. Unsupervised Machine Learning: Unsupervised learning does not depend on trained data sets to predict the results, but it utilizes direct techniques such as clustering and association in order to predict the results. 2. Supervised Machine Learning: Supervised learning is a learning process in which we teach or train the machine using data which is well leveled implies that some data is already marked with the correct responses. After that, the machine is provided with the new sets of data so that the supervised learning algorithm analyzes the training data and gives an accurate result.
  • 6. OLAP • OLAP stands for On-Line Analytic Processing. • OLAP systems are targeted to provide more complex query results than traditional OLTP or database systems. • OLAP is performed on data warehouses or data marts. The primary goal of OLAP is to support ad hoc querying needed to support DSS. • The multidimensional view of data is fundamental to OLAP applications. • OLAP tools can be classified as ROLAP or MOLAP. • ROLAP- Relational OLAP • MOLAP- Multidimensional OLAP
  • 8. OLAP operations There are several types of OLAP operations supported by OLAP tools: • A simple query may look at a single cell within the cube [Figure (a)] . • Slice: Look at a subcube to get more specific information. This is performed by selecting on one dimension. As seen in Figure (c), this is looking at a portion of the cube. • Dice: Look at a subcube by selecting on two or more dimensions. This can be performed by a slice on one dimension and then rotating the cube to select on a second dimension. In Figure (d) • Roll up (dimension reduction, aggregation): Roll up allows the user to ask questions that move up an aggregation hierarchy. Figure (b) represents a roll up from (a). • Drill down: Figure (a) represents a drill down from (b). These functions allow a user to get more detailed fact information by navigating lower in the aggregation hierarchy. • Visualization: Visualization allows the OLAP users to actually "see" results of an operation.
  • 9. DBMS • A database is a collection of data usually associated with some organization or enterprise. • Schema – e.g. (ID,Name,Address,Salary,JobNo) may be the schema for a personnel database. • A database management system (DBMS) is the software used to access a database. • Data model is used to describe the data, attributes, and relationships among them. – ER Model.
  • 10. DBMS • Transaction • Query: SELECT Name FROM T WHERE Salary > 100000 • A major difference between data mining queries and those of database systems is the output . • Basic database queries always output either a subset of the database or aggregates of the data. A data mining query outputs a KDD object.
  • 11. Statistics • Simple statistical concepts as determining a data distribution and calculating a mean and a variance can be viewed as data mining techniques. • Statistical inference: Generalizing a model created from a sample of the data to the entire dataset. • Exploratory Data Analysis: – Data can actually drive the creation of the model – Opposite of traditional statistical view. • Statistics research has produced many of the proposed data mining algorithms. • The difference between the data mining and statistics is data mining is targeted to business users not to the statistician.
  • 12. Goals of Data Mining? • Data mining is one of the most useful techniques that help entrepreneurs, researchers, and individuals to extract valuable information from huge sets of data. • Data mining Store and manage the data in a multidimensional database system. • Data mining Provide data access to business analysts and information technology professionals. • Data mining Analyze the data by application software. • Data mining Present the data in a useful format, such as a graph or table.
  翻译: