SlideShare a Scribd company logo
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Michael Brückner
Manager Machine Learning
25/02/2016
Machine Learning 101
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Agenda
• What is Machine Learning and why do we need it?
• Model Building
• Model Evaluation & Tuning
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
What is Machine Learning?
Methods and Systems that
…
Adapt
based on
recorded
data
Predict
new data
based on
recorded
data
Optimize
an action
given a
utility
function
Extract
hidden
structure
from the
data
Summarize
data into
concise
descriptions
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
What is Machine Learning NOT?
Methods and Systems that
…
can yield
Garbage-In
Knowledge-
Out
perform well
without
data modeling
& feature
engineering
avoid the
curse-of-
dimensionality
are a
replacement
for business
rules
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Infer-Predict-Decide Cycle
Inference
Build & evaluate
Predictor
Prediction
Apply the learned
Predictor
Decision Making
Adjust Business loss
and get new/more data
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
What for?
Automate tasks, which typically require humans in order to
• scale
• improve over humans (non-experts)
• preserve privacy
or solve tasks that are impossible for humans
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Examples: Personalized Recommandation
• Input:
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Examples: Personalized Recommandation
• Output:
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Examples: Face Detection & Recognition
Face detection
• Input: image
• Output: face position
Face recognition
• Input: face (image & face position)
• Output: person’s name
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Examples: Full-Text Translation
• Input: text in one language
• Output: text of another language
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Examples: Spam Filtering
• Input: email (text, images, …)
• Output: spam/non-spam flag
• Challenges:
• extremely high precision for
legitimate emails
• spam changes constantly
• noisy ground truth
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Supervised Machine Learning
1. Model problem in terms of input data and output data
2. Collect sample of input-output pairs
3. Learn a mapping that produces the output given the
input
4. Apply this function on new inputs to make predictions
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
A Programer’s Perspective
Traditional Programming (Predicting)
Supervised Machine Learning
Computer
Input Data
Mapping
Output Data
Computer
Input Data
Output Data
Mapping
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Advantages
• Use data instead of intuition to derive the mapping
• Can solve very complex tasks
• Can adapt to new situations (collect more data)
• Does not require much expert knowledge
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Input Data
Description Type Cost Actual Cost Diff In Catalogue
Movies Entertainment $50 $28 $22 Yes
Music (CDs, MP3s, etc.) $500 $30 $470 No
Sporting Events Entertainment $0 $40 ($40) No
Dining Out Food $1,000 $1,200 ($200) Yes
Groceries $100 $0 $100 Yes
Charity 1 Gifts and Charity $200 $200 $0 No
Charity 2 $500 $500 $0 No
Cable/Satellite Housing $100 $100 $0 Yes
Electric Housing $45 $40 $5 Yes
Mortgage or Rent $700 $700 $0 Yes
Health Insurance $400 $400 $0 Yes
Home Insurance $400 $400 $0 No
Credit Card 1 $0 Yes
Dataset
Categorical Data
Missing Data
Binary Data
Numerical Data
Attribute Name
Attribute Value
Attribute
Text Data
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Description Type Cost Actual Cost Diff In Catalogue
Movies Entertainment $50 $28 $22 Yes
Music (CDs, MP3s, etc.) ? $500 $30 $470 No
Sporting Events Entertainment $0 $40 ($40) No
Dining Out Food $1,000 $1,200 ($200) Yes
Groceries ? $100 $0 $100 Yes
Charity 1 Gifts and Charity $200 $200 $0 No
Charity 2 ? $500 $500 $0 No
Cable/Satellite Housing $100 $100 $0 Yes
Electric Housing $45 $40 $5 Yes
Mortgage or Rent ? $700 $700 $0 Yes
Health Insurance $400 $400 $0 Yes
Home Insurance $400 $400 $0 No
Credit Card 1 ? $0 Yes
Output Data
Target Attribute Values
Target Attribute
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Agenda
• What is Machine Learning and why do we need it?
• Model Building
• Model Evaluation & Tuning
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Problem Setting
• Input: vector of observable attributes, x
• Output: target attribute value, y
• Training data: pairs of input and corresponding output,
D = (x1,y1),…,(xN,yN)
• Application data: inputs only
• Goal: learn mapping fw:x ↦ y
Predictor
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Challenges in Model Building
• Which function class for Predictor (data modeling)?
• How to pre-process the data (feature engineering)?
• How to learn this Predictor from our training data?
• How to generalize to new data?
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Which function class for Predictor?
Types of prediction tasks (output type):
• Binary Classification ⇒ binary target y  {–1, +1}
• Multinomial Classification ⇒ categorical target y  {1… K}
• Regression ⇒ numeric target y  [l,u]  R
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Which function class for Binary Classification?
• Decision Tree
+
+-
-
-
x2 > 7?
no yes
+
+
+
+
+
x1 < 3?
no yes
x2 < 5?
no yes
x1 < 1?
no yes
+
+
-
-
x2
x1
1 3
5
7
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Which function class for Binary Classification?
• Decision Tree
+-
x2 > 7?
no yes
+
x1 < 3?
no yes
x2 < 5?
no yes
x1 < 1?
no yes
+ -
x2
x1
+
-
-
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Which function class for Binary Classification?
• Linear function
• binary target attribute
values y  {–1, +1}
x2
x1
Hw +
-
y(x) = sign( fw
(x))
Hw
={x| fw
(x) = xT
w+w0
= 0}
^
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Which function class for Binary Classification?
• Generalized linear function
(Kernel methods)
• Layered Generalized linear
function (Neural Networks)
• Ensemble of functions
• …
x2
x1
+
- +
-
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to pre-process the data?
• Predictor’s function class defined for limited input domain
⇒ transform/extract attributes first (pre-processing)
• Number to (normalized) Number:
• z-standardization, min-max normalization
• Number to Category:
• Binning (quantile, equidistant)
• Category to (numeric) Vector:
• One-hot encoding
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to pre-process the data?
• Predictor’s function class defined for limited input domain
⇒ transform/extract attributes first (pre-processing)
• Text to (numeric) Vector:
• Normalization, tokenization, stemming
• Bag-of-Words, Bag-of-NGrams, TI-IDF ⇒ sparse vector
• Latent word embedding (LSI, word2vec, LDA) ⇒ dense vector
• Image to (numeric) Vector:
• HoG, DAISY, color histogram
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to learn a Predictor?
• Loss of Predictor fw:x ↦ y for a given input-output pair:
Loss function PredictionGround Truth
L(y, fw
(x))
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to learn a Predictor?
Loss functions for binary classification (target ):y Î{-1,+1}
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to learn a Predictor?
Function Class Loss Function Learning Algorithm
Decision Trees 0/1 loss ID3
Decision Trees Quadratic loss CART
Linear function Quadratic loss Least-squares regression
Linear function Logistic loss Logistic regression
Linear function Hinge loss Support Vector Machines
Layered Generalized
Linear function
Logistic loss Neural Networks
(Binary Classification)
Layered Generalized
Linear function
Quadratic loss Neural Networks
(Regression)
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to learn a Predictor?
• Theoretical Risk:
• Empirical Risk:
Average over all possible data
Average over training data
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to learn a Predictor?
• Prediction depends on Predictor with model
parameters w
• Minimize Risk w.r.t. those model parameters w
⇒ mathematical Optimisation Problem
• Gradient-based first or second-order methods
• Coordinate-descent methods
• (Greedy) Search
y(x)^ fw
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to generalize to new data?
Error
Model Complexity
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to generalize to new data?
• Empirical Risk:
• Structural Risk: Regularizer
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Agenda
• What is Machine Learning and why do we need it?
• Model Building
• Model Evaluation & Tuning
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Performance for Binary Classification
Total number of
data points (N)
True Target
positive negative
Predicted
Target
positive
True
Positive
False
Positive
negative
False
Negative
True
Negative
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Performance for Binary Classification
• Accuracy:
• Recall (true positive rate):
• Precision:
• Fall-out (false positive rate):
TP+TN
N
TP
TP+ FN
TP
TP+ FP
FP
TN + FP
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Performance for Binary Classification
Decision function
AUC
(Area Under roc Curve)
y(x) = sign( fw
(x)+b)^
Predictor Decision threshold
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Training vs. Test Performance
How do we know that a Predictor works well on new data?
Small error on training
data ≠ small error on
new data (test data)!
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Hold-out Evaluation
• Put some data aside before training = test data
• Use this hold-out data for evaluation
• Disadvantages:
• What if we were (un)lucky when choosing the hold-out data?
• We do NOT use all the data for model training!
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
K-Fold Cross Validation-based Evaluation
• Split data into K partitions (folds)
• Take all but one partition to train a Predictor
• Evaluate Predictor on the left-out partition
• Repeat this for all partitions
• Average performance for all K evaluations
• Finally train a Predictor on all data
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Model Tuning
Learning methods and Predictors have hyper-parameters
• Amount of regularization
• Choice of loss function
• Decision threshold score
• Learning rate
• …
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Example: Decision threshold
Decision threshold
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to choose hyper-parameters?
Grid Search:
• Evaluate Predictor for all grid points (hyper-parameter
combinations)
• Take best grid point
Very expensive!


2
10 0
10 2
10
1
2
0
2
1
2
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How to choose hyper-parameters?
Bayesian Optimisation:
• Learn model to predict evaluation outcomes
• Evaluate Predictor only for promising grid points
• Take best grid point
after fixed number of
evaluations


2
10 0
10 2
10
1
2
0
2
1
2
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Common Pitfalls
• Model tuning is part of training
⇒ Do NOT use test data or test CV partitions!
• Use proper grid resolution and axis scaling
• Use same metric for tuning as for evaluation
Thank you!
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Ad

More Related Content

What's hot (15)

Presentazione tutorial
Presentazione tutorialPresentazione tutorial
Presentazione tutorial
dariospin93
 
AI/ML Powered Personalized Recommendations in Gaming Industry
AI/ML PoweredPersonalized Recommendations in Gaming IndustryAI/ML PoweredPersonalized Recommendations in Gaming Industry
AI/ML Powered Personalized Recommendations in Gaming Industry
Hasan Basri AKIRMAK, MSc,ExecMBA
 
Azure Machine Learning
Azure Machine LearningAzure Machine Learning
Azure Machine Learning
Mostafa
 
Azure Machine Learning Intro
Azure Machine Learning IntroAzure Machine Learning Intro
Azure Machine Learning Intro
Damir Dobric
 
Guiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineGuiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning Pipeline
Michael Gerke
 
Azure Machine Learning 101
Azure Machine Learning 101Azure Machine Learning 101
Azure Machine Learning 101
Andrew Badera
 
Borys Rybak “Azure Machine Learning Studio & Azure Workbench & R + Python”
Borys Rybak “Azure Machine Learning Studio & Azure Workbench & R + Python” Borys Rybak “Azure Machine Learning Studio & Azure Workbench & R + Python”
Borys Rybak “Azure Machine Learning Studio & Azure Workbench & R + Python”
Lviv Startup Club
 
Real-time Recommendations for Retail: Architecture, Algorithms, and Design
Real-time Recommendations for Retail: Architecture, Algorithms, and DesignReal-time Recommendations for Retail: Architecture, Algorithms, and Design
Real-time Recommendations for Retail: Architecture, Algorithms, and Design
Juliet Hougland
 
Square's Machine Learning Infrastructure and Applications - Rong Yan
Square's Machine Learning Infrastructure and Applications - Rong YanSquare's Machine Learning Infrastructure and Applications - Rong Yan
Square's Machine Learning Infrastructure and Applications - Rong Yan
Hakka Labs
 
How Artificial Intelligence & Machine Learning Are Transforming Modern Marketing
How Artificial Intelligence & Machine Learning Are Transforming Modern MarketingHow Artificial Intelligence & Machine Learning Are Transforming Modern Marketing
How Artificial Intelligence & Machine Learning Are Transforming Modern Marketing
CleverTap
 
The Power of Auto ML and How Does it Work
The Power of Auto ML and How Does it WorkThe Power of Auto ML and How Does it Work
The Power of Auto ML and How Does it Work
Ivo Andreev
 
How Artificial Intelligence & Machine Learning Are Transforming Modern Market...
How Artificial Intelligence & Machine Learning Are Transforming Modern Market...How Artificial Intelligence & Machine Learning Are Transforming Modern Market...
How Artificial Intelligence & Machine Learning Are Transforming Modern Market...
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
Artificial Intelligence at LinkedIn
Artificial Intelligence at LinkedInArtificial Intelligence at LinkedIn
Artificial Intelligence at LinkedIn
Bill Liu
 
Azure Machine Learning and ML on Premises
Azure Machine Learning and ML on PremisesAzure Machine Learning and ML on Premises
Azure Machine Learning and ML on Premises
Ivo Andreev
 
Introduction to Azure Machine Learning
Introduction to Azure Machine LearningIntroduction to Azure Machine Learning
Introduction to Azure Machine Learning
Paul Prae
 
Presentazione tutorial
Presentazione tutorialPresentazione tutorial
Presentazione tutorial
dariospin93
 
AI/ML Powered Personalized Recommendations in Gaming Industry
AI/ML PoweredPersonalized Recommendations in Gaming IndustryAI/ML PoweredPersonalized Recommendations in Gaming Industry
AI/ML Powered Personalized Recommendations in Gaming Industry
Hasan Basri AKIRMAK, MSc,ExecMBA
 
Azure Machine Learning
Azure Machine LearningAzure Machine Learning
Azure Machine Learning
Mostafa
 
Azure Machine Learning Intro
Azure Machine Learning IntroAzure Machine Learning Intro
Azure Machine Learning Intro
Damir Dobric
 
Guiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineGuiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning Pipeline
Michael Gerke
 
Azure Machine Learning 101
Azure Machine Learning 101Azure Machine Learning 101
Azure Machine Learning 101
Andrew Badera
 
Borys Rybak “Azure Machine Learning Studio & Azure Workbench & R + Python”
Borys Rybak “Azure Machine Learning Studio & Azure Workbench & R + Python” Borys Rybak “Azure Machine Learning Studio & Azure Workbench & R + Python”
Borys Rybak “Azure Machine Learning Studio & Azure Workbench & R + Python”
Lviv Startup Club
 
Real-time Recommendations for Retail: Architecture, Algorithms, and Design
Real-time Recommendations for Retail: Architecture, Algorithms, and DesignReal-time Recommendations for Retail: Architecture, Algorithms, and Design
Real-time Recommendations for Retail: Architecture, Algorithms, and Design
Juliet Hougland
 
Square's Machine Learning Infrastructure and Applications - Rong Yan
Square's Machine Learning Infrastructure and Applications - Rong YanSquare's Machine Learning Infrastructure and Applications - Rong Yan
Square's Machine Learning Infrastructure and Applications - Rong Yan
Hakka Labs
 
How Artificial Intelligence & Machine Learning Are Transforming Modern Marketing
How Artificial Intelligence & Machine Learning Are Transforming Modern MarketingHow Artificial Intelligence & Machine Learning Are Transforming Modern Marketing
How Artificial Intelligence & Machine Learning Are Transforming Modern Marketing
CleverTap
 
The Power of Auto ML and How Does it Work
The Power of Auto ML and How Does it WorkThe Power of Auto ML and How Does it Work
The Power of Auto ML and How Does it Work
Ivo Andreev
 
Artificial Intelligence at LinkedIn
Artificial Intelligence at LinkedInArtificial Intelligence at LinkedIn
Artificial Intelligence at LinkedIn
Bill Liu
 
Azure Machine Learning and ML on Premises
Azure Machine Learning and ML on PremisesAzure Machine Learning and ML on Premises
Azure Machine Learning and ML on Premises
Ivo Andreev
 
Introduction to Azure Machine Learning
Introduction to Azure Machine LearningIntroduction to Azure Machine Learning
Introduction to Azure Machine Learning
Paul Prae
 

Similar to Machine Learning 101 - AWS Machine Learning Web Day (18)

1시간만에 머신러닝 개념 따라 잡기
1시간만에 머신러닝 개념 따라 잡기1시간만에 머신러닝 개념 따라 잡기
1시간만에 머신러닝 개념 따라 잡기
Sungmin Kim
 
Building Applications with Apache MXNet
Building Applications with Apache MXNetBuilding Applications with Apache MXNet
Building Applications with Apache MXNet
Apache MXNet
 
Machine Learning Fundamentals
Machine Learning FundamentalsMachine Learning Fundamentals
Machine Learning Fundamentals
SigOpt
 
An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)
Julien SIMON
 
Build Your Recommendation Engine on AWS Today!
Build Your Recommendation Engine on AWS Today!Build Your Recommendation Engine on AWS Today!
Build Your Recommendation Engine on AWS Today!
AWS Germany
 
Deep Learning Fundamentals
Deep Learning FundamentalsDeep Learning Fundamentals
Deep Learning Fundamentals
Thomas Delteil
 
Chaos Engineering: Why Breaking Things Should Be Practised.
Chaos Engineering: Why Breaking Things Should Be Practised.Chaos Engineering: Why Breaking Things Should Be Practised.
Chaos Engineering: Why Breaking Things Should Be Practised.
Adrian Hornsby
 
Ml intro
Ml introMl intro
Ml intro
Si Krishan
 
Ml intro
Ml introMl intro
Ml intro
Si Krishan
 
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATAPREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
DotNetCampus
 
Net campus2015 antimomusone
Net campus2015 antimomusoneNet campus2015 antimomusone
Net campus2015 antimomusone
DotNetCampus
 
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
Yotam Yarden
 
Machine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroMachine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An Intro
Si Krishan
 
Deep Learning with MXNet
Deep Learning with MXNetDeep Learning with MXNet
Deep Learning with MXNet
Cyrus Moazami-Vahid
 
Data Science for Energy Efficiency (Dmytro Mindra Technology Stream)
Data Science for Energy Efficiency (Dmytro Mindra Technology Stream)Data Science for Energy Efficiency (Dmytro Mindra Technology Stream)
Data Science for Energy Efficiency (Dmytro Mindra Technology Stream)
IT Arena
 
Aws cloud computing conference
Aws cloud computing conferenceAws cloud computing conference
Aws cloud computing conference
Anjani Phuyal
 
Amazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsAmazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithms
MLconf
 
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
DataWorks Summit
 
1시간만에 머신러닝 개념 따라 잡기
1시간만에 머신러닝 개념 따라 잡기1시간만에 머신러닝 개념 따라 잡기
1시간만에 머신러닝 개념 따라 잡기
Sungmin Kim
 
Building Applications with Apache MXNet
Building Applications with Apache MXNetBuilding Applications with Apache MXNet
Building Applications with Apache MXNet
Apache MXNet
 
Machine Learning Fundamentals
Machine Learning FundamentalsMachine Learning Fundamentals
Machine Learning Fundamentals
SigOpt
 
An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)
Julien SIMON
 
Build Your Recommendation Engine on AWS Today!
Build Your Recommendation Engine on AWS Today!Build Your Recommendation Engine on AWS Today!
Build Your Recommendation Engine on AWS Today!
AWS Germany
 
Deep Learning Fundamentals
Deep Learning FundamentalsDeep Learning Fundamentals
Deep Learning Fundamentals
Thomas Delteil
 
Chaos Engineering: Why Breaking Things Should Be Practised.
Chaos Engineering: Why Breaking Things Should Be Practised.Chaos Engineering: Why Breaking Things Should Be Practised.
Chaos Engineering: Why Breaking Things Should Be Practised.
Adrian Hornsby
 
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATAPREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
PREDICT THE FUTURE , MACHINE LEARNING & BIG DATA
DotNetCampus
 
Net campus2015 antimomusone
Net campus2015 antimomusoneNet campus2015 antimomusone
Net campus2015 antimomusone
DotNetCampus
 
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
Build Your Recommendation Engine on AWS Today - AWS Summit Berlin 2018
Yotam Yarden
 
Machine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroMachine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An Intro
Si Krishan
 
Data Science for Energy Efficiency (Dmytro Mindra Technology Stream)
Data Science for Energy Efficiency (Dmytro Mindra Technology Stream)Data Science for Energy Efficiency (Dmytro Mindra Technology Stream)
Data Science for Energy Efficiency (Dmytro Mindra Technology Stream)
IT Arena
 
Aws cloud computing conference
Aws cloud computing conferenceAws cloud computing conference
Aws cloud computing conference
Anjani Phuyal
 
Amazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsAmazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithms
MLconf
 
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
DataWorks Summit
 
Ad

More from AWS Germany (20)

Analytics Web Day | From Theory to Practice: Big Data Stories from the Field
Analytics Web Day | From Theory to Practice: Big Data Stories from the FieldAnalytics Web Day | From Theory to Practice: Big Data Stories from the Field
Analytics Web Day | From Theory to Practice: Big Data Stories from the Field
AWS Germany
 
Analytics Web Day | Query your Data in S3 with SQL and optimize for Cost and ...
Analytics Web Day | Query your Data in S3 with SQL and optimize for Cost and ...Analytics Web Day | Query your Data in S3 with SQL and optimize for Cost and ...
Analytics Web Day | Query your Data in S3 with SQL and optimize for Cost and ...
AWS Germany
 
Modern Applications Web Day | Impress Your Friends with Your First Serverless...
Modern Applications Web Day | Impress Your Friends with Your First Serverless...Modern Applications Web Day | Impress Your Friends with Your First Serverless...
Modern Applications Web Day | Impress Your Friends with Your First Serverless...
AWS Germany
 
Modern Applications Web Day | Manage Your Infrastructure and Configuration on...
Modern Applications Web Day | Manage Your Infrastructure and Configuration on...Modern Applications Web Day | Manage Your Infrastructure and Configuration on...
Modern Applications Web Day | Manage Your Infrastructure and Configuration on...
AWS Germany
 
Modern Applications Web Day | Container Workloads on AWS
Modern Applications Web Day | Container Workloads on AWSModern Applications Web Day | Container Workloads on AWS
Modern Applications Web Day | Container Workloads on AWS
AWS Germany
 
Modern Applications Web Day | Continuous Delivery to Amazon EKS with Spinnaker
Modern Applications Web Day | Continuous Delivery to Amazon EKS with SpinnakerModern Applications Web Day | Continuous Delivery to Amazon EKS with Spinnaker
Modern Applications Web Day | Continuous Delivery to Amazon EKS with Spinnaker
AWS Germany
 
Building Smart Home skills for Alexa
Building Smart Home skills for AlexaBuilding Smart Home skills for Alexa
Building Smart Home skills for Alexa
AWS Germany
 
Hotel or Taxi? "Sorting hat" for travel expenses with AWS ML infrastructure
Hotel or Taxi? "Sorting hat" for travel expenses with AWS ML infrastructureHotel or Taxi? "Sorting hat" for travel expenses with AWS ML infrastructure
Hotel or Taxi? "Sorting hat" for travel expenses with AWS ML infrastructure
AWS Germany
 
Wild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
Wild Rydes with Big Data/Kinesis focus: AWS Serverless WorkshopWild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
Wild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
AWS Germany
 
Log Analytics with AWS
Log Analytics with AWSLog Analytics with AWS
Log Analytics with AWS
AWS Germany
 
Deep Dive into Concepts and Tools for Analyzing Streaming Data on AWS
Deep Dive into Concepts and Tools for Analyzing Streaming Data on AWS Deep Dive into Concepts and Tools for Analyzing Streaming Data on AWS
Deep Dive into Concepts and Tools for Analyzing Streaming Data on AWS
AWS Germany
 
AWS Programme für Nonprofits
AWS Programme für NonprofitsAWS Programme für Nonprofits
AWS Programme für Nonprofits
AWS Germany
 
Microservices and Data Design
Microservices and Data DesignMicroservices and Data Design
Microservices and Data Design
AWS Germany
 
Serverless vs. Developers – the real crash
Serverless vs. Developers – the real crashServerless vs. Developers – the real crash
Serverless vs. Developers – the real crash
AWS Germany
 
Query your data in S3 with SQL and optimize for cost and performance
Query your data in S3 with SQL and optimize for cost and performanceQuery your data in S3 with SQL and optimize for cost and performance
Query your data in S3 with SQL and optimize for cost and performance
AWS Germany
 
Secret Management with Hashicorp’s Vault
Secret Management with Hashicorp’s VaultSecret Management with Hashicorp’s Vault
Secret Management with Hashicorp’s Vault
AWS Germany
 
EKS Workshop
 EKS Workshop EKS Workshop
EKS Workshop
AWS Germany
 
Scale to Infinity with ECS
Scale to Infinity with ECSScale to Infinity with ECS
Scale to Infinity with ECS
AWS Germany
 
Containers on AWS - State of the Union
Containers on AWS - State of the UnionContainers on AWS - State of the Union
Containers on AWS - State of the Union
AWS Germany
 
Deploying and Scaling Your First Cloud Application with Amazon Lightsail
Deploying and Scaling Your First Cloud Application with Amazon LightsailDeploying and Scaling Your First Cloud Application with Amazon Lightsail
Deploying and Scaling Your First Cloud Application with Amazon Lightsail
AWS Germany
 
Analytics Web Day | From Theory to Practice: Big Data Stories from the Field
Analytics Web Day | From Theory to Practice: Big Data Stories from the FieldAnalytics Web Day | From Theory to Practice: Big Data Stories from the Field
Analytics Web Day | From Theory to Practice: Big Data Stories from the Field
AWS Germany
 
Analytics Web Day | Query your Data in S3 with SQL and optimize for Cost and ...
Analytics Web Day | Query your Data in S3 with SQL and optimize for Cost and ...Analytics Web Day | Query your Data in S3 with SQL and optimize for Cost and ...
Analytics Web Day | Query your Data in S3 with SQL and optimize for Cost and ...
AWS Germany
 
Modern Applications Web Day | Impress Your Friends with Your First Serverless...
Modern Applications Web Day | Impress Your Friends with Your First Serverless...Modern Applications Web Day | Impress Your Friends with Your First Serverless...
Modern Applications Web Day | Impress Your Friends with Your First Serverless...
AWS Germany
 
Modern Applications Web Day | Manage Your Infrastructure and Configuration on...
Modern Applications Web Day | Manage Your Infrastructure and Configuration on...Modern Applications Web Day | Manage Your Infrastructure and Configuration on...
Modern Applications Web Day | Manage Your Infrastructure and Configuration on...
AWS Germany
 
Modern Applications Web Day | Container Workloads on AWS
Modern Applications Web Day | Container Workloads on AWSModern Applications Web Day | Container Workloads on AWS
Modern Applications Web Day | Container Workloads on AWS
AWS Germany
 
Modern Applications Web Day | Continuous Delivery to Amazon EKS with Spinnaker
Modern Applications Web Day | Continuous Delivery to Amazon EKS with SpinnakerModern Applications Web Day | Continuous Delivery to Amazon EKS with Spinnaker
Modern Applications Web Day | Continuous Delivery to Amazon EKS with Spinnaker
AWS Germany
 
Building Smart Home skills for Alexa
Building Smart Home skills for AlexaBuilding Smart Home skills for Alexa
Building Smart Home skills for Alexa
AWS Germany
 
Hotel or Taxi? "Sorting hat" for travel expenses with AWS ML infrastructure
Hotel or Taxi? "Sorting hat" for travel expenses with AWS ML infrastructureHotel or Taxi? "Sorting hat" for travel expenses with AWS ML infrastructure
Hotel or Taxi? "Sorting hat" for travel expenses with AWS ML infrastructure
AWS Germany
 
Wild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
Wild Rydes with Big Data/Kinesis focus: AWS Serverless WorkshopWild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
Wild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
AWS Germany
 
Log Analytics with AWS
Log Analytics with AWSLog Analytics with AWS
Log Analytics with AWS
AWS Germany
 
Deep Dive into Concepts and Tools for Analyzing Streaming Data on AWS
Deep Dive into Concepts and Tools for Analyzing Streaming Data on AWS Deep Dive into Concepts and Tools for Analyzing Streaming Data on AWS
Deep Dive into Concepts and Tools for Analyzing Streaming Data on AWS
AWS Germany
 
AWS Programme für Nonprofits
AWS Programme für NonprofitsAWS Programme für Nonprofits
AWS Programme für Nonprofits
AWS Germany
 
Microservices and Data Design
Microservices and Data DesignMicroservices and Data Design
Microservices and Data Design
AWS Germany
 
Serverless vs. Developers – the real crash
Serverless vs. Developers – the real crashServerless vs. Developers – the real crash
Serverless vs. Developers – the real crash
AWS Germany
 
Query your data in S3 with SQL and optimize for cost and performance
Query your data in S3 with SQL and optimize for cost and performanceQuery your data in S3 with SQL and optimize for cost and performance
Query your data in S3 with SQL and optimize for cost and performance
AWS Germany
 
Secret Management with Hashicorp’s Vault
Secret Management with Hashicorp’s VaultSecret Management with Hashicorp’s Vault
Secret Management with Hashicorp’s Vault
AWS Germany
 
Scale to Infinity with ECS
Scale to Infinity with ECSScale to Infinity with ECS
Scale to Infinity with ECS
AWS Germany
 
Containers on AWS - State of the Union
Containers on AWS - State of the UnionContainers on AWS - State of the Union
Containers on AWS - State of the Union
AWS Germany
 
Deploying and Scaling Your First Cloud Application with Amazon Lightsail
Deploying and Scaling Your First Cloud Application with Amazon LightsailDeploying and Scaling Your First Cloud Application with Amazon Lightsail
Deploying and Scaling Your First Cloud Application with Amazon Lightsail
AWS Germany
 
Ad

Recently uploaded (20)

Limecraft Webinar - 2025.3 release, featuring Content Delivery, Graphic Conte...
Limecraft Webinar - 2025.3 release, featuring Content Delivery, Graphic Conte...Limecraft Webinar - 2025.3 release, featuring Content Delivery, Graphic Conte...
Limecraft Webinar - 2025.3 release, featuring Content Delivery, Graphic Conte...
Maarten Verwaest
 
Agentic Automation - Delhi UiPath Community Meetup
Agentic Automation - Delhi UiPath Community MeetupAgentic Automation - Delhi UiPath Community Meetup
Agentic Automation - Delhi UiPath Community Meetup
Manoj Batra (1600 + Connections)
 
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Alan Dix
 
Artificial_Intelligence_in_Everyday_Life.pptx
Artificial_Intelligence_in_Everyday_Life.pptxArtificial_Intelligence_in_Everyday_Life.pptx
Artificial_Intelligence_in_Everyday_Life.pptx
03ANMOLCHAURASIYA
 
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
Lorenzo Miniero
 
Harmonizing Multi-Agent Intelligence | Open Data Science Conference | Gary Ar...
Harmonizing Multi-Agent Intelligence | Open Data Science Conference | Gary Ar...Harmonizing Multi-Agent Intelligence | Open Data Science Conference | Gary Ar...
Harmonizing Multi-Agent Intelligence | Open Data Science Conference | Gary Ar...
Gary Arora
 
IT488 Wireless Sensor Networks_Information Technology
IT488 Wireless Sensor Networks_Information TechnologyIT488 Wireless Sensor Networks_Information Technology
IT488 Wireless Sensor Networks_Information Technology
SHEHABALYAMANI
 
React Native for Business Solutions: Building Scalable Apps for Success
React Native for Business Solutions: Building Scalable Apps for SuccessReact Native for Business Solutions: Building Scalable Apps for Success
React Native for Business Solutions: Building Scalable Apps for Success
Amelia Swank
 
Distributionally Robust Statistical Verification with Imprecise Neural Networks
Distributionally Robust Statistical Verification with Imprecise Neural NetworksDistributionally Robust Statistical Verification with Imprecise Neural Networks
Distributionally Robust Statistical Verification with Imprecise Neural Networks
Ivan Ruchkin
 
AI-proof your career by Olivier Vroom and David WIlliamson
AI-proof your career by Olivier Vroom and David WIlliamsonAI-proof your career by Olivier Vroom and David WIlliamson
AI-proof your career by Olivier Vroom and David WIlliamson
UXPA Boston
 
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
ICT Frame Magazine Pvt. Ltd.
 
May Patch Tuesday
May Patch TuesdayMay Patch Tuesday
May Patch Tuesday
Ivanti
 
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
Toru Tamaki
 
Shoehorning dependency injection into a FP language, what does it take?
Shoehorning dependency injection into a FP language, what does it take?Shoehorning dependency injection into a FP language, what does it take?
Shoehorning dependency injection into a FP language, what does it take?
Eric Torreborre
 
machines-for-woodworking-shops-en-compressed.pdf
machines-for-woodworking-shops-en-compressed.pdfmachines-for-woodworking-shops-en-compressed.pdf
machines-for-woodworking-shops-en-compressed.pdf
AmirStern2
 
Config 2025 presentation recap covering both days
Config 2025 presentation recap covering both daysConfig 2025 presentation recap covering both days
Config 2025 presentation recap covering both days
TrishAntoni1
 
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More MachinesRefactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Leon Anavi
 
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptxUiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
anabulhac
 
Design pattern talk by Kaya Weers - 2025 (v2)
Design pattern talk by Kaya Weers - 2025 (v2)Design pattern talk by Kaya Weers - 2025 (v2)
Design pattern talk by Kaya Weers - 2025 (v2)
Kaya Weers
 
Mastering Testing in the Modern F&B Landscape
Mastering Testing in the Modern F&B LandscapeMastering Testing in the Modern F&B Landscape
Mastering Testing in the Modern F&B Landscape
marketing943205
 
Limecraft Webinar - 2025.3 release, featuring Content Delivery, Graphic Conte...
Limecraft Webinar - 2025.3 release, featuring Content Delivery, Graphic Conte...Limecraft Webinar - 2025.3 release, featuring Content Delivery, Graphic Conte...
Limecraft Webinar - 2025.3 release, featuring Content Delivery, Graphic Conte...
Maarten Verwaest
 
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Who's choice? Making decisions with and about Artificial Intelligence, Keele ...
Alan Dix
 
Artificial_Intelligence_in_Everyday_Life.pptx
Artificial_Intelligence_in_Everyday_Life.pptxArtificial_Intelligence_in_Everyday_Life.pptx
Artificial_Intelligence_in_Everyday_Life.pptx
03ANMOLCHAURASIYA
 
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
RTP Over QUIC: An Interesting Opportunity Or Wasted Time?
Lorenzo Miniero
 
Harmonizing Multi-Agent Intelligence | Open Data Science Conference | Gary Ar...
Harmonizing Multi-Agent Intelligence | Open Data Science Conference | Gary Ar...Harmonizing Multi-Agent Intelligence | Open Data Science Conference | Gary Ar...
Harmonizing Multi-Agent Intelligence | Open Data Science Conference | Gary Ar...
Gary Arora
 
IT488 Wireless Sensor Networks_Information Technology
IT488 Wireless Sensor Networks_Information TechnologyIT488 Wireless Sensor Networks_Information Technology
IT488 Wireless Sensor Networks_Information Technology
SHEHABALYAMANI
 
React Native for Business Solutions: Building Scalable Apps for Success
React Native for Business Solutions: Building Scalable Apps for SuccessReact Native for Business Solutions: Building Scalable Apps for Success
React Native for Business Solutions: Building Scalable Apps for Success
Amelia Swank
 
Distributionally Robust Statistical Verification with Imprecise Neural Networks
Distributionally Robust Statistical Verification with Imprecise Neural NetworksDistributionally Robust Statistical Verification with Imprecise Neural Networks
Distributionally Robust Statistical Verification with Imprecise Neural Networks
Ivan Ruchkin
 
AI-proof your career by Olivier Vroom and David WIlliamson
AI-proof your career by Olivier Vroom and David WIlliamsonAI-proof your career by Olivier Vroom and David WIlliamson
AI-proof your career by Olivier Vroom and David WIlliamson
UXPA Boston
 
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
MULTI-STAKEHOLDER CONSULTATION PROGRAM On Implementation of DNF 2.0 and Way F...
ICT Frame Magazine Pvt. Ltd.
 
May Patch Tuesday
May Patch TuesdayMay Patch Tuesday
May Patch Tuesday
Ivanti
 
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
Toru Tamaki
 
Shoehorning dependency injection into a FP language, what does it take?
Shoehorning dependency injection into a FP language, what does it take?Shoehorning dependency injection into a FP language, what does it take?
Shoehorning dependency injection into a FP language, what does it take?
Eric Torreborre
 
machines-for-woodworking-shops-en-compressed.pdf
machines-for-woodworking-shops-en-compressed.pdfmachines-for-woodworking-shops-en-compressed.pdf
machines-for-woodworking-shops-en-compressed.pdf
AmirStern2
 
Config 2025 presentation recap covering both days
Config 2025 presentation recap covering both daysConfig 2025 presentation recap covering both days
Config 2025 presentation recap covering both days
TrishAntoni1
 
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More MachinesRefactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Leon Anavi
 
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptxUiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
UiPath AgentHack - Build the AI agents of tomorrow_Enablement 1.pptx
anabulhac
 
Design pattern talk by Kaya Weers - 2025 (v2)
Design pattern talk by Kaya Weers - 2025 (v2)Design pattern talk by Kaya Weers - 2025 (v2)
Design pattern talk by Kaya Weers - 2025 (v2)
Kaya Weers
 
Mastering Testing in the Modern F&B Landscape
Mastering Testing in the Modern F&B LandscapeMastering Testing in the Modern F&B Landscape
Mastering Testing in the Modern F&B Landscape
marketing943205
 

Machine Learning 101 - AWS Machine Learning Web Day

  • 1. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Michael Brückner Manager Machine Learning 25/02/2016 Machine Learning 101
  • 2. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda • What is Machine Learning and why do we need it? • Model Building • Model Evaluation & Tuning
  • 3. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. What is Machine Learning? Methods and Systems that … Adapt based on recorded data Predict new data based on recorded data Optimize an action given a utility function Extract hidden structure from the data Summarize data into concise descriptions
  • 4. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. What is Machine Learning NOT? Methods and Systems that … can yield Garbage-In Knowledge- Out perform well without data modeling & feature engineering avoid the curse-of- dimensionality are a replacement for business rules
  • 5. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Infer-Predict-Decide Cycle Inference Build & evaluate Predictor Prediction Apply the learned Predictor Decision Making Adjust Business loss and get new/more data
  • 6. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. What for? Automate tasks, which typically require humans in order to • scale • improve over humans (non-experts) • preserve privacy or solve tasks that are impossible for humans
  • 7. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Examples: Personalized Recommandation • Input:
  • 8. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Examples: Personalized Recommandation • Output:
  • 9. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Examples: Face Detection & Recognition Face detection • Input: image • Output: face position Face recognition • Input: face (image & face position) • Output: person’s name
  • 10. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Examples: Full-Text Translation • Input: text in one language • Output: text of another language
  • 11. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Examples: Spam Filtering • Input: email (text, images, …) • Output: spam/non-spam flag • Challenges: • extremely high precision for legitimate emails • spam changes constantly • noisy ground truth
  • 12. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Supervised Machine Learning 1. Model problem in terms of input data and output data 2. Collect sample of input-output pairs 3. Learn a mapping that produces the output given the input 4. Apply this function on new inputs to make predictions
  • 13. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. A Programer’s Perspective Traditional Programming (Predicting) Supervised Machine Learning Computer Input Data Mapping Output Data Computer Input Data Output Data Mapping
  • 14. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Advantages • Use data instead of intuition to derive the mapping • Can solve very complex tasks • Can adapt to new situations (collect more data) • Does not require much expert knowledge
  • 15. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Input Data Description Type Cost Actual Cost Diff In Catalogue Movies Entertainment $50 $28 $22 Yes Music (CDs, MP3s, etc.) $500 $30 $470 No Sporting Events Entertainment $0 $40 ($40) No Dining Out Food $1,000 $1,200 ($200) Yes Groceries $100 $0 $100 Yes Charity 1 Gifts and Charity $200 $200 $0 No Charity 2 $500 $500 $0 No Cable/Satellite Housing $100 $100 $0 Yes Electric Housing $45 $40 $5 Yes Mortgage or Rent $700 $700 $0 Yes Health Insurance $400 $400 $0 Yes Home Insurance $400 $400 $0 No Credit Card 1 $0 Yes Dataset Categorical Data Missing Data Binary Data Numerical Data Attribute Name Attribute Value Attribute Text Data
  • 16. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Description Type Cost Actual Cost Diff In Catalogue Movies Entertainment $50 $28 $22 Yes Music (CDs, MP3s, etc.) ? $500 $30 $470 No Sporting Events Entertainment $0 $40 ($40) No Dining Out Food $1,000 $1,200 ($200) Yes Groceries ? $100 $0 $100 Yes Charity 1 Gifts and Charity $200 $200 $0 No Charity 2 ? $500 $500 $0 No Cable/Satellite Housing $100 $100 $0 Yes Electric Housing $45 $40 $5 Yes Mortgage or Rent ? $700 $700 $0 Yes Health Insurance $400 $400 $0 Yes Home Insurance $400 $400 $0 No Credit Card 1 ? $0 Yes Output Data Target Attribute Values Target Attribute
  • 17. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda • What is Machine Learning and why do we need it? • Model Building • Model Evaluation & Tuning
  • 18. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Problem Setting • Input: vector of observable attributes, x • Output: target attribute value, y • Training data: pairs of input and corresponding output, D = (x1,y1),…,(xN,yN) • Application data: inputs only • Goal: learn mapping fw:x ↦ y Predictor
  • 19. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Challenges in Model Building • Which function class for Predictor (data modeling)? • How to pre-process the data (feature engineering)? • How to learn this Predictor from our training data? • How to generalize to new data?
  • 20. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Which function class for Predictor? Types of prediction tasks (output type): • Binary Classification ⇒ binary target y  {–1, +1} • Multinomial Classification ⇒ categorical target y  {1… K} • Regression ⇒ numeric target y  [l,u]  R
  • 21. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Which function class for Binary Classification? • Decision Tree + +- - - x2 > 7? no yes + + + + + x1 < 3? no yes x2 < 5? no yes x1 < 1? no yes + + - - x2 x1 1 3 5 7
  • 22. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Which function class for Binary Classification? • Decision Tree +- x2 > 7? no yes + x1 < 3? no yes x2 < 5? no yes x1 < 1? no yes + - x2 x1 + - -
  • 23. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Which function class for Binary Classification? • Linear function • binary target attribute values y  {–1, +1} x2 x1 Hw + - y(x) = sign( fw (x)) Hw ={x| fw (x) = xT w+w0 = 0} ^
  • 24. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Which function class for Binary Classification? • Generalized linear function (Kernel methods) • Layered Generalized linear function (Neural Networks) • Ensemble of functions • … x2 x1 + - + -
  • 25. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to pre-process the data? • Predictor’s function class defined for limited input domain ⇒ transform/extract attributes first (pre-processing) • Number to (normalized) Number: • z-standardization, min-max normalization • Number to Category: • Binning (quantile, equidistant) • Category to (numeric) Vector: • One-hot encoding
  • 26. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to pre-process the data? • Predictor’s function class defined for limited input domain ⇒ transform/extract attributes first (pre-processing) • Text to (numeric) Vector: • Normalization, tokenization, stemming • Bag-of-Words, Bag-of-NGrams, TI-IDF ⇒ sparse vector • Latent word embedding (LSI, word2vec, LDA) ⇒ dense vector • Image to (numeric) Vector: • HoG, DAISY, color histogram
  • 27. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to learn a Predictor? • Loss of Predictor fw:x ↦ y for a given input-output pair: Loss function PredictionGround Truth L(y, fw (x))
  • 28. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to learn a Predictor? Loss functions for binary classification (target ):y Î{-1,+1}
  • 29. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to learn a Predictor? Function Class Loss Function Learning Algorithm Decision Trees 0/1 loss ID3 Decision Trees Quadratic loss CART Linear function Quadratic loss Least-squares regression Linear function Logistic loss Logistic regression Linear function Hinge loss Support Vector Machines Layered Generalized Linear function Logistic loss Neural Networks (Binary Classification) Layered Generalized Linear function Quadratic loss Neural Networks (Regression)
  • 30. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to learn a Predictor? • Theoretical Risk: • Empirical Risk: Average over all possible data Average over training data
  • 31. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to learn a Predictor? • Prediction depends on Predictor with model parameters w • Minimize Risk w.r.t. those model parameters w ⇒ mathematical Optimisation Problem • Gradient-based first or second-order methods • Coordinate-descent methods • (Greedy) Search y(x)^ fw
  • 32. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to generalize to new data? Error Model Complexity
  • 33. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to generalize to new data? • Empirical Risk: • Structural Risk: Regularizer
  • 34. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda • What is Machine Learning and why do we need it? • Model Building • Model Evaluation & Tuning
  • 35. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Performance for Binary Classification Total number of data points (N) True Target positive negative Predicted Target positive True Positive False Positive negative False Negative True Negative
  • 36. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Performance for Binary Classification • Accuracy: • Recall (true positive rate): • Precision: • Fall-out (false positive rate): TP+TN N TP TP+ FN TP TP+ FP FP TN + FP
  • 37. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Performance for Binary Classification Decision function AUC (Area Under roc Curve) y(x) = sign( fw (x)+b)^ Predictor Decision threshold
  • 38. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Training vs. Test Performance How do we know that a Predictor works well on new data? Small error on training data ≠ small error on new data (test data)!
  • 39. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Hold-out Evaluation • Put some data aside before training = test data • Use this hold-out data for evaluation • Disadvantages: • What if we were (un)lucky when choosing the hold-out data? • We do NOT use all the data for model training!
  • 40. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. K-Fold Cross Validation-based Evaluation • Split data into K partitions (folds) • Take all but one partition to train a Predictor • Evaluate Predictor on the left-out partition • Repeat this for all partitions • Average performance for all K evaluations • Finally train a Predictor on all data
  • 41. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Model Tuning Learning methods and Predictors have hyper-parameters • Amount of regularization • Choice of loss function • Decision threshold score • Learning rate • …
  • 42. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Example: Decision threshold Decision threshold
  • 43. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to choose hyper-parameters? Grid Search: • Evaluate Predictor for all grid points (hyper-parameter combinations) • Take best grid point Very expensive!   2 10 0 10 2 10 1 2 0 2 1 2
  • 44. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. How to choose hyper-parameters? Bayesian Optimisation: • Learn model to predict evaluation outcomes • Evaluate Predictor only for promising grid points • Take best grid point after fixed number of evaluations   2 10 0 10 2 10 1 2 0 2 1 2
  • 45. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Common Pitfalls • Model tuning is part of training ⇒ Do NOT use test data or test CV partitions! • Use proper grid resolution and axis scaling • Use same metric for tuning as for evaluation
  • 46. Thank you! © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
  翻译: