SlideShare a Scribd company logo
Tailoring Machine
Learning Practices
to Support
Prescriptive
Analytics
Anthony Melson
Data
Optimization
Decision Science
Induction
OR
Deduction
Statistics
What-If
Business
Processes
Cost/Benefit
NLP
Classification
Regression
Narrowing the Scope
Subject Matter
• Models: Classifiers
• Problems: Decision (Yes/No)
Goals
• Probabilistic and Label Outputs
• Deterministic and Non-
Deterministic Decision-Making
Strategy 2: Incorporate Knowledge Prescriptive Use into
the Machine Learning Pipeline
Strategy 1: Design Classifiers For Both Types of Decisions
A Closer Look
at Classifiers
Traditional
Classifier Pipeline
• Abstract
• Accuracy
• Indifference
• Class Labels
• Little Postprocessing
Classifier Performance
Many Metrics: TPR, FPR, Recall, Precision…
Accuracy: (TP + TN) / Total Observations
ROC Curve: Visualizes FPR & TPR
Purpose: Optimize Threshold
Two Types of Classifiers
Label Classifier
• Predicts Label
• Doesn’t Account For Uncertainty
• Label is a Decision
• Modifiable Thresholds
Probabilistic Classifier
• Predicts Probability of Labels
• Accounts for Uncertainty (Risk
Scores)
• Decisions Require Additional Steps
Class
Boundary
Uncertain
Space
~0 < x < ~1
Creditworthy
Not CW
A Look From Above
Label Output Probability Output
What Changes When We
Consider Prescription
Real-World
Complexity
Stakeholders
• Risk Attitudes, Outcomes
Organizations
• Business Objectives
Weighted Outcomes
• FP FN; TP TN
How Does ML Fit In?
ML In Probabilistic Decisions
ML
Models
Outcomes
&
Payouts
Organizations
Stakeholders
ML In Deterministic Decisions
ML
Models
Outcomes
&
Payouts
Organizations
Stakeholders
Uncertain
Decision
Crew Works
on Interior
Crew Works
on Exterior
Will Rain
Will Not
Strategy 1
Label
Output
Probability
Output
Modifications Move Decision
Threshold
Pass Probabilities to
Utility Functions
Organizations Align Threshold with
Objectives
Account for Objectives as
Utilities
Weighted
Outcomes Trade-off FP, TP, FN, FP
Accordingly
Risk Mitigation (Hedging)
Stakeholders Account for Risk
Attitudes
Deliberation
How Can ML Experts Respond to these Challenges?
Decision
Threshold
Optimization
Thresholding Options:
• Risk Averse/Seeking
• Maximization/Minimization
• Class or Risk Focus
Points of control:
• Threshold Location
• Optimize for outcome/s of
interest
Risk Averse Risk Seeking
Maximization
Strategy
Example:
Terminal
Medical
Diagnosis
Context:
• First of Three Benign Tests
Stakes:
• FN = Illness Goes Undetected
• FP = Further Testing
Risk Attitude:
• Averse
Organizational Objectives:
• Patient Care
Stakeholders:
• Patient, Doctor…
Course of Action:
• Move Threshold Beyond Positive
Threshold (in probabilities)
Risk Averse
Sent Home Further Testing
Example:
Terminal
Medical
Diagnosis
(variation)
Context:
• Only one test
Stakes:
• FN = Illness Goes Undetected
• FP = High-Risk Surgery
Risk Attitude:
• ?
Organizational Objectives:
• Patient Care
Stakeholders:
• Patient, Doctor…
Course of Action:
• ?
Sent Home High-Risk Surgery
?
Things to
Think About
Can Be a Max- Min-imization Tool
• Threshold For Utility/EMV
• Minimize Risk
Order/Cost of Information
• Sequence
• Price/Risk
Label Can Be Used in Deterministic Systems
• Business Processes
Great for Automated Decision
Connection
With Utility
Function
Advantages:
• Hedge Decisions
• Maximize Utility
• Account for Risks of Multiple
Decisions
• Combine Outputs from
Multiple Models
• Individual or Batches
• Assess Risk
Example:
Max
Revenue for
Wine
Merchant
Context:
• Wine Merchant
• Space for 30 New Wines
Stakes:
• Rev/mo for Bad Case = 200
• Rev/mo Good Case = 300
Organizational Objectives:
• Maximize Revenue, Stock Shelves
Stakeholders:
• Merchants, Customers
Course of Action:
• Probability Good/Bad
• Expected Value
• Rank Wines
• Buy Top 30
Iterate
Iterate through list
Example:
Wine
Merchant
(Italian
Variation)
Context:
• Wine Merchant
• Shelf Space for 30 New Wines
Stakes:
• Rev/mo for Bad Case = 200
• Rev/mo Good Case = 300
Organizational Objectives:
• Maximize Revenue, Stock
Shelves, Stock Italian
Stakeholders:
• Merchants, Customers
Course of Action:
• Probability Good/Bad
• Expected Value
• Re-Weight Italian Wines
• Rank Wines
• Buy Top 30
Iterate
Iterate through list
Things to
Think About
Batch vs Individual
• Calibration (Especially People)
• Difference in Risk Attitudes
Utilities Other Than Money
• Ethics, Laws, Norms
• Predictability
• Health
• Anything Hard to Put Monetary Value On
Strategy 2
Label Output Probability Output
Feature
Selection
IF Experiment: Include
Custom Scoring
IF Not: Evaluate with
Custom Scoring
Evaluate with
Cross-Entropy/Log-Loss
Possibly Others
Parameter
Tuning
Loss Function
Selection
Trade-off FP, TP, FN, FP
Accordingly
Note: High-Risk
Select
Cross-Entropy/Log-Loss
Model Selection
Evaluate with
Total Value OR
Risk Attitude
Evaluate with
Cross-Entropy/Log-Loss
Decisions in
ML Pipeline
• Feature Selection
• Algorithm Selection
• Loss Function
• Parameter Tuning
In Abstract
• Evaluation Metrics
(usually accuracy)
• Previous Experience
In Business Context
• Based on Outcomes
How do we make these decisions?
Feature
Selection
Wrapper Methods
• Builds models to select
features
• Selects highest scoring set
Points of control:
• Custom Scoring
• Selection based on outcomes
of interest
Hyper-
Parameter
Tuning
Search Types
• Grid Search
• Random Search
• Many Others
Points of control:
• Custom Scoring
• Selection based on outcomes
of interest
Loss
Function
Loss Functions
• Cross-Entropy
• Hinge Loss
• Many Others
Points of control:
• Selection based on Use-Case
• Selection based on outcomes
• Generation based on outcomes
• Note: Risky to modify
Model
Selection
Selection Methods
• Bias/Variance
• Scoring Metrics
• Many Others
Points of control:
• Custom Scoring
• Selection based on outcomes
• Most Important (Must Be
Exclusive in Part)
Brief Conclusions
Conclusions
Don’t Over-Focus on Accuracy
• Outcomes
• Context
• Stakeholders
• Organizations
Keep the Use-Case in the Process
• Choose the Right Classifier
• Make Decisions Based on Application
Work With Domain Experts and Prescriptive
Analysts
• Model Consumption/Utilization
• Get Utilities and Risk Attitudes
Happy
Classifying
Ad

More Related Content

Similar to Tailoring machine learning practices to support prescriptive analytics (20)

Why Customers Buy | Conjoint Analysis: Unlocking the Secret to What Your Cu...
Why Customers Buy  |  Conjoint Analysis: Unlocking the Secret to What Your Cu...Why Customers Buy  |  Conjoint Analysis: Unlocking the Secret to What Your Cu...
Why Customers Buy | Conjoint Analysis: Unlocking the Secret to What Your Cu...
Qualtrics
 
MIS Unit-2.pptx
MIS Unit-2.pptxMIS Unit-2.pptx
MIS Unit-2.pptx
ZulfequarAliAhmad
 
Practical Tools for Measurement Systems Analysis
Practical Tools for Measurement Systems AnalysisPractical Tools for Measurement Systems Analysis
Practical Tools for Measurement Systems Analysis
Gabor Szabo, CQE
 
Regenstrief WIP 07012015
Regenstrief WIP 07012015Regenstrief WIP 07012015
Regenstrief WIP 07012015
Suranga Nath Kasthurirathne
 
Operational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss PreventionOperational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss Prevention
Michael Marshall, PE
 
CISSP Chapter 1 Risk Management
CISSP Chapter 1  Risk ManagementCISSP Chapter 1  Risk Management
CISSP Chapter 1 Risk Management
Karthikeyan Dhayalan
 
Multivariate_Data_Analysis_Session1.pptx
Multivariate_Data_Analysis_Session1.pptxMultivariate_Data_Analysis_Session1.pptx
Multivariate_Data_Analysis_Session1.pptx
apsapssingh9
 
Analytics Types.pdfdvf ifbvuibugdfiubuibubufdibhdfiubfduibhfiuvdih
Analytics Types.pdfdvf ifbvuibugdfiubuibubufdibhdfiubfduibhfiuvdihAnalytics Types.pdfdvf ifbvuibugdfiubuibubufdibhdfiubfduibhfiuvdih
Analytics Types.pdfdvf ifbvuibugdfiubuibubufdibhdfiubfduibhfiuvdih
NarishaBhawsar
 
Metrics
MetricsMetrics
Metrics
Dean Da Costa
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
Spartan60
 
Localization Metrics and KPIs
Localization Metrics and KPIsLocalization Metrics and KPIs
Localization Metrics and KPIs
RWS Moravia
 
1st Lecture.pdf
1st Lecture.pdf1st Lecture.pdf
1st Lecture.pdf
TayyabHameed15
 
Introduction to Operations -PPT.ppt
Introduction to Operations -PPT.pptIntroduction to Operations -PPT.ppt
Introduction to Operations -PPT.ppt
etebarkhmichale
 
Machine Learning with Big Data using Apache Spark
Machine Learning with Big Data using Apache SparkMachine Learning with Big Data using Apache Spark
Machine Learning with Big Data using Apache Spark
InSemble
 
Topic 1 - Risk Auditing 1-17.pdf
Topic 1 - Risk Auditing 1-17.pdfTopic 1 - Risk Auditing 1-17.pdf
Topic 1 - Risk Auditing 1-17.pdf
Javier138365
 
Dss
DssDss
Dss
Dr. Vardhan choubey
 
Discriminant analysis and its applications in business decision.pptx
Discriminant analysis and its applications in business decision.pptxDiscriminant analysis and its applications in business decision.pptx
Discriminant analysis and its applications in business decision.pptx
shruti singh
 
WEEK 9 - DATA COLLECTION GUIDELINES COMPACT.pptx
WEEK 9 - DATA COLLECTION GUIDELINES COMPACT.pptxWEEK 9 - DATA COLLECTION GUIDELINES COMPACT.pptx
WEEK 9 - DATA COLLECTION GUIDELINES COMPACT.pptx
noviantobudik
 
Sales Tips of Success for Sales representatives
Sales Tips of Success for Sales representativesSales Tips of Success for Sales representatives
Sales Tips of Success for Sales representatives
TabrezMirza1
 
Training Needs Assessment (TNA) Of emplyee.ppt
Training Needs Assessment (TNA) Of emplyee.pptTraining Needs Assessment (TNA) Of emplyee.ppt
Training Needs Assessment (TNA) Of emplyee.ppt
rizzzhayday
 
Why Customers Buy | Conjoint Analysis: Unlocking the Secret to What Your Cu...
Why Customers Buy  |  Conjoint Analysis: Unlocking the Secret to What Your Cu...Why Customers Buy  |  Conjoint Analysis: Unlocking the Secret to What Your Cu...
Why Customers Buy | Conjoint Analysis: Unlocking the Secret to What Your Cu...
Qualtrics
 
Practical Tools for Measurement Systems Analysis
Practical Tools for Measurement Systems AnalysisPractical Tools for Measurement Systems Analysis
Practical Tools for Measurement Systems Analysis
Gabor Szabo, CQE
 
Operational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss PreventionOperational Excellence in Oil and Gas Loss Prevention
Operational Excellence in Oil and Gas Loss Prevention
Michael Marshall, PE
 
Multivariate_Data_Analysis_Session1.pptx
Multivariate_Data_Analysis_Session1.pptxMultivariate_Data_Analysis_Session1.pptx
Multivariate_Data_Analysis_Session1.pptx
apsapssingh9
 
Analytics Types.pdfdvf ifbvuibugdfiubuibubufdibhdfiubfduibhfiuvdih
Analytics Types.pdfdvf ifbvuibugdfiubuibubufdibhdfiubfduibhfiuvdihAnalytics Types.pdfdvf ifbvuibugdfiubuibubufdibhdfiubfduibhfiuvdih
Analytics Types.pdfdvf ifbvuibugdfiubuibubufdibhdfiubfduibhfiuvdih
NarishaBhawsar
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
Spartan60
 
Localization Metrics and KPIs
Localization Metrics and KPIsLocalization Metrics and KPIs
Localization Metrics and KPIs
RWS Moravia
 
Introduction to Operations -PPT.ppt
Introduction to Operations -PPT.pptIntroduction to Operations -PPT.ppt
Introduction to Operations -PPT.ppt
etebarkhmichale
 
Machine Learning with Big Data using Apache Spark
Machine Learning with Big Data using Apache SparkMachine Learning with Big Data using Apache Spark
Machine Learning with Big Data using Apache Spark
InSemble
 
Topic 1 - Risk Auditing 1-17.pdf
Topic 1 - Risk Auditing 1-17.pdfTopic 1 - Risk Auditing 1-17.pdf
Topic 1 - Risk Auditing 1-17.pdf
Javier138365
 
Discriminant analysis and its applications in business decision.pptx
Discriminant analysis and its applications in business decision.pptxDiscriminant analysis and its applications in business decision.pptx
Discriminant analysis and its applications in business decision.pptx
shruti singh
 
WEEK 9 - DATA COLLECTION GUIDELINES COMPACT.pptx
WEEK 9 - DATA COLLECTION GUIDELINES COMPACT.pptxWEEK 9 - DATA COLLECTION GUIDELINES COMPACT.pptx
WEEK 9 - DATA COLLECTION GUIDELINES COMPACT.pptx
noviantobudik
 
Sales Tips of Success for Sales representatives
Sales Tips of Success for Sales representativesSales Tips of Success for Sales representatives
Sales Tips of Success for Sales representatives
TabrezMirza1
 
Training Needs Assessment (TNA) Of emplyee.ppt
Training Needs Assessment (TNA) Of emplyee.pptTraining Needs Assessment (TNA) Of emplyee.ppt
Training Needs Assessment (TNA) Of emplyee.ppt
rizzzhayday
 

More from Adam Doyle (20)

ML Ops.pptx
ML Ops.pptxML Ops.pptx
ML Ops.pptx
Adam Doyle
 
Data Engineering Roles
Data Engineering RolesData Engineering Roles
Data Engineering Roles
Adam Doyle
 
Managed Cluster Services
Managed Cluster ServicesManaged Cluster Services
Managed Cluster Services
Adam Doyle
 
Delta lake and the delta architecture
Delta lake and the delta architectureDelta lake and the delta architecture
Delta lake and the delta architecture
Adam Doyle
 
Great Expectations Presentation
Great Expectations PresentationGreat Expectations Presentation
Great Expectations Presentation
Adam Doyle
 
May 2021 Spark Testing ... or how to farm reputation on StackOverflow
May 2021 Spark Testing ... or how to farm reputation on StackOverflowMay 2021 Spark Testing ... or how to farm reputation on StackOverflow
May 2021 Spark Testing ... or how to farm reputation on StackOverflow
Adam Doyle
 
Automate your data flows with Apache NIFI
Automate your data flows with Apache NIFIAutomate your data flows with Apache NIFI
Automate your data flows with Apache NIFI
Adam Doyle
 
Apache Iceberg Presentation for the St. Louis Big Data IDEA
Apache Iceberg Presentation for the St. Louis Big Data IDEAApache Iceberg Presentation for the St. Louis Big Data IDEA
Apache Iceberg Presentation for the St. Louis Big Data IDEA
Adam Doyle
 
Localized Hadoop Development
Localized Hadoop DevelopmentLocalized Hadoop Development
Localized Hadoop Development
Adam Doyle
 
The new big data
The new big dataThe new big data
The new big data
Adam Doyle
 
Feature store Overview St. Louis Big Data IDEA Meetup aug 2020
Feature store Overview   St. Louis Big Data IDEA Meetup aug 2020Feature store Overview   St. Louis Big Data IDEA Meetup aug 2020
Feature store Overview St. Louis Big Data IDEA Meetup aug 2020
Adam Doyle
 
Snowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at ScaleSnowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at Scale
Adam Doyle
 
Operationalizing Data Science St. Louis Big Data IDEA
Operationalizing Data Science St. Louis Big Data IDEAOperationalizing Data Science St. Louis Big Data IDEA
Operationalizing Data Science St. Louis Big Data IDEA
Adam Doyle
 
Retooling on the Modern Data and Analytics Tech Stack
Retooling on the Modern Data and Analytics Tech StackRetooling on the Modern Data and Analytics Tech Stack
Retooling on the Modern Data and Analytics Tech Stack
Adam Doyle
 
Stl meetup cloudera platform - january 2020
Stl meetup   cloudera platform  - january 2020Stl meetup   cloudera platform  - january 2020
Stl meetup cloudera platform - january 2020
Adam Doyle
 
How stlrda does data
How stlrda does dataHow stlrda does data
How stlrda does data
Adam Doyle
 
Synthesis of analytical methods data driven decision-making
Synthesis of analytical methods data driven decision-makingSynthesis of analytical methods data driven decision-making
Synthesis of analytical methods data driven decision-making
Adam Doyle
 
Big Data IDEA 101 2019
Big Data IDEA 101 2019Big Data IDEA 101 2019
Big Data IDEA 101 2019
Adam Doyle
 
Data Engineering and the Data Science Lifecycle
Data Engineering and the Data Science LifecycleData Engineering and the Data Science Lifecycle
Data Engineering and the Data Science Lifecycle
Adam Doyle
 
Data engineering Stl Big Data IDEA user group
Data engineering   Stl Big Data IDEA user groupData engineering   Stl Big Data IDEA user group
Data engineering Stl Big Data IDEA user group
Adam Doyle
 
Data Engineering Roles
Data Engineering RolesData Engineering Roles
Data Engineering Roles
Adam Doyle
 
Managed Cluster Services
Managed Cluster ServicesManaged Cluster Services
Managed Cluster Services
Adam Doyle
 
Delta lake and the delta architecture
Delta lake and the delta architectureDelta lake and the delta architecture
Delta lake and the delta architecture
Adam Doyle
 
Great Expectations Presentation
Great Expectations PresentationGreat Expectations Presentation
Great Expectations Presentation
Adam Doyle
 
May 2021 Spark Testing ... or how to farm reputation on StackOverflow
May 2021 Spark Testing ... or how to farm reputation on StackOverflowMay 2021 Spark Testing ... or how to farm reputation on StackOverflow
May 2021 Spark Testing ... or how to farm reputation on StackOverflow
Adam Doyle
 
Automate your data flows with Apache NIFI
Automate your data flows with Apache NIFIAutomate your data flows with Apache NIFI
Automate your data flows with Apache NIFI
Adam Doyle
 
Apache Iceberg Presentation for the St. Louis Big Data IDEA
Apache Iceberg Presentation for the St. Louis Big Data IDEAApache Iceberg Presentation for the St. Louis Big Data IDEA
Apache Iceberg Presentation for the St. Louis Big Data IDEA
Adam Doyle
 
Localized Hadoop Development
Localized Hadoop DevelopmentLocalized Hadoop Development
Localized Hadoop Development
Adam Doyle
 
The new big data
The new big dataThe new big data
The new big data
Adam Doyle
 
Feature store Overview St. Louis Big Data IDEA Meetup aug 2020
Feature store Overview   St. Louis Big Data IDEA Meetup aug 2020Feature store Overview   St. Louis Big Data IDEA Meetup aug 2020
Feature store Overview St. Louis Big Data IDEA Meetup aug 2020
Adam Doyle
 
Snowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at ScaleSnowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at Scale
Adam Doyle
 
Operationalizing Data Science St. Louis Big Data IDEA
Operationalizing Data Science St. Louis Big Data IDEAOperationalizing Data Science St. Louis Big Data IDEA
Operationalizing Data Science St. Louis Big Data IDEA
Adam Doyle
 
Retooling on the Modern Data and Analytics Tech Stack
Retooling on the Modern Data and Analytics Tech StackRetooling on the Modern Data and Analytics Tech Stack
Retooling on the Modern Data and Analytics Tech Stack
Adam Doyle
 
Stl meetup cloudera platform - january 2020
Stl meetup   cloudera platform  - january 2020Stl meetup   cloudera platform  - january 2020
Stl meetup cloudera platform - january 2020
Adam Doyle
 
How stlrda does data
How stlrda does dataHow stlrda does data
How stlrda does data
Adam Doyle
 
Synthesis of analytical methods data driven decision-making
Synthesis of analytical methods data driven decision-makingSynthesis of analytical methods data driven decision-making
Synthesis of analytical methods data driven decision-making
Adam Doyle
 
Big Data IDEA 101 2019
Big Data IDEA 101 2019Big Data IDEA 101 2019
Big Data IDEA 101 2019
Adam Doyle
 
Data Engineering and the Data Science Lifecycle
Data Engineering and the Data Science LifecycleData Engineering and the Data Science Lifecycle
Data Engineering and the Data Science Lifecycle
Adam Doyle
 
Data engineering Stl Big Data IDEA user group
Data engineering   Stl Big Data IDEA user groupData engineering   Stl Big Data IDEA user group
Data engineering Stl Big Data IDEA user group
Adam Doyle
 
Ad

Recently uploaded (20)

50_questions_full.pptxdddddddddddddddddd
50_questions_full.pptxdddddddddddddddddd50_questions_full.pptxdddddddddddddddddd
50_questions_full.pptxdddddddddddddddddd
emir73065
 
2024 Digital Equity Accelerator Report.pdf
2024 Digital Equity Accelerator Report.pdf2024 Digital Equity Accelerator Report.pdf
2024 Digital Equity Accelerator Report.pdf
dominikamizerska1
 
Z14_IBM__APL_by_Christian_Demmer_IBM.pdf
Z14_IBM__APL_by_Christian_Demmer_IBM.pdfZ14_IBM__APL_by_Christian_Demmer_IBM.pdf
Z14_IBM__APL_by_Christian_Demmer_IBM.pdf
Fariborz Seyedloo
 
Process Mining Machine Recoveries to Reduce Downtime
Process Mining Machine Recoveries to Reduce DowntimeProcess Mining Machine Recoveries to Reduce Downtime
Process Mining Machine Recoveries to Reduce Downtime
Process mining Evangelist
 
What is ETL? Difference between ETL and ELT?.pdf
What is ETL? Difference between ETL and ELT?.pdfWhat is ETL? Difference between ETL and ELT?.pdf
What is ETL? Difference between ETL and ELT?.pdf
SaikatBasu37
 
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
Taqyea
 
Feature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record SystemsFeature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record Systems
Process mining Evangelist
 
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docxAnalysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
hershtara1
 
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial IntelligenceDr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug
 
Fundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithmsFundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithms
priyaiyerkbcsc
 
Controlling Financial Processes at a Municipality
Controlling Financial Processes at a MunicipalityControlling Financial Processes at a Municipality
Controlling Financial Processes at a Municipality
Process mining Evangelist
 
Automated Melanoma Detection via Image Processing.pptx
Automated Melanoma Detection via Image Processing.pptxAutomated Melanoma Detection via Image Processing.pptx
Automated Melanoma Detection via Image Processing.pptx
handrymaharjan23
 
L1_Slides_Foundational Concepts_508.pptx
L1_Slides_Foundational Concepts_508.pptxL1_Slides_Foundational Concepts_508.pptx
L1_Slides_Foundational Concepts_508.pptx
38NoopurPatel
 
hersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distributionhersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distribution
hershtara1
 
Sets theories and applications that can used to imporve knowledge
Sets theories and applications that can used to imporve knowledgeSets theories and applications that can used to imporve knowledge
Sets theories and applications that can used to imporve knowledge
saumyasl2020
 
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm     mmmmmfftro.pptxlecture_13 tree in mmmmmmmm     mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
sarajafffri058
 
Process Mining as Enabler for Digital Transformations
Process Mining as Enabler for Digital TransformationsProcess Mining as Enabler for Digital Transformations
Process Mining as Enabler for Digital Transformations
Process mining Evangelist
 
HershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistributionHershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistribution
hershtara1
 
AI ------------------------------ W1L2.pptx
AI ------------------------------ W1L2.pptxAI ------------------------------ W1L2.pptx
AI ------------------------------ W1L2.pptx
AyeshaJalil6
 
Understanding Complex Development Processes
Understanding Complex Development ProcessesUnderstanding Complex Development Processes
Understanding Complex Development Processes
Process mining Evangelist
 
50_questions_full.pptxdddddddddddddddddd
50_questions_full.pptxdddddddddddddddddd50_questions_full.pptxdddddddddddddddddd
50_questions_full.pptxdddddddddddddddddd
emir73065
 
2024 Digital Equity Accelerator Report.pdf
2024 Digital Equity Accelerator Report.pdf2024 Digital Equity Accelerator Report.pdf
2024 Digital Equity Accelerator Report.pdf
dominikamizerska1
 
Z14_IBM__APL_by_Christian_Demmer_IBM.pdf
Z14_IBM__APL_by_Christian_Demmer_IBM.pdfZ14_IBM__APL_by_Christian_Demmer_IBM.pdf
Z14_IBM__APL_by_Christian_Demmer_IBM.pdf
Fariborz Seyedloo
 
Process Mining Machine Recoveries to Reduce Downtime
Process Mining Machine Recoveries to Reduce DowntimeProcess Mining Machine Recoveries to Reduce Downtime
Process Mining Machine Recoveries to Reduce Downtime
Process mining Evangelist
 
What is ETL? Difference between ETL and ELT?.pdf
What is ETL? Difference between ETL and ELT?.pdfWhat is ETL? Difference between ETL and ELT?.pdf
What is ETL? Difference between ETL and ELT?.pdf
SaikatBasu37
 
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
Taqyea
 
Feature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record SystemsFeature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record Systems
Process mining Evangelist
 
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docxAnalysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
hershtara1
 
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial IntelligenceDr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug
 
Fundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithmsFundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithms
priyaiyerkbcsc
 
Controlling Financial Processes at a Municipality
Controlling Financial Processes at a MunicipalityControlling Financial Processes at a Municipality
Controlling Financial Processes at a Municipality
Process mining Evangelist
 
Automated Melanoma Detection via Image Processing.pptx
Automated Melanoma Detection via Image Processing.pptxAutomated Melanoma Detection via Image Processing.pptx
Automated Melanoma Detection via Image Processing.pptx
handrymaharjan23
 
L1_Slides_Foundational Concepts_508.pptx
L1_Slides_Foundational Concepts_508.pptxL1_Slides_Foundational Concepts_508.pptx
L1_Slides_Foundational Concepts_508.pptx
38NoopurPatel
 
hersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distributionhersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distribution
hershtara1
 
Sets theories and applications that can used to imporve knowledge
Sets theories and applications that can used to imporve knowledgeSets theories and applications that can used to imporve knowledge
Sets theories and applications that can used to imporve knowledge
saumyasl2020
 
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm     mmmmmfftro.pptxlecture_13 tree in mmmmmmmm     mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
sarajafffri058
 
Process Mining as Enabler for Digital Transformations
Process Mining as Enabler for Digital TransformationsProcess Mining as Enabler for Digital Transformations
Process Mining as Enabler for Digital Transformations
Process mining Evangelist
 
HershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistributionHershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistribution
hershtara1
 
AI ------------------------------ W1L2.pptx
AI ------------------------------ W1L2.pptxAI ------------------------------ W1L2.pptx
AI ------------------------------ W1L2.pptx
AyeshaJalil6
 
Ad

Tailoring machine learning practices to support prescriptive analytics

  • 1. Tailoring Machine Learning Practices to Support Prescriptive Analytics Anthony Melson Data Optimization Decision Science Induction OR Deduction Statistics What-If Business Processes Cost/Benefit NLP Classification Regression
  • 2. Narrowing the Scope Subject Matter • Models: Classifiers • Problems: Decision (Yes/No) Goals • Probabilistic and Label Outputs • Deterministic and Non- Deterministic Decision-Making Strategy 2: Incorporate Knowledge Prescriptive Use into the Machine Learning Pipeline Strategy 1: Design Classifiers For Both Types of Decisions
  • 3. A Closer Look at Classifiers
  • 4. Traditional Classifier Pipeline • Abstract • Accuracy • Indifference • Class Labels • Little Postprocessing
  • 5. Classifier Performance Many Metrics: TPR, FPR, Recall, Precision… Accuracy: (TP + TN) / Total Observations ROC Curve: Visualizes FPR & TPR Purpose: Optimize Threshold
  • 6. Two Types of Classifiers Label Classifier • Predicts Label • Doesn’t Account For Uncertainty • Label is a Decision • Modifiable Thresholds Probabilistic Classifier • Predicts Probability of Labels • Accounts for Uncertainty (Risk Scores) • Decisions Require Additional Steps Class Boundary Uncertain Space ~0 < x < ~1 Creditworthy Not CW
  • 7. A Look From Above Label Output Probability Output
  • 8. What Changes When We Consider Prescription
  • 9. Real-World Complexity Stakeholders • Risk Attitudes, Outcomes Organizations • Business Objectives Weighted Outcomes • FP FN; TP TN How Does ML Fit In?
  • 10. ML In Probabilistic Decisions ML Models Outcomes & Payouts Organizations Stakeholders
  • 11. ML In Deterministic Decisions ML Models Outcomes & Payouts Organizations Stakeholders Uncertain Decision Crew Works on Interior Crew Works on Exterior Will Rain Will Not
  • 12. Strategy 1 Label Output Probability Output Modifications Move Decision Threshold Pass Probabilities to Utility Functions Organizations Align Threshold with Objectives Account for Objectives as Utilities Weighted Outcomes Trade-off FP, TP, FN, FP Accordingly Risk Mitigation (Hedging) Stakeholders Account for Risk Attitudes Deliberation How Can ML Experts Respond to these Challenges?
  • 13. Decision Threshold Optimization Thresholding Options: • Risk Averse/Seeking • Maximization/Minimization • Class or Risk Focus Points of control: • Threshold Location • Optimize for outcome/s of interest Risk Averse Risk Seeking Maximization Strategy
  • 14. Example: Terminal Medical Diagnosis Context: • First of Three Benign Tests Stakes: • FN = Illness Goes Undetected • FP = Further Testing Risk Attitude: • Averse Organizational Objectives: • Patient Care Stakeholders: • Patient, Doctor… Course of Action: • Move Threshold Beyond Positive Threshold (in probabilities) Risk Averse Sent Home Further Testing
  • 15. Example: Terminal Medical Diagnosis (variation) Context: • Only one test Stakes: • FN = Illness Goes Undetected • FP = High-Risk Surgery Risk Attitude: • ? Organizational Objectives: • Patient Care Stakeholders: • Patient, Doctor… Course of Action: • ? Sent Home High-Risk Surgery ?
  • 16. Things to Think About Can Be a Max- Min-imization Tool • Threshold For Utility/EMV • Minimize Risk Order/Cost of Information • Sequence • Price/Risk Label Can Be Used in Deterministic Systems • Business Processes Great for Automated Decision
  • 17. Connection With Utility Function Advantages: • Hedge Decisions • Maximize Utility • Account for Risks of Multiple Decisions • Combine Outputs from Multiple Models • Individual or Batches • Assess Risk
  • 18. Example: Max Revenue for Wine Merchant Context: • Wine Merchant • Space for 30 New Wines Stakes: • Rev/mo for Bad Case = 200 • Rev/mo Good Case = 300 Organizational Objectives: • Maximize Revenue, Stock Shelves Stakeholders: • Merchants, Customers Course of Action: • Probability Good/Bad • Expected Value • Rank Wines • Buy Top 30 Iterate Iterate through list
  • 19. Example: Wine Merchant (Italian Variation) Context: • Wine Merchant • Shelf Space for 30 New Wines Stakes: • Rev/mo for Bad Case = 200 • Rev/mo Good Case = 300 Organizational Objectives: • Maximize Revenue, Stock Shelves, Stock Italian Stakeholders: • Merchants, Customers Course of Action: • Probability Good/Bad • Expected Value • Re-Weight Italian Wines • Rank Wines • Buy Top 30 Iterate Iterate through list
  • 20. Things to Think About Batch vs Individual • Calibration (Especially People) • Difference in Risk Attitudes Utilities Other Than Money • Ethics, Laws, Norms • Predictability • Health • Anything Hard to Put Monetary Value On
  • 21. Strategy 2 Label Output Probability Output Feature Selection IF Experiment: Include Custom Scoring IF Not: Evaluate with Custom Scoring Evaluate with Cross-Entropy/Log-Loss Possibly Others Parameter Tuning Loss Function Selection Trade-off FP, TP, FN, FP Accordingly Note: High-Risk Select Cross-Entropy/Log-Loss Model Selection Evaluate with Total Value OR Risk Attitude Evaluate with Cross-Entropy/Log-Loss
  • 22. Decisions in ML Pipeline • Feature Selection • Algorithm Selection • Loss Function • Parameter Tuning
  • 23. In Abstract • Evaluation Metrics (usually accuracy) • Previous Experience In Business Context • Based on Outcomes How do we make these decisions?
  • 24. Feature Selection Wrapper Methods • Builds models to select features • Selects highest scoring set Points of control: • Custom Scoring • Selection based on outcomes of interest
  • 25. Hyper- Parameter Tuning Search Types • Grid Search • Random Search • Many Others Points of control: • Custom Scoring • Selection based on outcomes of interest
  • 26. Loss Function Loss Functions • Cross-Entropy • Hinge Loss • Many Others Points of control: • Selection based on Use-Case • Selection based on outcomes • Generation based on outcomes • Note: Risky to modify
  • 27. Model Selection Selection Methods • Bias/Variance • Scoring Metrics • Many Others Points of control: • Custom Scoring • Selection based on outcomes • Most Important (Must Be Exclusive in Part)
  • 29. Conclusions Don’t Over-Focus on Accuracy • Outcomes • Context • Stakeholders • Organizations Keep the Use-Case in the Process • Choose the Right Classifier • Make Decisions Based on Application Work With Domain Experts and Prescriptive Analysts • Model Consumption/Utilization • Get Utilities and Risk Attitudes
  翻译: