SlideShare a Scribd company logo
Decision Support Analysis for Software Effort Estimation by Analogy Jingzhou Li Guenther Ruhe University of Calgary, Canada PROMISE’07, May 20, 2007
Outline Technology (evaluation) Which technology is suitable for which situations? What is the empirical evidence support of the decision? Software effort estimation by analogy (EBA) What are the optional methods for EBA? What are the basic decision-making problems? What are the empirical evidences to support the decision-making? Decision making Empirical studies Empirical study (an example   ) Decision–centric process model of EBA
1. Estimation by analogy —An introduction New Object EBA Effort estimate Historical data Three steps : 1. Search for analogs (similar objects) 2. Determine the closest analogs 3. Predict by analogy adaptation How many analogs should we use? What adaptation strategy should we use? What if there are missing values? What similarity measures should we use? 1. What are the basic tasks to accomplish for a user in order to apply or customize EBA? 2. What are the basic decision-making problems and their solution alternatives for applying or customizing EBA? v nm … v 1m a m … v ij … … r 2 e n ... v n1 r n e 1 … v 11 r 1 Effort … a 2 a 1 v gm a m ? … v g2 v g1 s g Effort … a 2 a 1
2. Decision-centric process model of EBA Processed Historical Data Raw Historical Data D8. Determining closest analogs  D2. Dealing with missing values D1. Impact analysis of missing values D7. Retrieving analogs Objects Under Estimation Effort Estimates D9. Analogy adaptation D11. Comparing EBA methods in general D10. Choosing evaluation criteria D6. Determining similarity measures  D3. Object selection  D5. Attribute weighting & selection  D4. Discretization of attributes
3.  Decision problems of EBA and solution alternatives  where   Si.j  represent the   j th   solution alternative of decision problem   Di   Accuracy-based methods EBA comparison methods in General D11 Some conventional criteria: e.g. MMRE, Pred Choosing evaluation criteria D10 Mean, weighted mean, linear extrapolation Analogy adaptation strategy D9 Fixed number of  analogs without considering similarity measure; through learning process Determining closest analogs D8 Using similarity measures or rule-based heuristics Retrieving analogs D7 Distance-based, local-global similarity principle  Determining similarity measures D6 S5.1—Brute-force attribute selection S5.2—WRAPPER attribute selection S5.3—Rough Sets based attribute selection S5.4—Attribute weighting using regression S5.5—Attribute weighting using genetic algorithm S5.6-S5.9—Attribute weighting using Rough Sets (heuristic H 1  to H 4 ) Attribute weighting and selection D5 For RSA-based attribute weighting; Based on interval, frequency, or both; other techniques used in machine learning Discretization of continuous attributes  D4 Hill climbing,  simulated annealing, forward and backward sequential selection algorithms Object selection D3 Deletion and imputation techniques; NULL value Dealing with missing values D2 Preliminary knowledge Impact analysis of missing values D1 Typical solution alternatives Decision problems ID
3.  Decision problems of EBA and solution alternatives  General form of EBA: EBA =   F   (D1, D2, …, D11) where domain of Di : {Si.j} – solution alternatives of Di F   is an amalgamation function Customization of EBA: A specific EBA is obtained for a given data set  DB  by using a (set of) specific solution alternatives Si.j of Di and  aggregated through function  F. EBA( DB ) =   F   (D1, D2, …, D11,  DB )
3.  Decision problems of EBA and solution alternatives --Customization of EBA   EBA =  F  (D1, D2, …, D11) Data set type 1 Data set type 2 Data set type k …… Classification according to characteristics of the data sets Si.j for Di? 3. How empirical study can be used to support the decision-making regarding the customization of EBA? Customization 1 Customization 2 Customization k
4. Decision support in an example EBA method —AQUA + Effort estimates Data set for  AQUA + Learned accuracy distribution Attributes & weights Raw historical data Pre-process (missing value, attribute type…) Pre-Phase (D2, D6) (D4, D5) (D8) (D7, D9) Objects under estimation S2.3:  NULL  value S6.5: local-global similarity, weighted mean of local-similarity measures S4.2: equal frequency and equal width discretization S5.6-S5.9: RSA-based attribute weighting, heuristics H 1 -H 4 S7.1: similarity measure S8.2: learning process S9.1: adaptation using weighted mean General form of AQUA + : AQUA +  =   F   (D2(S2.3), D4(S4.2), D5(S5.6),  D6(S6.5), D7(S7.1), D8(S8.2), D9(S9.1)) For a specific type of data set  DB : AQUA +  ( DB ) = ?  e.g. S5.6-S5.9: H 1 -H 4 ? AQUA + Learning Phase1 Predicting Phase2 Attribute weighting and selection  Phase0
Data sets used in the comparative study 4. Decision support in an example EBA method  —Comparative study Mendes et al., 2003  0 0 6 34 Mends03 Kemerer et al., 1987 40 0 5 15 Kem87 ISBSG, 2004 63 27.24 24 158 ISBSG04-2 Jingzhou et al., 2005  71 6.8 14 76 USP05-FT Jingzhou et al., 2005  71 2.54 14 121 USP05-RQ Source %Non-Quantitative Attributes %Missing Values #Attributes #Objects Data Sets
Comparison of the four attribute weighting heuristics 2. H 1  performed better than H 0  for all data sets, hence is recommended for use in AQUA + . Tentative conclusions: 1. H 1  and H 3  performed the best, hence RSA-based attribute weighting is recommended for use by AQUA + . 4. Decision support in an example EBA method  —Comparative study H 4 H 3 H 2 H 1 H 0 -0.05 -0.05 − 0.15 -0.09 Kem87 0.15 0.62 − 0.03 -0.79 USP05-RQ 0.37 0.52 -1.53 0.42 0.22 USP05-FT 0.35 0.30 -2.62 1.81 0.16 ISBSG04-2 -0.47 AccuH[i] -0.48 -0.47 1.42 1.42 Mends03 Weighting  Heuristics Data sets
4. Decision support in an example EBA method  —Apply the knowledge obtained from the comparative study H 3  is suitable for this class H 1  is suitable for this class New data set Which heuristic should be used? H 1  is suitable for this class
5. Decision support and empirical studies Application or customization  of EBA e.g. Knowledge about which alternatives are suitable for which types of data set New Data Set Knowledge base DSS for EBA Empirical studies Apply knowledge Classify Customize EBA
6. Summary and future work Decision-centric process model   Decision support Decision problems and solution alternatives   Example EBA AQUA + Empirical studies Knowledge base DSS for EBA
Major references G. Ruhe, "Software Engineering Decision Support—A New Paradigm for Learning Software Organizations",  Advances in Learning Software Organization , Lecture Notes In Computer Science, Vol. 2640, Springer 2003, pp 104-115. V.R. Basili, G. Caldiera, and H.D. Rombach, "Experience Factory",  Encyclopedia of Software Engineering  (Eds. J. Marciniak), Vol. 1, 2001, pp 511-519.  G. Ruhe, "Software Engineering Decision Support and Empirical Investigations - A Proposed Marriage",  The Future of Empirical Studies in Software Engineering  (A. Jedlitschka, M. Ciolkowski, Eds.), Workshop Serious on Empirical Studies in Software Engineering, Vol. 2, 2003, pp 25-34. M. Shepperd, C. Schofield, “Estimating Software Project Effort Using Analogies”,  IEEE Transactions on Software Engineering , 23(1997) 736-743. J.Z. Li, G. Ruhe, A. Al-Emran, and M.M. Ritcher, "A Flexible Method for Effort Estimation by Analogy",  Empirical Software Engineering , Vol. 12, No. 1, 2007, pp 65-106.  J.Z. Li, G. Ruhe, "Software Effort Estimation by Analogy Using Attribute Weighting Based on Rough Sets",  International Journal of Software Engineering and Knowledge Engineering , To appear.  J.Z. Li, A. Ahmed, G. Ruhe, "Impact Analysis of Missing Values on the Prediction Accuracy of Analogy-based Software Estimation Method AQUA", ESEM’07, Madrid, Spain, September 2007.
Thank you !  Comments and questions?
A preliminary DSS framework for EBA Machine learning and reasoning tools Virtual DB Virtual KB … Interface Database Documents Web contents, hypermedia Model base Other forms of contents Rule base Domain knowledge Dealing with missing values Attribute weighting and selection Discretization of attributes General EBA comparison methods … Object selection Determining similarity measures Retrieving & determining analogs Analogy adaptation strategy … Knowledge representation and acquisition General data analysis tools Decision-centric EBA process Objects under estimation Effort estimates
Ad

More Related Content

What's hot (20)

Complexity Measures for Secure Service-Orieted Software Architectures
Complexity Measures for Secure Service-Orieted Software ArchitecturesComplexity Measures for Secure Service-Orieted Software Architectures
Complexity Measures for Secure Service-Orieted Software Architectures
Tim Menzies
 
Using Developer Information as a Prediction Factor
Using Developer Information as a Prediction FactorUsing Developer Information as a Prediction Factor
Using Developer Information as a Prediction Factor
Tim Menzies
 
Mca se chapter_9_formal_methods
Mca se chapter_9_formal_methodsMca se chapter_9_formal_methods
Mca se chapter_9_formal_methods
Aman Adhikari
 
Model Checking in Formal Methods
Model Checking in Formal MethodsModel Checking in Formal Methods
Model Checking in Formal Methods
Sana Rahim
 
June 2010 exam questions and answers
June 2010   exam questions and answersJune 2010   exam questions and answers
June 2010 exam questions and answers
DamonTauroa
 
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
CS, NcState
 
Introduction to formal methods
Introduction to formal methodsIntroduction to formal methods
Introduction to formal methods
Inzemamul Haque
 
Evolutionary Search Techniques with Strong Heuristics for Multi-Objective Fea...
Evolutionary Search Techniques with Strong Heuristics for Multi-Objective Fea...Evolutionary Search Techniques with Strong Heuristics for Multi-Objective Fea...
Evolutionary Search Techniques with Strong Heuristics for Multi-Objective Fea...
Abdel Salam Sayyad
 
Slides chapters 28-32
Slides chapters 28-32Slides chapters 28-32
Slides chapters 28-32
Priyanka Shetty
 
Practical Constraint Solving for Generating System Test Data
Practical Constraint Solving for Generating System Test DataPractical Constraint Solving for Generating System Test Data
Practical Constraint Solving for Generating System Test Data
Lionel Briand
 
Formal Methods
Formal MethodsFormal Methods
Formal Methods
HendMuhammad
 
Star Master Cocomo07
Star Master Cocomo07Star Master Cocomo07
Star Master Cocomo07
CS, NcState
 
Principles of effort estimation
Principles of effort estimationPrinciples of effort estimation
Principles of effort estimation
CS, NcState
 
Midterm Exam Solutions Fall03
Midterm Exam Solutions Fall03Midterm Exam Solutions Fall03
Midterm Exam Solutions Fall03
Radu_Negulescu
 
Keynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based TestingKeynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based Testing
Lionel Briand
 
Final Exam Questions Fall03
Final Exam Questions Fall03Final Exam Questions Fall03
Final Exam Questions Fall03
Radu_Negulescu
 
Make the Most of Your Time: How Should the Analyst Work with Automated Tracea...
Make the Most of Your Time: How Should the Analyst Work with Automated Tracea...Make the Most of Your Time: How Should the Analyst Work with Automated Tracea...
Make the Most of Your Time: How Should the Analyst Work with Automated Tracea...
Tim Menzies
 
Scenario $4$
Scenario $4$Scenario $4$
Scenario $4$
Jason121
 
Painful Test Estimation
Painful Test EstimationPainful Test Estimation
Painful Test Estimation
GlobalLogic Ukraine
 
Research-Based Innovation with Industry: Project Experience and Lessons Learned
Research-Based Innovation with Industry: Project Experience and Lessons LearnedResearch-Based Innovation with Industry: Project Experience and Lessons Learned
Research-Based Innovation with Industry: Project Experience and Lessons Learned
Lionel Briand
 
Complexity Measures for Secure Service-Orieted Software Architectures
Complexity Measures for Secure Service-Orieted Software ArchitecturesComplexity Measures for Secure Service-Orieted Software Architectures
Complexity Measures for Secure Service-Orieted Software Architectures
Tim Menzies
 
Using Developer Information as a Prediction Factor
Using Developer Information as a Prediction FactorUsing Developer Information as a Prediction Factor
Using Developer Information as a Prediction Factor
Tim Menzies
 
Mca se chapter_9_formal_methods
Mca se chapter_9_formal_methodsMca se chapter_9_formal_methods
Mca se chapter_9_formal_methods
Aman Adhikari
 
Model Checking in Formal Methods
Model Checking in Formal MethodsModel Checking in Formal Methods
Model Checking in Formal Methods
Sana Rahim
 
June 2010 exam questions and answers
June 2010   exam questions and answersJune 2010   exam questions and answers
June 2010 exam questions and answers
DamonTauroa
 
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
CS, NcState
 
Introduction to formal methods
Introduction to formal methodsIntroduction to formal methods
Introduction to formal methods
Inzemamul Haque
 
Evolutionary Search Techniques with Strong Heuristics for Multi-Objective Fea...
Evolutionary Search Techniques with Strong Heuristics for Multi-Objective Fea...Evolutionary Search Techniques with Strong Heuristics for Multi-Objective Fea...
Evolutionary Search Techniques with Strong Heuristics for Multi-Objective Fea...
Abdel Salam Sayyad
 
Practical Constraint Solving for Generating System Test Data
Practical Constraint Solving for Generating System Test DataPractical Constraint Solving for Generating System Test Data
Practical Constraint Solving for Generating System Test Data
Lionel Briand
 
Star Master Cocomo07
Star Master Cocomo07Star Master Cocomo07
Star Master Cocomo07
CS, NcState
 
Principles of effort estimation
Principles of effort estimationPrinciples of effort estimation
Principles of effort estimation
CS, NcState
 
Midterm Exam Solutions Fall03
Midterm Exam Solutions Fall03Midterm Exam Solutions Fall03
Midterm Exam Solutions Fall03
Radu_Negulescu
 
Keynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based TestingKeynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based Testing
Lionel Briand
 
Final Exam Questions Fall03
Final Exam Questions Fall03Final Exam Questions Fall03
Final Exam Questions Fall03
Radu_Negulescu
 
Make the Most of Your Time: How Should the Analyst Work with Automated Tracea...
Make the Most of Your Time: How Should the Analyst Work with Automated Tracea...Make the Most of Your Time: How Should the Analyst Work with Automated Tracea...
Make the Most of Your Time: How Should the Analyst Work with Automated Tracea...
Tim Menzies
 
Scenario $4$
Scenario $4$Scenario $4$
Scenario $4$
Jason121
 
Research-Based Innovation with Industry: Project Experience and Lessons Learned
Research-Based Innovation with Industry: Project Experience and Lessons LearnedResearch-Based Innovation with Industry: Project Experience and Lessons Learned
Research-Based Innovation with Industry: Project Experience and Lessons Learned
Lionel Briand
 

Similar to Decision Support Analyss for Software Effort Estimation by Analogy (20)

Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
gregoryg
 
Research proposal
Research proposalResearch proposal
Research proposal
Sadia Sharmin
 
Review of Algorithms for Crime Analysis & Prediction
Review of Algorithms for Crime Analysis & PredictionReview of Algorithms for Crime Analysis & Prediction
Review of Algorithms for Crime Analysis & Prediction
IRJET Journal
 
Part 1
Part 1Part 1
Part 1
butest
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...
Gurdal Ertek
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...
ertekg
 
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning ModelChinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Lifeng (Aaron) Han
 
Exploratory Data Analysis - NIST eHandbook of Statistical Methods-out.pdf
Exploratory Data Analysis - NIST eHandbook of Statistical Methods-out.pdfExploratory Data Analysis - NIST eHandbook of Statistical Methods-out.pdf
Exploratory Data Analysis - NIST eHandbook of Statistical Methods-out.pdf
lsharkey602
 
Data analytcis-first-steps
Data analytcis-first-stepsData analytcis-first-steps
Data analytcis-first-steps
Shesha R
 
How to write an academic paper by a Bulgarian teacher
How to write an academic paper by a Bulgarian teacherHow to write an academic paper by a Bulgarian teacher
How to write an academic paper by a Bulgarian teacher
oriceebun
 
Hypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsHypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining Algorithms
IJERA Editor
 
Query Aware Determinization of Uncertain Objects
Query Aware Determinization of Uncertain ObjectsQuery Aware Determinization of Uncertain Objects
Query Aware Determinization of Uncertain Objects
nexgentechnology
 
Query aware determinization of uncertain
Query aware determinization of uncertainQuery aware determinization of uncertain
Query aware determinization of uncertain
nexgentech15
 
QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
 QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
Nexgen Technology
 
Current Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationCurrent Approaches in Search Result Diversification
Current Approaches in Search Result Diversification
Mario Sangiorgio
 
Query aware determinization of uncertain
Query aware determinization of uncertainQuery aware determinization of uncertain
Query aware determinization of uncertain
Shakas Technologies
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
Dmitry Grapov
 
Data Science research methodology & processes
Data Science research methodology & processesData Science research methodology & processes
Data Science research methodology & processes
IPMCBIT
 
313 IDS _Course_Introduction_PPT.pptx
313 IDS _Course_Introduction_PPT.pptx313 IDS _Course_Introduction_PPT.pptx
313 IDS _Course_Introduction_PPT.pptx
sameernsn1
 
DIRECTIONS READ THE FOLLOWING STUDENT POST AND RESPOND EVALUATE I.docx
DIRECTIONS READ THE FOLLOWING STUDENT POST AND RESPOND EVALUATE I.docxDIRECTIONS READ THE FOLLOWING STUDENT POST AND RESPOND EVALUATE I.docx
DIRECTIONS READ THE FOLLOWING STUDENT POST AND RESPOND EVALUATE I.docx
lynettearnold46882
 
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
gregoryg
 
Review of Algorithms for Crime Analysis & Prediction
Review of Algorithms for Crime Analysis & PredictionReview of Algorithms for Crime Analysis & Prediction
Review of Algorithms for Crime Analysis & Prediction
IRJET Journal
 
Part 1
Part 1Part 1
Part 1
butest
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...
Gurdal Ertek
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...
ertekg
 
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning ModelChinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Lifeng (Aaron) Han
 
Exploratory Data Analysis - NIST eHandbook of Statistical Methods-out.pdf
Exploratory Data Analysis - NIST eHandbook of Statistical Methods-out.pdfExploratory Data Analysis - NIST eHandbook of Statistical Methods-out.pdf
Exploratory Data Analysis - NIST eHandbook of Statistical Methods-out.pdf
lsharkey602
 
Data analytcis-first-steps
Data analytcis-first-stepsData analytcis-first-steps
Data analytcis-first-steps
Shesha R
 
How to write an academic paper by a Bulgarian teacher
How to write an academic paper by a Bulgarian teacherHow to write an academic paper by a Bulgarian teacher
How to write an academic paper by a Bulgarian teacher
oriceebun
 
Hypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsHypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining Algorithms
IJERA Editor
 
Query Aware Determinization of Uncertain Objects
Query Aware Determinization of Uncertain ObjectsQuery Aware Determinization of Uncertain Objects
Query Aware Determinization of Uncertain Objects
nexgentechnology
 
Query aware determinization of uncertain
Query aware determinization of uncertainQuery aware determinization of uncertain
Query aware determinization of uncertain
nexgentech15
 
QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
 QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
Nexgen Technology
 
Current Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationCurrent Approaches in Search Result Diversification
Current Approaches in Search Result Diversification
Mario Sangiorgio
 
Query aware determinization of uncertain
Query aware determinization of uncertainQuery aware determinization of uncertain
Query aware determinization of uncertain
Shakas Technologies
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
Dmitry Grapov
 
Data Science research methodology & processes
Data Science research methodology & processesData Science research methodology & processes
Data Science research methodology & processes
IPMCBIT
 
313 IDS _Course_Introduction_PPT.pptx
313 IDS _Course_Introduction_PPT.pptx313 IDS _Course_Introduction_PPT.pptx
313 IDS _Course_Introduction_PPT.pptx
sameernsn1
 
DIRECTIONS READ THE FOLLOWING STUDENT POST AND RESPOND EVALUATE I.docx
DIRECTIONS READ THE FOLLOWING STUDENT POST AND RESPOND EVALUATE I.docxDIRECTIONS READ THE FOLLOWING STUDENT POST AND RESPOND EVALUATE I.docx
DIRECTIONS READ THE FOLLOWING STUDENT POST AND RESPOND EVALUATE I.docx
lynettearnold46882
 
Ad

More from Tim Menzies (6)

Modeling the Effect of Size of Defect Proneness for Open-Source Software
Modeling the Effect of Size of Defect Proneness for Open-Source SoftwareModeling the Effect of Size of Defect Proneness for Open-Source Software
Modeling the Effect of Size of Defect Proneness for Open-Source Software
Tim Menzies
 
Assessing the Reliability of a Human Estimator
Assessing the Reliability of a Human EstimatorAssessing the Reliability of a Human Estimator
Assessing the Reliability of a Human Estimator
Tim Menzies
 
Project Data Incorporating Qualitative Factors for Improved Software Defect P...
Project Data Incorporating Qualitative Factors for Improved Software Defect P...Project Data Incorporating Qualitative Factors for Improved Software Defect P...
Project Data Incorporating Qualitative Factors for Improved Software Defect P...
Tim Menzies
 
Promise Keynote
Promise KeynotePromise Keynote
Promise Keynote
Tim Menzies
 
Adequate and Precise Evaluation of Predictive Models in Software Engineering ...
Adequate and Precise Evaluation of Predictive Models in Software Engineering ...Adequate and Precise Evaluation of Predictive Models in Software Engineering ...
Adequate and Precise Evaluation of Predictive Models in Software Engineering ...
Tim Menzies
 
Predicting Defects for Eclipse
Predicting Defects for EclipsePredicting Defects for Eclipse
Predicting Defects for Eclipse
Tim Menzies
 
Modeling the Effect of Size of Defect Proneness for Open-Source Software
Modeling the Effect of Size of Defect Proneness for Open-Source SoftwareModeling the Effect of Size of Defect Proneness for Open-Source Software
Modeling the Effect of Size of Defect Proneness for Open-Source Software
Tim Menzies
 
Assessing the Reliability of a Human Estimator
Assessing the Reliability of a Human EstimatorAssessing the Reliability of a Human Estimator
Assessing the Reliability of a Human Estimator
Tim Menzies
 
Project Data Incorporating Qualitative Factors for Improved Software Defect P...
Project Data Incorporating Qualitative Factors for Improved Software Defect P...Project Data Incorporating Qualitative Factors for Improved Software Defect P...
Project Data Incorporating Qualitative Factors for Improved Software Defect P...
Tim Menzies
 
Adequate and Precise Evaluation of Predictive Models in Software Engineering ...
Adequate and Precise Evaluation of Predictive Models in Software Engineering ...Adequate and Precise Evaluation of Predictive Models in Software Engineering ...
Adequate and Precise Evaluation of Predictive Models in Software Engineering ...
Tim Menzies
 
Predicting Defects for Eclipse
Predicting Defects for EclipsePredicting Defects for Eclipse
Predicting Defects for Eclipse
Tim Menzies
 
Ad

Recently uploaded (20)

WRN_Investor_Presentation_May 2025 Update.pdf
WRN_Investor_Presentation_May 2025 Update.pdfWRN_Investor_Presentation_May 2025 Update.pdf
WRN_Investor_Presentation_May 2025 Update.pdf
cmagee4
 
Mastering Crypto Security: How GXCYPX Solutions Help Prevent Social Engineeri...
Mastering Crypto Security: How GXCYPX Solutions Help Prevent Social Engineeri...Mastering Crypto Security: How GXCYPX Solutions Help Prevent Social Engineeri...
Mastering Crypto Security: How GXCYPX Solutions Help Prevent Social Engineeri...
gxcypx
 
Economic_Planning_and_Development_by_CA_Suvidha_Chaplot.pdf
Economic_Planning_and_Development_by_CA_Suvidha_Chaplot.pdfEconomic_Planning_and_Development_by_CA_Suvidha_Chaplot.pdf
Economic_Planning_and_Development_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
How To Trade Synthetic Indecies 2025 edition
How To Trade Synthetic Indecies 2025 editionHow To Trade Synthetic Indecies 2025 edition
How To Trade Synthetic Indecies 2025 edition
Vince Stanzione
 
report in economics abcdefghijklmnop.pptx
report in economics abcdefghijklmnop.pptxreport in economics abcdefghijklmnop.pptx
report in economics abcdefghijklmnop.pptx
msalvador21
 
Cryptocurrency: The Future of Monetary Policy.pptx
Cryptocurrency: The Future of Monetary Policy.pptxCryptocurrency: The Future of Monetary Policy.pptx
Cryptocurrency: The Future of Monetary Policy.pptx
RockstarSnehadeep
 
Lundin Gold Q1 2025 Conference Call Presentation
Lundin Gold Q1 2025 Conference Call PresentationLundin Gold Q1 2025 Conference Call Presentation
Lundin Gold Q1 2025 Conference Call Presentation
Adnet Communications
 
An indepth study of behavioral finances.
An indepth study of behavioral finances.An indepth study of behavioral finances.
An indepth study of behavioral finances.
Khushboo Dange
 
GCF - Master Presentation Buyside - UK - 0525 - GCF.pdf
GCF - Master Presentation Buyside - UK - 0525 - GCF.pdfGCF - Master Presentation Buyside - UK - 0525 - GCF.pdf
GCF - Master Presentation Buyside - UK - 0525 - GCF.pdf
hkmd5mqzjb
 
JoinM2020_FTBEmembers.pptx.SIDEEVENTTpdf
JoinM2020_FTBEmembers.pptx.SIDEEVENTTpdfJoinM2020_FTBEmembers.pptx.SIDEEVENTTpdf
JoinM2020_FTBEmembers.pptx.SIDEEVENTTpdf
FinTech Belgium
 
Virtual-Galaxy-Infotech-IPO-GMP-An-Overview.pptx
Virtual-Galaxy-Infotech-IPO-GMP-An-Overview.pptxVirtual-Galaxy-Infotech-IPO-GMP-An-Overview.pptx
Virtual-Galaxy-Infotech-IPO-GMP-An-Overview.pptx
Sahard finowings
 
GCF - Our added value in F&B sector 0525.pdf
GCF - Our added value in F&B sector 0525.pdfGCF - Our added value in F&B sector 0525.pdf
GCF - Our added value in F&B sector 0525.pdf
dianepioux1
 
Telegraph - 'Rachel Reeves paves way for Dutch-style pensions' May 2025 2.docx
Telegraph - 'Rachel Reeves paves way for Dutch-style pensions' May 2025 2.docxTelegraph - 'Rachel Reeves paves way for Dutch-style pensions' May 2025 2.docx
Telegraph - 'Rachel Reeves paves way for Dutch-style pensions' May 2025 2.docx
Henry Tapper
 
New Monthly Enterprises Survey. Issue 34. (02.2025) Ukrainian Business in War...
New Monthly Enterprises Survey. Issue 34. (02.2025) Ukrainian Business in War...New Monthly Enterprises Survey. Issue 34. (02.2025) Ukrainian Business in War...
New Monthly Enterprises Survey. Issue 34. (02.2025) Ukrainian Business in War...
Інститут економічних досліджень та політичних консультацій
 
Моніторинг ІТ сектору України GET_UKR_PB_03_2025-1.pdf
Моніторинг ІТ сектору України GET_UKR_PB_03_2025-1.pdfМоніторинг ІТ сектору України GET_UKR_PB_03_2025-1.pdf
Моніторинг ІТ сектору України GET_UKR_PB_03_2025-1.pdf
Інститут економічних досліджень та політичних консультацій
 
2025 0507 Macro Trends and their impact on Enterprise AI.pptx
2025 0507 Macro Trends and their impact on Enterprise AI.pptx2025 0507 Macro Trends and their impact on Enterprise AI.pptx
2025 0507 Macro Trends and their impact on Enterprise AI.pptx
Sunil Grover
 
GCF - Our Added Value in Mobility & Environment Sector 0525.pdf
GCF - Our Added Value in Mobility & Environment Sector 0525.pdfGCF - Our Added Value in Mobility & Environment Sector 0525.pdf
GCF - Our Added Value in Mobility & Environment Sector 0525.pdf
dianepioux1
 
Ethereum's Market Value Soars Through JQRBT-Level Growth, Overtaking Major Co...
Ethereum's Market Value Soars Through JQRBT-Level Growth, Overtaking Major Co...Ethereum's Market Value Soars Through JQRBT-Level Growth, Overtaking Major Co...
Ethereum's Market Value Soars Through JQRBT-Level Growth, Overtaking Major Co...
jqrbt
 
GCF - Master Presentation Sellside - UK - 0525.pdf
GCF - Master Presentation Sellside - UK - 0525.pdfGCF - Master Presentation Sellside - UK - 0525.pdf
GCF - Master Presentation Sellside - UK - 0525.pdf
hkmd5mqzjb
 
GCF - Our added Value in Cosmetics & Fragrances sector 0425.pdf
GCF - Our added Value in Cosmetics & Fragrances sector 0425.pdfGCF - Our added Value in Cosmetics & Fragrances sector 0425.pdf
GCF - Our added Value in Cosmetics & Fragrances sector 0425.pdf
hkmd5mqzjb
 
WRN_Investor_Presentation_May 2025 Update.pdf
WRN_Investor_Presentation_May 2025 Update.pdfWRN_Investor_Presentation_May 2025 Update.pdf
WRN_Investor_Presentation_May 2025 Update.pdf
cmagee4
 
Mastering Crypto Security: How GXCYPX Solutions Help Prevent Social Engineeri...
Mastering Crypto Security: How GXCYPX Solutions Help Prevent Social Engineeri...Mastering Crypto Security: How GXCYPX Solutions Help Prevent Social Engineeri...
Mastering Crypto Security: How GXCYPX Solutions Help Prevent Social Engineeri...
gxcypx
 
Economic_Planning_and_Development_by_CA_Suvidha_Chaplot.pdf
Economic_Planning_and_Development_by_CA_Suvidha_Chaplot.pdfEconomic_Planning_and_Development_by_CA_Suvidha_Chaplot.pdf
Economic_Planning_and_Development_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
How To Trade Synthetic Indecies 2025 edition
How To Trade Synthetic Indecies 2025 editionHow To Trade Synthetic Indecies 2025 edition
How To Trade Synthetic Indecies 2025 edition
Vince Stanzione
 
report in economics abcdefghijklmnop.pptx
report in economics abcdefghijklmnop.pptxreport in economics abcdefghijklmnop.pptx
report in economics abcdefghijklmnop.pptx
msalvador21
 
Cryptocurrency: The Future of Monetary Policy.pptx
Cryptocurrency: The Future of Monetary Policy.pptxCryptocurrency: The Future of Monetary Policy.pptx
Cryptocurrency: The Future of Monetary Policy.pptx
RockstarSnehadeep
 
Lundin Gold Q1 2025 Conference Call Presentation
Lundin Gold Q1 2025 Conference Call PresentationLundin Gold Q1 2025 Conference Call Presentation
Lundin Gold Q1 2025 Conference Call Presentation
Adnet Communications
 
An indepth study of behavioral finances.
An indepth study of behavioral finances.An indepth study of behavioral finances.
An indepth study of behavioral finances.
Khushboo Dange
 
GCF - Master Presentation Buyside - UK - 0525 - GCF.pdf
GCF - Master Presentation Buyside - UK - 0525 - GCF.pdfGCF - Master Presentation Buyside - UK - 0525 - GCF.pdf
GCF - Master Presentation Buyside - UK - 0525 - GCF.pdf
hkmd5mqzjb
 
JoinM2020_FTBEmembers.pptx.SIDEEVENTTpdf
JoinM2020_FTBEmembers.pptx.SIDEEVENTTpdfJoinM2020_FTBEmembers.pptx.SIDEEVENTTpdf
JoinM2020_FTBEmembers.pptx.SIDEEVENTTpdf
FinTech Belgium
 
Virtual-Galaxy-Infotech-IPO-GMP-An-Overview.pptx
Virtual-Galaxy-Infotech-IPO-GMP-An-Overview.pptxVirtual-Galaxy-Infotech-IPO-GMP-An-Overview.pptx
Virtual-Galaxy-Infotech-IPO-GMP-An-Overview.pptx
Sahard finowings
 
GCF - Our added value in F&B sector 0525.pdf
GCF - Our added value in F&B sector 0525.pdfGCF - Our added value in F&B sector 0525.pdf
GCF - Our added value in F&B sector 0525.pdf
dianepioux1
 
Telegraph - 'Rachel Reeves paves way for Dutch-style pensions' May 2025 2.docx
Telegraph - 'Rachel Reeves paves way for Dutch-style pensions' May 2025 2.docxTelegraph - 'Rachel Reeves paves way for Dutch-style pensions' May 2025 2.docx
Telegraph - 'Rachel Reeves paves way for Dutch-style pensions' May 2025 2.docx
Henry Tapper
 
2025 0507 Macro Trends and their impact on Enterprise AI.pptx
2025 0507 Macro Trends and their impact on Enterprise AI.pptx2025 0507 Macro Trends and their impact on Enterprise AI.pptx
2025 0507 Macro Trends and their impact on Enterprise AI.pptx
Sunil Grover
 
GCF - Our Added Value in Mobility & Environment Sector 0525.pdf
GCF - Our Added Value in Mobility & Environment Sector 0525.pdfGCF - Our Added Value in Mobility & Environment Sector 0525.pdf
GCF - Our Added Value in Mobility & Environment Sector 0525.pdf
dianepioux1
 
Ethereum's Market Value Soars Through JQRBT-Level Growth, Overtaking Major Co...
Ethereum's Market Value Soars Through JQRBT-Level Growth, Overtaking Major Co...Ethereum's Market Value Soars Through JQRBT-Level Growth, Overtaking Major Co...
Ethereum's Market Value Soars Through JQRBT-Level Growth, Overtaking Major Co...
jqrbt
 
GCF - Master Presentation Sellside - UK - 0525.pdf
GCF - Master Presentation Sellside - UK - 0525.pdfGCF - Master Presentation Sellside - UK - 0525.pdf
GCF - Master Presentation Sellside - UK - 0525.pdf
hkmd5mqzjb
 
GCF - Our added Value in Cosmetics & Fragrances sector 0425.pdf
GCF - Our added Value in Cosmetics & Fragrances sector 0425.pdfGCF - Our added Value in Cosmetics & Fragrances sector 0425.pdf
GCF - Our added Value in Cosmetics & Fragrances sector 0425.pdf
hkmd5mqzjb
 

Decision Support Analyss for Software Effort Estimation by Analogy

  • 1. Decision Support Analysis for Software Effort Estimation by Analogy Jingzhou Li Guenther Ruhe University of Calgary, Canada PROMISE’07, May 20, 2007
  • 2. Outline Technology (evaluation) Which technology is suitable for which situations? What is the empirical evidence support of the decision? Software effort estimation by analogy (EBA) What are the optional methods for EBA? What are the basic decision-making problems? What are the empirical evidences to support the decision-making? Decision making Empirical studies Empirical study (an example ) Decision–centric process model of EBA
  • 3. 1. Estimation by analogy —An introduction New Object EBA Effort estimate Historical data Three steps : 1. Search for analogs (similar objects) 2. Determine the closest analogs 3. Predict by analogy adaptation How many analogs should we use? What adaptation strategy should we use? What if there are missing values? What similarity measures should we use? 1. What are the basic tasks to accomplish for a user in order to apply or customize EBA? 2. What are the basic decision-making problems and their solution alternatives for applying or customizing EBA? v nm … v 1m a m … v ij … … r 2 e n ... v n1 r n e 1 … v 11 r 1 Effort … a 2 a 1 v gm a m ? … v g2 v g1 s g Effort … a 2 a 1
  • 4. 2. Decision-centric process model of EBA Processed Historical Data Raw Historical Data D8. Determining closest analogs D2. Dealing with missing values D1. Impact analysis of missing values D7. Retrieving analogs Objects Under Estimation Effort Estimates D9. Analogy adaptation D11. Comparing EBA methods in general D10. Choosing evaluation criteria D6. Determining similarity measures D3. Object selection D5. Attribute weighting & selection D4. Discretization of attributes
  • 5. 3. Decision problems of EBA and solution alternatives where Si.j represent the j th solution alternative of decision problem Di Accuracy-based methods EBA comparison methods in General D11 Some conventional criteria: e.g. MMRE, Pred Choosing evaluation criteria D10 Mean, weighted mean, linear extrapolation Analogy adaptation strategy D9 Fixed number of analogs without considering similarity measure; through learning process Determining closest analogs D8 Using similarity measures or rule-based heuristics Retrieving analogs D7 Distance-based, local-global similarity principle Determining similarity measures D6 S5.1—Brute-force attribute selection S5.2—WRAPPER attribute selection S5.3—Rough Sets based attribute selection S5.4—Attribute weighting using regression S5.5—Attribute weighting using genetic algorithm S5.6-S5.9—Attribute weighting using Rough Sets (heuristic H 1 to H 4 ) Attribute weighting and selection D5 For RSA-based attribute weighting; Based on interval, frequency, or both; other techniques used in machine learning Discretization of continuous attributes D4 Hill climbing, simulated annealing, forward and backward sequential selection algorithms Object selection D3 Deletion and imputation techniques; NULL value Dealing with missing values D2 Preliminary knowledge Impact analysis of missing values D1 Typical solution alternatives Decision problems ID
  • 6. 3. Decision problems of EBA and solution alternatives General form of EBA: EBA = F (D1, D2, …, D11) where domain of Di : {Si.j} – solution alternatives of Di F is an amalgamation function Customization of EBA: A specific EBA is obtained for a given data set DB by using a (set of) specific solution alternatives Si.j of Di and aggregated through function F. EBA( DB ) = F (D1, D2, …, D11, DB )
  • 7. 3. Decision problems of EBA and solution alternatives --Customization of EBA EBA = F (D1, D2, …, D11) Data set type 1 Data set type 2 Data set type k …… Classification according to characteristics of the data sets Si.j for Di? 3. How empirical study can be used to support the decision-making regarding the customization of EBA? Customization 1 Customization 2 Customization k
  • 8. 4. Decision support in an example EBA method —AQUA + Effort estimates Data set for AQUA + Learned accuracy distribution Attributes & weights Raw historical data Pre-process (missing value, attribute type…) Pre-Phase (D2, D6) (D4, D5) (D8) (D7, D9) Objects under estimation S2.3: NULL value S6.5: local-global similarity, weighted mean of local-similarity measures S4.2: equal frequency and equal width discretization S5.6-S5.9: RSA-based attribute weighting, heuristics H 1 -H 4 S7.1: similarity measure S8.2: learning process S9.1: adaptation using weighted mean General form of AQUA + : AQUA + = F (D2(S2.3), D4(S4.2), D5(S5.6), D6(S6.5), D7(S7.1), D8(S8.2), D9(S9.1)) For a specific type of data set DB : AQUA + ( DB ) = ? e.g. S5.6-S5.9: H 1 -H 4 ? AQUA + Learning Phase1 Predicting Phase2 Attribute weighting and selection Phase0
  • 9. Data sets used in the comparative study 4. Decision support in an example EBA method —Comparative study Mendes et al., 2003 0 0 6 34 Mends03 Kemerer et al., 1987 40 0 5 15 Kem87 ISBSG, 2004 63 27.24 24 158 ISBSG04-2 Jingzhou et al., 2005 71 6.8 14 76 USP05-FT Jingzhou et al., 2005 71 2.54 14 121 USP05-RQ Source %Non-Quantitative Attributes %Missing Values #Attributes #Objects Data Sets
  • 10. Comparison of the four attribute weighting heuristics 2. H 1 performed better than H 0 for all data sets, hence is recommended for use in AQUA + . Tentative conclusions: 1. H 1 and H 3 performed the best, hence RSA-based attribute weighting is recommended for use by AQUA + . 4. Decision support in an example EBA method —Comparative study H 4 H 3 H 2 H 1 H 0 -0.05 -0.05 − 0.15 -0.09 Kem87 0.15 0.62 − 0.03 -0.79 USP05-RQ 0.37 0.52 -1.53 0.42 0.22 USP05-FT 0.35 0.30 -2.62 1.81 0.16 ISBSG04-2 -0.47 AccuH[i] -0.48 -0.47 1.42 1.42 Mends03 Weighting Heuristics Data sets
  • 11. 4. Decision support in an example EBA method —Apply the knowledge obtained from the comparative study H 3 is suitable for this class H 1 is suitable for this class New data set Which heuristic should be used? H 1 is suitable for this class
  • 12. 5. Decision support and empirical studies Application or customization of EBA e.g. Knowledge about which alternatives are suitable for which types of data set New Data Set Knowledge base DSS for EBA Empirical studies Apply knowledge Classify Customize EBA
  • 13. 6. Summary and future work Decision-centric process model Decision support Decision problems and solution alternatives Example EBA AQUA + Empirical studies Knowledge base DSS for EBA
  • 14. Major references G. Ruhe, "Software Engineering Decision Support—A New Paradigm for Learning Software Organizations", Advances in Learning Software Organization , Lecture Notes In Computer Science, Vol. 2640, Springer 2003, pp 104-115. V.R. Basili, G. Caldiera, and H.D. Rombach, "Experience Factory", Encyclopedia of Software Engineering (Eds. J. Marciniak), Vol. 1, 2001, pp 511-519. G. Ruhe, "Software Engineering Decision Support and Empirical Investigations - A Proposed Marriage", The Future of Empirical Studies in Software Engineering (A. Jedlitschka, M. Ciolkowski, Eds.), Workshop Serious on Empirical Studies in Software Engineering, Vol. 2, 2003, pp 25-34. M. Shepperd, C. Schofield, “Estimating Software Project Effort Using Analogies”, IEEE Transactions on Software Engineering , 23(1997) 736-743. J.Z. Li, G. Ruhe, A. Al-Emran, and M.M. Ritcher, "A Flexible Method for Effort Estimation by Analogy", Empirical Software Engineering , Vol. 12, No. 1, 2007, pp 65-106. J.Z. Li, G. Ruhe, "Software Effort Estimation by Analogy Using Attribute Weighting Based on Rough Sets", International Journal of Software Engineering and Knowledge Engineering , To appear. J.Z. Li, A. Ahmed, G. Ruhe, "Impact Analysis of Missing Values on the Prediction Accuracy of Analogy-based Software Estimation Method AQUA", ESEM’07, Madrid, Spain, September 2007.
  • 15. Thank you !  Comments and questions?
  • 16. A preliminary DSS framework for EBA Machine learning and reasoning tools Virtual DB Virtual KB … Interface Database Documents Web contents, hypermedia Model base Other forms of contents Rule base Domain knowledge Dealing with missing values Attribute weighting and selection Discretization of attributes General EBA comparison methods … Object selection Determining similarity measures Retrieving & determining analogs Analogy adaptation strategy … Knowledge representation and acquisition General data analysis tools Decision-centric EBA process Objects under estimation Effort estimates

Editor's Notes

  • #2: Slide 11: 4 Heuristics as a header for the lower Introduce a simplified formulae exlaing how the coefficients were calculated. - Be prepared for the following questions: (1) Are there alternatives to using RSA to determine the importance of the attributes? (2) What is the overall effort of the method(s) (3) Wham means the name AQUA? (4) When do you recommend apply the method? (and when better not?) (5) Needs the learning be done after each new prediction (data point)??
  翻译: