SlideShare a Scribd company logo
Presented by –Harsh Upadhyay
Module Objective


          Agenda
•Introduce the concept of Simple Linear Regression

•Walk through the process of plotting our data

•Apply regression Techniques

•Evaluate our model

•Interpret the result

   Expected Learning
•Understand key Simple Linear Regression terminology

•Evaluate the relationship between a continuous X and continuous Y

•Use regression analysis to make predictions about process
Historical Note


•Sir Francis Galton (1822-1911) used the term
Regression.


•To explain the relationship between the heights
(inches) of fathers and their sons.
Simple Linear Regression


Regression analysis is used to predict the value of one variable (the dependent
variable) on the basis of other variables (the independent variables).

Dependent variable: denoted Y
Independent variables: denoted X1, X2, …, Xk

   If we only have ONE independent variable, the model is
Simple Linear Regression Analysis



Variables:
         X = Independent Variable (we provide this)
         Y = Dependent Variable (we observe this)
Parameters:
         β0 = Intercept
                 The y-intercept of a line is the point at which the line crosses the y
                  axis. ( i.e. where the x value equals 0)
             β1 = Slope
                  Change in the mean of Y for a unit change in X
             ε ~ Normal Random Variable (με = 0, σε = ???) [Noise]
Simple Linear Regression Analysis


Meaning of       and

   > 0 [positive slope]   < 0 [negative slope]

             y



                                         rise

                               run
                                           =slope (=rise/run)


  =y-intercept

                                                    x
Effect of Larger Values of σ ε


                Lower vs. Higher
                Variability
 Y




25K$

                    Y= β0+ β1 +

                                   X
The least Square Method


                                                            s
                                                         nce
                                               dif   fere
these differences are                       ed
                                        quar
called residuals or                  hes
errors                           of t
                           e sum line…
                    ze s th d the
                 imi nts an
              min poi
           ne
       s li n the
    Thi wee
     bet
Exercise

A black belt is connected with optimize a call center in a retail bank where clients place
the inquiries and order over the phone during 6 am to 6 pm (Monday to Friday). The
current staffing plan is begin with about 4 associates at 6 am and increases to about 35
associate by 9 am. At 3.30 pm the no. of associate begin to drop to about 7 by 6 pm. The
black belt is anticipating an increasing in call volume and want to know how many call
can be answer in 30 min time interval for various staffing level to better staff the call
center.

The black belt obtain the data on the no. of associate and the no. of calls answered
for each 30 min interval for the last two weeks. You have a total of 240 Samples.




                                          Data sheet
Scatter Plot
Fitted Line Plot

      Regression Analysis: CallsAnswd
      versus Staff

      The regression equation is
      CallsAnswd = - 8.844 + 3.099 Staff


      S = 11.7311     R-Sq = 83.4%     R-Sq(adj) = 83.3%


      Analysis of Variance

      Source         DF       SS       MS         F       P
      Regression      1   164091   164091   1192.37   0.000
      Error         238    32753      138
      Total         239   196844



      Fitted Line: CallsAnswd versus Staff
Minitab Output-Session Window

CallsAnswd = - 8.844 + 3.099 Staff

CallsAnswd = - 8.844 + 3.099 (17)=43.839

CallsAnswd = - 8.844 + 3.099 (22)=59.334


                 How Confident are you
                 with this model?

                               59 Calls
   Staff of 22         ?       59 Calls
                               59 Calls
Assessing The Model

              Evaluate the strength of the regression model
1. Determine how much variation in our calls answered data is actually explained by
   staff.

2. Determine The strength of the relationship between Call Answered and Staff


Regression Analysis: CallsAnswd                                     R-sq
versus Staff
The regression equation is
CallsAnswd = - 8.844 + 3.099 Staff                         % of variation in the Y
S = 11.7311    R-Sq = 83.4%    R-Sq(adj) = 83.3%           values explained by the
                                                           linear relationship with X
Analysis of Variance
Source       DF      SS       MS         F       P
Regression    1 164091    164091   1192.37   0.000
Error       238   32753      138                         % of variation Calls Answered
Total       239 196844                                   explained by Staff
Fitted Line: CallsAnswd versus Staff
% of Variation-Explanation




                                                       Explained       Total
                                                       Variation (Y)   Variation (Y)




              Explained Variation
R Squared =      Total Variation    = Between 0 and 1 (0% to 100%)
Properties of R

                                                              The Correlation Coefficient
                                      √ R2 = R         Measure the strength of the linear relationship
                                                          (Pearson’s Correlation of coefficient)




                                                                    R
      Strong            Moderate                     Low                            Low                   Moderate           Moderate
-1 Relationship -0.8   Relationship   -0.5       Relationship        0          Relationship    0.5      Relationship   0.8 Relationship +1
Caution- Correlation and
                          Causation


                                          Calls Answered



                                                  Staff
                                              Correlated




   Correlation                                Causation
                                        Change in one variable causes
Two things vary together
                                             change in another
Regression Interpretation-
                               Graphical




•Residual is not normal
•Residual Vs Fits is not constant
Regression- Session Window

Regression Analysis: CallsAnswd versus Staff         Unusual Observations

The regression equation is                           Obs   Staff CallsAnswd Fit      SE Fit   Residual   St Resid
CallsAnswd = - 8.84 + 3.10 Staff                       1   3.0    6.000     0.454     1.995     5.546     0.48 X
                                                       3   4.0    7.000     3.554     1.913     3.446     0.30 X
                                                       6   4.0   11.000     3.554     1.913     7.446     0.64 X
Predictor Coef SE Coef      T    P                     7   4.0   12.000     3.554     1.913     8.446     0.73 X
Constant -8.844 2.247     -3.94 0.000                  8   3.0    5.000     0.454     1.995     4.546     0.39 X
Staff     3.09947 0.08976 34.53 0.000                  9   3.0    7.000     0.454     1.995     6.546     0.57 X
                                                      67   29.0 109.000     81.040    0.901    27.960     2.39R
                                                      78   32.0 127.000     90.339    1.071    36.661     3.14R
S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3%           141   30.0 134.000     84.140    0.952    49.860     4.26R
                                                     151   30.0 114.000     84.140    0.952    29.860     2.55R
                                                     161   33.0 128.000     93.438    1.136    34.562     2.96R
Analysis of Variance                                 171   33.0 126.000     93.438    1.136    32.562     2.79R
                                                     183   32.0    59.000   90.339    1.071   -31.339    -2.68R
Source         DF       SS     MS       F       P    199   29.0    48.000   81.040    0.901   -33.040    -2.82R
Regression     1       164091 164091 1192.37 0.000   202   25.0    35.000   68.643    0.768   -33.643    -2.87R
Residual Error 238     32753 138                     235    4.0   14.000     3.554    1.913    10.446     0.90 X
Total          239     196844                        236    4.0    7.000     3.554    1.913     3.446     0.30 X

                                                     R denotes an observation with a large standardized residual (>2)
                                                     X denotes an observation whose X value gives it large leverage
Unusual Observations
 Unusual Observations                                               Unusual Observations

 Obs   Staff CallsAnswd Fit      SE Fit   Residual      St Resid    Obs   Staff CallsAnswd Fit      SE Fit   Residual   St Resid
   1   3.0    6.000     0.454     1.995     5.546        0.48 X       1   3.0    6.000     0.454     1.995     5.546     0.48 X
   3   4.0    7.000     3.554     1.913     3.446        0.30 X       3   4.0    7.000     3.554     1.913     3.446     0.30 X
   6   4.0   11.000     3.554     1.913     7.446        0.64 X       6   4.0   11.000     3.554     1.913     7.446     0.64 X
   7   4.0   12.000     3.554     1.913     8.446        0.73 X       7   4.0   12.000     3.554     1.913     8.446     0.73 X
   8   3.0    5.000     0.454     1.995     4.546        0.39 X       8   3.0    5.000     0.454     1.995     4.546     0.39 X
   9   3.0    7.000     0.454     1.995     6.546        0.57 X       9   3.0    7.000     0.454     1.995     6.546     0.57 X
  67   29.0 109.000     81.040    0.901    27.960        2.39R       67   29.0 109.000     81.040    0.901    27.960     2.39R
  78   32.0 127.000     90.339    1.071    36.661        3.14R       78   32.0 127.000     90.339    1.071    36.661     3.14R
 141   30.0 134.000     84.140    0.952    49.860        4.26R      141   30.0 134.000     84.140    0.952    49.860     4.26R
 151   30.0 114.000     84.140    0.952    29.860        2.55R      151   30.0 114.000     84.140    0.952    29.860     2.55R
 161   33.0 128.000     93.438    1.136    34.562        2.96R      161   33.0 128.000     93.438    1.136    34.562     2.96R
 171   33.0 126.000     93.438    1.136    32.562        2.79R      171   33.0 126.000     93.438    1.136    32.562     2.79R
 183   32.0    59.000   90.339    1.071   -31.339       -2.68R      183   32.0    59.000   90.339    1.071   -31.339    -2.68R
 199   29.0    48.000   81.040    0.901   -33.040       -2.82R      199   29.0    48.000   81.040    0.901   -33.040    -2.82R
 202   25.0    35.000   68.643    0.768   -33.643       -2.87R      202   25.0    35.000   68.643    0.768   -33.643    -2.87R
 235    4.0   14.000     3.554    1.913    10.446        0.90 X     235    4.0   14.000     3.554    1.913    10.446     0.90 X
 236    4.0    7.000     3.554    1.913     3.446        0.30 X     236    4.0    7.000     3.554    1.913     3.446     0.30 X

 R denotes an observation with a large standardized residual (>2)   X denotes an observation whose X value gives it large leverage

                                          240 Data points

Unusual observation                       4% = 9 points


Residual values (R) ≤
5% of total observation
Original Vs Altered

Original Graph               Altered Graph
Interpreting The Model

Regression Analysis: CallsAnswd versus Staff

The regression equation is
CallsAnswd = - 8.84 + 3.10 Staff


Predictor    Coef    SE Coef         T     P
Constant    -8.844    2.247        -3.94 0.000         Predictor Table
Staff        3.09947 0.08976        34.53 0.000



S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3%           Additional Statistics


Analysis of Variance

Source         DF       SS     MS       F       P
Regression     1       164091 164091 1192.37 0.000
                                                        ANOVA Table
Residual Error 238     32753 138
Total          239     196844
Interpreting The Model

Regression Analysis: CallsAnswd versus Staff

The regression equation is
CallsAnswd = - 8.84 + 3.10 Staff
                                                           Predictor Table
Predictor    Coef    SE Coef         T     P
Constant    -8.844    2.247        -3.94 0.000
Staff        3.09947 0.08976        34.53 0.000      Ho: Slope = 0
                                                     •No difference in Y when X
                                                     changes
S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3%           •X has no impact on y


Analysis of Variance                                 Ha: Slope ≠ 0
Source         DF       SS     MS       F       P
Regression     1       164091 164091 1192.37 0.000
                                                     •Y changes as X changes
Residual Error 238     32753 138                     •X has impact on y
Total          239     196844
Interpreting The Model

Regression Analysis: CallsAnswd versus Staff

The regression equation is
CallsAnswd = - 8.84 + 3.10 Staff
                                                         Additional Statistics
Predictor    Coef    SE Coef         T     P
Constant    -8.844    2.247        -3.94 0.000
                                                     R-squared
Staff        3.09947 0.08976        34.53 0.000      •The amount of variation
                                                     explained by this variation

S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3%           R-squared (Adjusted)
                                                     •Account for the no. of X’s used in
                                                     model (Used in multiple
Analysis of Variance                                 regression)

Source         DF       SS     MS       F       P    S
Regression     1       164091 164091 1192.37 0.000
Residual Error 238     32753 138
                                                     •Standard deviation for the
Total          239     196844                        leftover or unexplained variation.
                                                     It is used for designing the
                                                     confidence interval
Interpreting The Model

Regression Analysis: CallsAnswd versus Staff

The regression equation is
CallsAnswd = - 8.84 + 3.10 Staff
                                                             ANOVA Table
Predictor    Coef    SE Coef         T     P
Constant    -8.844    2.247        -3.94 0.000
                                                     Regression
Staff        3.09947 0.08976        34.53 0.000
                                                     •The explained variation


S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3%           Residual Error
                                                     •Unexplained Variation
Analysis of Variance

Source         DF       SS     MS       F       P
                                                     Ho:
Regression     1       164091 164091 1192.37 0.000   •The model does not explain the
Residual Error 238     32753 138                     observed variation
Total          239     196844

                                                     Ha:
                                                     •The model does explain the
                                                     observed variation
Confidence on Output

 CallsAnswd = -8.844 + 3.099 Staff



If Staff 10 then, - 8.844 + 3.099 (10) = 22

If Staff 20 then, - 8.844 + 3.099 (20) = 53

If Staff 30 then, - 8.844 + 3.099 (30) = 84


               Confidence = ???%

               R-squared = 83.4%
                                              Report result using
         Unexplained Variation = 16.6%
                                              confidence intervals
Confidence on Output



 95% confidence the true
regression line lies within




                        95% prediction interval expect 95%
                          of the data points to fall within
Confidence on Output



Confidence interval report
  average performance




                         Prediction intervals
                        predict actual values
Predicted Value for Y
Let management decide max 20 staff, then how may call answer by these
20 executive



Predicted Values for New Observations

New Obs  Fit   SE Fit          95% CI              95% PI
   1    53.145 0.822       (51.526, 54.765)   (29.979, 76.312)


Values of Predictors for New Observations

New Obs Staff
  1 20.0
Review Methodology

1. Get Familiar with the data                     3. Check the model & the assumption

• Identify the output or Y variable               • Look at the residual plot
• Identify the X variable or predictor                   Are assumption valid?
• Look at the time series, dot plots, histogram          Do residual look ok?
and scatter plot                                         Are there unusual observation? If yes,
• Run descriptive statistics                            address, if possible and rerun the analysis
        Check for outliers                       • What is r-squared?
        Check for gaps in the data               • How much variation can be explained by the
                                                  model?
                                                  • Look at P –values
                                                         Does X have significant impact on Y?



2. Fit the model to the data                      4. Report Result & Use Equation
• Run regression                                  • Summarize the results for your stakeholders
• Use the regression                              • If you have excluded data, explain why
                                                  • If you kept outlier in the data, explain why
                                                  • How much variation in Y can be explained by X?
                                                  • Remember causation vs. correlation
                                                  • Make a predictions using confidence and
                                                  prediction intervals.
Simple linear regression (final)
Ad

More Related Content

What's hot (20)

Linear regression
Linear regressionLinear regression
Linear regression
Karishma Chaudhary
 
Regression analysis algorithm
Regression analysis algorithm Regression analysis algorithm
Regression analysis algorithm
Sammer Qader
 
Regression
RegressionRegression
Regression
simran sakshi
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Shiela Vinarao
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Teachers Mitraa
 
R square vs adjusted r square
R square vs adjusted r squareR square vs adjusted r square
R square vs adjusted r square
Akhilesh Joshi
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysis
Rabin BK
 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
alok tiwari
 
Spearman rank correlation coefficient
Spearman rank correlation coefficientSpearman rank correlation coefficient
Spearman rank correlation coefficient
Karishma Chaudhary
 
Assumptions of Linear Regression - Machine Learning
Assumptions of Linear Regression - Machine LearningAssumptions of Linear Regression - Machine Learning
Assumptions of Linear Regression - Machine Learning
Kush Kulshrestha
 
Logistic regression with SPSS
Logistic regression with SPSSLogistic regression with SPSS
Logistic regression with SPSS
LNIPE
 
Correlation and Simple Regression
Correlation  and Simple RegressionCorrelation  and Simple Regression
Correlation and Simple Regression
Venkata Reddy Konasani
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Salim Azad
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Dr.ammara khakwani
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
University of Jaffna
 
Chapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares RegressionChapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares Regression
nszakir
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
DrZahid Khan
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Srikant001p
 
Linear regression
Linear regressionLinear regression
Linear regression
Tech_MX
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
Khaled Abd Elaziz
 
Regression analysis algorithm
Regression analysis algorithm Regression analysis algorithm
Regression analysis algorithm
Sammer Qader
 
R square vs adjusted r square
R square vs adjusted r squareR square vs adjusted r square
R square vs adjusted r square
Akhilesh Joshi
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysis
Rabin BK
 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
alok tiwari
 
Spearman rank correlation coefficient
Spearman rank correlation coefficientSpearman rank correlation coefficient
Spearman rank correlation coefficient
Karishma Chaudhary
 
Assumptions of Linear Regression - Machine Learning
Assumptions of Linear Regression - Machine LearningAssumptions of Linear Regression - Machine Learning
Assumptions of Linear Regression - Machine Learning
Kush Kulshrestha
 
Logistic regression with SPSS
Logistic regression with SPSSLogistic regression with SPSS
Logistic regression with SPSS
LNIPE
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Salim Azad
 
Chapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares RegressionChapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares Regression
nszakir
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
DrZahid Khan
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Srikant001p
 
Linear regression
Linear regressionLinear regression
Linear regression
Tech_MX
 

Viewers also liked (15)

Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis ppt
Elkana Rorio
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Shameer P Hamsa
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
Shubham Mehta
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variability
Mark Santos
 
Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)
Upama Dwivedi
 
Hypothesis
HypothesisHypothesis
Hypothesis
Mukut Deori
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
nadiazaheer
 
Normal Curve and Standard Scores
Normal Curve and Standard ScoresNormal Curve and Standard Scores
Normal Curve and Standard Scores
Jenewel Azuelo
 
Regression
RegressionRegression
Regression
mandrewmartin
 
Measures of dispersion
Measures  of  dispersionMeasures  of  dispersion
Measures of dispersion
Southern Range, Berhampur, Odisha
 
Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)
Ramnath Takiar
 
Pearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionPearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear Regression
Azmi Mohd Tamil
 
Pearson Correlation
Pearson CorrelationPearson Correlation
Pearson Correlation
Noreen Morales
 
Statistics
StatisticsStatistics
Statistics
pikuoec
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)
Sneh Kumari
 
Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis ppt
Elkana Rorio
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variability
Mark Santos
 
Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)
Upama Dwivedi
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
nadiazaheer
 
Normal Curve and Standard Scores
Normal Curve and Standard ScoresNormal Curve and Standard Scores
Normal Curve and Standard Scores
Jenewel Azuelo
 
Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)
Ramnath Takiar
 
Pearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionPearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear Regression
Azmi Mohd Tamil
 
Statistics
StatisticsStatistics
Statistics
pikuoec
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)
Sneh Kumari
 
Ad

Similar to Simple linear regression (final) (20)

Regression and Co-Relation
Regression and Co-RelationRegression and Co-Relation
Regression and Co-Relation
nuwan udugampala
 
Linear Regression
Linear Regression Linear Regression
Linear Regression
Rupak Roy
 
Regression-SIMPLE LINEAR (1).psssssssssptx
Regression-SIMPLE LINEAR (1).psssssssssptxRegression-SIMPLE LINEAR (1).psssssssssptx
Regression-SIMPLE LINEAR (1).psssssssssptx
pokah34509
 
UNIT 4.pptx
UNIT 4.pptxUNIT 4.pptx
UNIT 4.pptx
Mrunmayee Manjari
 
Correlation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxCorrelation and Regression Analysis.pptx
Correlation and Regression Analysis.pptx
Unfold1
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
saba khan
 
Linear regression
Linear regressionLinear regression
Linear regression
Regent University
 
need help with stats 301 assignment help
need help with stats 301 assignment helpneed help with stats 301 assignment help
need help with stats 301 assignment help
realnerdovo
 
Cost Behavior
Cost BehaviorCost Behavior
Cost Behavior
AIS_USU
 
Corr-and-Regress (1).ppt
Corr-and-Regress (1).pptCorr-and-Regress (1).ppt
Corr-and-Regress (1).ppt
MuhammadAftab89
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.ppt
BAGARAGAZAROMUALD2
 
Cr-and-Regress.ppt
Cr-and-Regress.pptCr-and-Regress.ppt
Cr-and-Regress.ppt
RidaIrfan10
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.ppt
MoinPasha12
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.ppt
HarunorRashid74
 
Correlation & Regression for Statistics Social Science
Correlation & Regression for Statistics Social ScienceCorrelation & Regression for Statistics Social Science
Correlation & Regression for Statistics Social Science
ssuser71ac73
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.ppt
krunal soni
 
Numerical approximation
Numerical approximationNumerical approximation
Numerical approximation
Mileacre
 
CORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resourceCORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resource
Sharon517605
 
regression and correlation
regression and correlationregression and correlation
regression and correlation
Priya Sharma
 
Measure of dispersion
Measure of dispersionMeasure of dispersion
Measure of dispersion
Waqar Abbasi
 
Regression and Co-Relation
Regression and Co-RelationRegression and Co-Relation
Regression and Co-Relation
nuwan udugampala
 
Linear Regression
Linear Regression Linear Regression
Linear Regression
Rupak Roy
 
Regression-SIMPLE LINEAR (1).psssssssssptx
Regression-SIMPLE LINEAR (1).psssssssssptxRegression-SIMPLE LINEAR (1).psssssssssptx
Regression-SIMPLE LINEAR (1).psssssssssptx
pokah34509
 
Correlation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxCorrelation and Regression Analysis.pptx
Correlation and Regression Analysis.pptx
Unfold1
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
saba khan
 
need help with stats 301 assignment help
need help with stats 301 assignment helpneed help with stats 301 assignment help
need help with stats 301 assignment help
realnerdovo
 
Cost Behavior
Cost BehaviorCost Behavior
Cost Behavior
AIS_USU
 
Corr-and-Regress (1).ppt
Corr-and-Regress (1).pptCorr-and-Regress (1).ppt
Corr-and-Regress (1).ppt
MuhammadAftab89
 
Cr-and-Regress.ppt
Cr-and-Regress.pptCr-and-Regress.ppt
Cr-and-Regress.ppt
RidaIrfan10
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.ppt
MoinPasha12
 
Correlation & Regression for Statistics Social Science
Correlation & Regression for Statistics Social ScienceCorrelation & Regression for Statistics Social Science
Correlation & Regression for Statistics Social Science
ssuser71ac73
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.ppt
krunal soni
 
Numerical approximation
Numerical approximationNumerical approximation
Numerical approximation
Mileacre
 
CORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resourceCORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resource
Sharon517605
 
regression and correlation
regression and correlationregression and correlation
regression and correlation
Priya Sharma
 
Measure of dispersion
Measure of dispersionMeasure of dispersion
Measure of dispersion
Waqar Abbasi
 
Ad

Recently uploaded (20)

Unit 5 ACUTE, SUBACUTE,CHRONIC TOXICITY.pptx
Unit 5 ACUTE, SUBACUTE,CHRONIC TOXICITY.pptxUnit 5 ACUTE, SUBACUTE,CHRONIC TOXICITY.pptx
Unit 5 ACUTE, SUBACUTE,CHRONIC TOXICITY.pptx
Mayuri Chavan
 
Final Evaluation.docx...........................
Final Evaluation.docx...........................Final Evaluation.docx...........................
Final Evaluation.docx...........................
l1bbyburrell
 
PUBH1000 Slides - Module 11: Governance for Health
PUBH1000 Slides - Module 11: Governance for HealthPUBH1000 Slides - Module 11: Governance for Health
PUBH1000 Slides - Module 11: Governance for Health
JonathanHallett4
 
How To Maximize Sales Performance using Odoo 18 Diverse views in sales module
How To Maximize Sales Performance using Odoo 18 Diverse views in sales moduleHow To Maximize Sales Performance using Odoo 18 Diverse views in sales module
How To Maximize Sales Performance using Odoo 18 Diverse views in sales module
Celine George
 
Conditions for Boltzmann Law – Biophysics Lecture Slide
Conditions for Boltzmann Law – Biophysics Lecture SlideConditions for Boltzmann Law – Biophysics Lecture Slide
Conditions for Boltzmann Law – Biophysics Lecture Slide
PKLI-Institute of Nursing and Allied Health Sciences Lahore , Pakistan.
 
Chemotherapy of Malignancy -Anticancer.pptx
Chemotherapy of Malignancy -Anticancer.pptxChemotherapy of Malignancy -Anticancer.pptx
Chemotherapy of Malignancy -Anticancer.pptx
Mayuri Chavan
 
Dastur_ul_Amal under Jahangir Key Features.pptx
Dastur_ul_Amal under Jahangir Key Features.pptxDastur_ul_Amal under Jahangir Key Features.pptx
Dastur_ul_Amal under Jahangir Key Features.pptx
omorfaruqkazi
 
Module_2_Types_and_Approaches_of_Research (2).pptx
Module_2_Types_and_Approaches_of_Research (2).pptxModule_2_Types_and_Approaches_of_Research (2).pptx
Module_2_Types_and_Approaches_of_Research (2).pptx
drroxannekemp
 
How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18
Celine George
 
How to Configure Extra Steps During Checkout in Odoo 18 Website
How to Configure Extra Steps During Checkout in Odoo 18 WebsiteHow to Configure Extra Steps During Checkout in Odoo 18 Website
How to Configure Extra Steps During Checkout in Odoo 18 Website
Celine George
 
Peer Assessment_ Unit 2 Skills Development for Live Performance - for Libby.docx
Peer Assessment_ Unit 2 Skills Development for Live Performance - for Libby.docxPeer Assessment_ Unit 2 Skills Development for Live Performance - for Libby.docx
Peer Assessment_ Unit 2 Skills Development for Live Performance - for Libby.docx
19lburrell
 
YSPH VMOC Special Report - Measles Outbreak Southwest US 5-17-2025 .pptx
YSPH VMOC Special Report - Measles Outbreak  Southwest US 5-17-2025  .pptxYSPH VMOC Special Report - Measles Outbreak  Southwest US 5-17-2025  .pptx
YSPH VMOC Special Report - Measles Outbreak Southwest US 5-17-2025 .pptx
Yale School of Public Health - The Virtual Medical Operations Center (VMOC)
 
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptxANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
Mayuri Chavan
 
Classification of mental disorder in 5th semester bsc. nursing and also used ...
Classification of mental disorder in 5th semester bsc. nursing and also used ...Classification of mental disorder in 5th semester bsc. nursing and also used ...
Classification of mental disorder in 5th semester bsc. nursing and also used ...
parmarjuli1412
 
PUBH1000 Slides - Module 12: Advocacy for Health
PUBH1000 Slides - Module 12: Advocacy for HealthPUBH1000 Slides - Module 12: Advocacy for Health
PUBH1000 Slides - Module 12: Advocacy for Health
JonathanHallett4
 
The History of Kashmir Lohar Dynasty NEP.ppt
The History of Kashmir Lohar Dynasty NEP.pptThe History of Kashmir Lohar Dynasty NEP.ppt
The History of Kashmir Lohar Dynasty NEP.ppt
Arya Mahila P. G. College, Banaras Hindu University, Varanasi, India.
 
YSPH VMOC Special Report - Measles Outbreak Southwest US 5-14-2025 .pptx
YSPH VMOC Special Report - Measles Outbreak  Southwest US 5-14-2025  .pptxYSPH VMOC Special Report - Measles Outbreak  Southwest US 5-14-2025  .pptx
YSPH VMOC Special Report - Measles Outbreak Southwest US 5-14-2025 .pptx
Yale School of Public Health - The Virtual Medical Operations Center (VMOC)
 
ITI COPA Question Paper PDF 2017 Theory MCQ
ITI COPA Question Paper PDF 2017 Theory MCQITI COPA Question Paper PDF 2017 Theory MCQ
ITI COPA Question Paper PDF 2017 Theory MCQ
SONU HEETSON
 
The role of wall art in interior designing
The role of wall art in interior designingThe role of wall art in interior designing
The role of wall art in interior designing
meghaark2110
 
"Bridging Cultures Through Holiday Cards: 39 Students Celebrate Global Tradit...
"Bridging Cultures Through Holiday Cards: 39 Students Celebrate Global Tradit..."Bridging Cultures Through Holiday Cards: 39 Students Celebrate Global Tradit...
"Bridging Cultures Through Holiday Cards: 39 Students Celebrate Global Tradit...
AlionaBujoreanu
 
Unit 5 ACUTE, SUBACUTE,CHRONIC TOXICITY.pptx
Unit 5 ACUTE, SUBACUTE,CHRONIC TOXICITY.pptxUnit 5 ACUTE, SUBACUTE,CHRONIC TOXICITY.pptx
Unit 5 ACUTE, SUBACUTE,CHRONIC TOXICITY.pptx
Mayuri Chavan
 
Final Evaluation.docx...........................
Final Evaluation.docx...........................Final Evaluation.docx...........................
Final Evaluation.docx...........................
l1bbyburrell
 
PUBH1000 Slides - Module 11: Governance for Health
PUBH1000 Slides - Module 11: Governance for HealthPUBH1000 Slides - Module 11: Governance for Health
PUBH1000 Slides - Module 11: Governance for Health
JonathanHallett4
 
How To Maximize Sales Performance using Odoo 18 Diverse views in sales module
How To Maximize Sales Performance using Odoo 18 Diverse views in sales moduleHow To Maximize Sales Performance using Odoo 18 Diverse views in sales module
How To Maximize Sales Performance using Odoo 18 Diverse views in sales module
Celine George
 
Chemotherapy of Malignancy -Anticancer.pptx
Chemotherapy of Malignancy -Anticancer.pptxChemotherapy of Malignancy -Anticancer.pptx
Chemotherapy of Malignancy -Anticancer.pptx
Mayuri Chavan
 
Dastur_ul_Amal under Jahangir Key Features.pptx
Dastur_ul_Amal under Jahangir Key Features.pptxDastur_ul_Amal under Jahangir Key Features.pptx
Dastur_ul_Amal under Jahangir Key Features.pptx
omorfaruqkazi
 
Module_2_Types_and_Approaches_of_Research (2).pptx
Module_2_Types_and_Approaches_of_Research (2).pptxModule_2_Types_and_Approaches_of_Research (2).pptx
Module_2_Types_and_Approaches_of_Research (2).pptx
drroxannekemp
 
How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18
Celine George
 
How to Configure Extra Steps During Checkout in Odoo 18 Website
How to Configure Extra Steps During Checkout in Odoo 18 WebsiteHow to Configure Extra Steps During Checkout in Odoo 18 Website
How to Configure Extra Steps During Checkout in Odoo 18 Website
Celine George
 
Peer Assessment_ Unit 2 Skills Development for Live Performance - for Libby.docx
Peer Assessment_ Unit 2 Skills Development for Live Performance - for Libby.docxPeer Assessment_ Unit 2 Skills Development for Live Performance - for Libby.docx
Peer Assessment_ Unit 2 Skills Development for Live Performance - for Libby.docx
19lburrell
 
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptxANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
Mayuri Chavan
 
Classification of mental disorder in 5th semester bsc. nursing and also used ...
Classification of mental disorder in 5th semester bsc. nursing and also used ...Classification of mental disorder in 5th semester bsc. nursing and also used ...
Classification of mental disorder in 5th semester bsc. nursing and also used ...
parmarjuli1412
 
PUBH1000 Slides - Module 12: Advocacy for Health
PUBH1000 Slides - Module 12: Advocacy for HealthPUBH1000 Slides - Module 12: Advocacy for Health
PUBH1000 Slides - Module 12: Advocacy for Health
JonathanHallett4
 
ITI COPA Question Paper PDF 2017 Theory MCQ
ITI COPA Question Paper PDF 2017 Theory MCQITI COPA Question Paper PDF 2017 Theory MCQ
ITI COPA Question Paper PDF 2017 Theory MCQ
SONU HEETSON
 
The role of wall art in interior designing
The role of wall art in interior designingThe role of wall art in interior designing
The role of wall art in interior designing
meghaark2110
 
"Bridging Cultures Through Holiday Cards: 39 Students Celebrate Global Tradit...
"Bridging Cultures Through Holiday Cards: 39 Students Celebrate Global Tradit..."Bridging Cultures Through Holiday Cards: 39 Students Celebrate Global Tradit...
"Bridging Cultures Through Holiday Cards: 39 Students Celebrate Global Tradit...
AlionaBujoreanu
 

Simple linear regression (final)

  • 2. Module Objective Agenda •Introduce the concept of Simple Linear Regression •Walk through the process of plotting our data •Apply regression Techniques •Evaluate our model •Interpret the result Expected Learning •Understand key Simple Linear Regression terminology •Evaluate the relationship between a continuous X and continuous Y •Use regression analysis to make predictions about process
  • 3. Historical Note •Sir Francis Galton (1822-1911) used the term Regression. •To explain the relationship between the heights (inches) of fathers and their sons.
  • 4. Simple Linear Regression Regression analysis is used to predict the value of one variable (the dependent variable) on the basis of other variables (the independent variables). Dependent variable: denoted Y Independent variables: denoted X1, X2, …, Xk If we only have ONE independent variable, the model is
  • 5. Simple Linear Regression Analysis Variables: X = Independent Variable (we provide this) Y = Dependent Variable (we observe this) Parameters: β0 = Intercept The y-intercept of a line is the point at which the line crosses the y axis. ( i.e. where the x value equals 0) β1 = Slope Change in the mean of Y for a unit change in X ε ~ Normal Random Variable (με = 0, σε = ???) [Noise]
  • 6. Simple Linear Regression Analysis Meaning of and > 0 [positive slope] < 0 [negative slope] y rise run =slope (=rise/run) =y-intercept x
  • 7. Effect of Larger Values of σ ε Lower vs. Higher Variability Y 25K$ Y= β0+ β1 + X
  • 8. The least Square Method s nce dif fere these differences are ed quar called residuals or hes errors of t e sum line… ze s th d the imi nts an min poi ne s li n the Thi wee bet
  • 9. Exercise A black belt is connected with optimize a call center in a retail bank where clients place the inquiries and order over the phone during 6 am to 6 pm (Monday to Friday). The current staffing plan is begin with about 4 associates at 6 am and increases to about 35 associate by 9 am. At 3.30 pm the no. of associate begin to drop to about 7 by 6 pm. The black belt is anticipating an increasing in call volume and want to know how many call can be answer in 30 min time interval for various staffing level to better staff the call center. The black belt obtain the data on the no. of associate and the no. of calls answered for each 30 min interval for the last two weeks. You have a total of 240 Samples. Data sheet
  • 11. Fitted Line Plot Regression Analysis: CallsAnswd versus Staff The regression equation is CallsAnswd = - 8.844 + 3.099 Staff S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% Analysis of Variance Source DF SS MS F P Regression 1 164091 164091 1192.37 0.000 Error 238 32753 138 Total 239 196844 Fitted Line: CallsAnswd versus Staff
  • 12. Minitab Output-Session Window CallsAnswd = - 8.844 + 3.099 Staff CallsAnswd = - 8.844 + 3.099 (17)=43.839 CallsAnswd = - 8.844 + 3.099 (22)=59.334 How Confident are you with this model? 59 Calls Staff of 22 ? 59 Calls 59 Calls
  • 13. Assessing The Model Evaluate the strength of the regression model 1. Determine how much variation in our calls answered data is actually explained by staff. 2. Determine The strength of the relationship between Call Answered and Staff Regression Analysis: CallsAnswd R-sq versus Staff The regression equation is CallsAnswd = - 8.844 + 3.099 Staff % of variation in the Y S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% values explained by the linear relationship with X Analysis of Variance Source DF SS MS F P Regression 1 164091 164091 1192.37 0.000 Error 238 32753 138 % of variation Calls Answered Total 239 196844 explained by Staff Fitted Line: CallsAnswd versus Staff
  • 14. % of Variation-Explanation Explained Total Variation (Y) Variation (Y) Explained Variation R Squared = Total Variation = Between 0 and 1 (0% to 100%)
  • 15. Properties of R The Correlation Coefficient √ R2 = R Measure the strength of the linear relationship (Pearson’s Correlation of coefficient) R Strong Moderate Low Low Moderate Moderate -1 Relationship -0.8 Relationship -0.5 Relationship 0 Relationship 0.5 Relationship 0.8 Relationship +1
  • 16. Caution- Correlation and Causation Calls Answered Staff Correlated Correlation Causation Change in one variable causes Two things vary together change in another
  • 17. Regression Interpretation- Graphical •Residual is not normal •Residual Vs Fits is not constant
  • 18. Regression- Session Window Regression Analysis: CallsAnswd versus Staff Unusual Observations The regression equation is Obs Staff CallsAnswd Fit SE Fit Residual St Resid CallsAnswd = - 8.84 + 3.10 Staff 1 3.0 6.000 0.454 1.995 5.546 0.48 X 3 4.0 7.000 3.554 1.913 3.446 0.30 X 6 4.0 11.000 3.554 1.913 7.446 0.64 X Predictor Coef SE Coef T P 7 4.0 12.000 3.554 1.913 8.446 0.73 X Constant -8.844 2.247 -3.94 0.000 8 3.0 5.000 0.454 1.995 4.546 0.39 X Staff 3.09947 0.08976 34.53 0.000 9 3.0 7.000 0.454 1.995 6.546 0.57 X 67 29.0 109.000 81.040 0.901 27.960 2.39R 78 32.0 127.000 90.339 1.071 36.661 3.14R S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% 141 30.0 134.000 84.140 0.952 49.860 4.26R 151 30.0 114.000 84.140 0.952 29.860 2.55R 161 33.0 128.000 93.438 1.136 34.562 2.96R Analysis of Variance 171 33.0 126.000 93.438 1.136 32.562 2.79R 183 32.0 59.000 90.339 1.071 -31.339 -2.68R Source DF SS MS F P 199 29.0 48.000 81.040 0.901 -33.040 -2.82R Regression 1 164091 164091 1192.37 0.000 202 25.0 35.000 68.643 0.768 -33.643 -2.87R Residual Error 238 32753 138 235 4.0 14.000 3.554 1.913 10.446 0.90 X Total 239 196844 236 4.0 7.000 3.554 1.913 3.446 0.30 X R denotes an observation with a large standardized residual (>2) X denotes an observation whose X value gives it large leverage
  • 19. Unusual Observations Unusual Observations Unusual Observations Obs Staff CallsAnswd Fit SE Fit Residual St Resid Obs Staff CallsAnswd Fit SE Fit Residual St Resid 1 3.0 6.000 0.454 1.995 5.546 0.48 X 1 3.0 6.000 0.454 1.995 5.546 0.48 X 3 4.0 7.000 3.554 1.913 3.446 0.30 X 3 4.0 7.000 3.554 1.913 3.446 0.30 X 6 4.0 11.000 3.554 1.913 7.446 0.64 X 6 4.0 11.000 3.554 1.913 7.446 0.64 X 7 4.0 12.000 3.554 1.913 8.446 0.73 X 7 4.0 12.000 3.554 1.913 8.446 0.73 X 8 3.0 5.000 0.454 1.995 4.546 0.39 X 8 3.0 5.000 0.454 1.995 4.546 0.39 X 9 3.0 7.000 0.454 1.995 6.546 0.57 X 9 3.0 7.000 0.454 1.995 6.546 0.57 X 67 29.0 109.000 81.040 0.901 27.960 2.39R 67 29.0 109.000 81.040 0.901 27.960 2.39R 78 32.0 127.000 90.339 1.071 36.661 3.14R 78 32.0 127.000 90.339 1.071 36.661 3.14R 141 30.0 134.000 84.140 0.952 49.860 4.26R 141 30.0 134.000 84.140 0.952 49.860 4.26R 151 30.0 114.000 84.140 0.952 29.860 2.55R 151 30.0 114.000 84.140 0.952 29.860 2.55R 161 33.0 128.000 93.438 1.136 34.562 2.96R 161 33.0 128.000 93.438 1.136 34.562 2.96R 171 33.0 126.000 93.438 1.136 32.562 2.79R 171 33.0 126.000 93.438 1.136 32.562 2.79R 183 32.0 59.000 90.339 1.071 -31.339 -2.68R 183 32.0 59.000 90.339 1.071 -31.339 -2.68R 199 29.0 48.000 81.040 0.901 -33.040 -2.82R 199 29.0 48.000 81.040 0.901 -33.040 -2.82R 202 25.0 35.000 68.643 0.768 -33.643 -2.87R 202 25.0 35.000 68.643 0.768 -33.643 -2.87R 235 4.0 14.000 3.554 1.913 10.446 0.90 X 235 4.0 14.000 3.554 1.913 10.446 0.90 X 236 4.0 7.000 3.554 1.913 3.446 0.30 X 236 4.0 7.000 3.554 1.913 3.446 0.30 X R denotes an observation with a large standardized residual (>2) X denotes an observation whose X value gives it large leverage 240 Data points Unusual observation 4% = 9 points Residual values (R) ≤ 5% of total observation
  • 20. Original Vs Altered Original Graph Altered Graph
  • 21. Interpreting The Model Regression Analysis: CallsAnswd versus Staff The regression equation is CallsAnswd = - 8.84 + 3.10 Staff Predictor Coef SE Coef T P Constant -8.844 2.247 -3.94 0.000 Predictor Table Staff 3.09947 0.08976 34.53 0.000 S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% Additional Statistics Analysis of Variance Source DF SS MS F P Regression 1 164091 164091 1192.37 0.000 ANOVA Table Residual Error 238 32753 138 Total 239 196844
  • 22. Interpreting The Model Regression Analysis: CallsAnswd versus Staff The regression equation is CallsAnswd = - 8.84 + 3.10 Staff Predictor Table Predictor Coef SE Coef T P Constant -8.844 2.247 -3.94 0.000 Staff 3.09947 0.08976 34.53 0.000 Ho: Slope = 0 •No difference in Y when X changes S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% •X has no impact on y Analysis of Variance Ha: Slope ≠ 0 Source DF SS MS F P Regression 1 164091 164091 1192.37 0.000 •Y changes as X changes Residual Error 238 32753 138 •X has impact on y Total 239 196844
  • 23. Interpreting The Model Regression Analysis: CallsAnswd versus Staff The regression equation is CallsAnswd = - 8.84 + 3.10 Staff Additional Statistics Predictor Coef SE Coef T P Constant -8.844 2.247 -3.94 0.000 R-squared Staff 3.09947 0.08976 34.53 0.000 •The amount of variation explained by this variation S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% R-squared (Adjusted) •Account for the no. of X’s used in model (Used in multiple Analysis of Variance regression) Source DF SS MS F P S Regression 1 164091 164091 1192.37 0.000 Residual Error 238 32753 138 •Standard deviation for the Total 239 196844 leftover or unexplained variation. It is used for designing the confidence interval
  • 24. Interpreting The Model Regression Analysis: CallsAnswd versus Staff The regression equation is CallsAnswd = - 8.84 + 3.10 Staff ANOVA Table Predictor Coef SE Coef T P Constant -8.844 2.247 -3.94 0.000 Regression Staff 3.09947 0.08976 34.53 0.000 •The explained variation S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% Residual Error •Unexplained Variation Analysis of Variance Source DF SS MS F P Ho: Regression 1 164091 164091 1192.37 0.000 •The model does not explain the Residual Error 238 32753 138 observed variation Total 239 196844 Ha: •The model does explain the observed variation
  • 25. Confidence on Output CallsAnswd = -8.844 + 3.099 Staff If Staff 10 then, - 8.844 + 3.099 (10) = 22 If Staff 20 then, - 8.844 + 3.099 (20) = 53 If Staff 30 then, - 8.844 + 3.099 (30) = 84 Confidence = ???% R-squared = 83.4% Report result using Unexplained Variation = 16.6% confidence intervals
  • 26. Confidence on Output 95% confidence the true regression line lies within 95% prediction interval expect 95% of the data points to fall within
  • 27. Confidence on Output Confidence interval report average performance Prediction intervals predict actual values
  • 28. Predicted Value for Y Let management decide max 20 staff, then how may call answer by these 20 executive Predicted Values for New Observations New Obs Fit SE Fit 95% CI 95% PI 1 53.145 0.822 (51.526, 54.765) (29.979, 76.312) Values of Predictors for New Observations New Obs Staff 1 20.0
  • 29. Review Methodology 1. Get Familiar with the data 3. Check the model & the assumption • Identify the output or Y variable • Look at the residual plot • Identify the X variable or predictor  Are assumption valid? • Look at the time series, dot plots, histogram  Do residual look ok? and scatter plot  Are there unusual observation? If yes, • Run descriptive statistics address, if possible and rerun the analysis  Check for outliers • What is r-squared?  Check for gaps in the data • How much variation can be explained by the model? • Look at P –values  Does X have significant impact on Y? 2. Fit the model to the data 4. Report Result & Use Equation • Run regression • Summarize the results for your stakeholders • Use the regression • If you have excluded data, explain why • If you kept outlier in the data, explain why • How much variation in Y can be explained by X? • Remember causation vs. correlation • Make a predictions using confidence and prediction intervals.
  翻译: