240 likes | 346 Views
Prediction Intervals. Review: PROBE’s regression calculations give us estimates (projections) for size (A+M LOC)…. …and time. But just how good are these estimates???. Off by 5%, 10%, 50%, 100%, 500%? Does it matter? Do you want to bet: Your weekends ? Your reputation ? Your JOB ?.
E N D
Prediction Intervals SE-280Dr. Mark L. Hornick
Review: PROBE’s regression calculations give us estimates (projections) for size (A+M LOC)… SE-280Dr. Mark L. Hornick
…and time SE-280Dr. Mark L. Hornick
But just how good are these estimates??? Off by 5%, 10%, 50%, 100%, 500%? Does it matter? Do you want to bet: • Your weekends? • Your reputation? • Your JOB? SE-280Dr. Mark L. Hornick
Which of the following regression projections would you trust more? SE-280Dr. Mark L. Hornick
Example A10 data pointsCorrelation = 0.75 800 700 600 500 Actual Total LOC 400 300 200 100 0 0 100 200 300 400 Estimated Object LOC SE-280Dr. Mark L. Hornick
Example B25 data pointsCorrelation = 0.75 800 700 600 500 Actual Total LOC 400 300 200 100 0 0 100 200 300 400 Estimated Object LOC SE-280Dr. Mark L. Hornick
A Prediction Interval calculation computes the bounds on the likely error of an estimate UPI = estimated A+M LOC + Range LPI = estimated A+M LOC - Range UPI Range Projection (Estimate) Range LPI Strictly speaking, the UPI/LPI "lines" are parabolas, and Range varies.
If you had this kind of information about your estimates, how would you use it? 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 Suppose your time projection said that a project would take 8 weeks. But, your prediction interval has a range of 3 weeks. How should you make your plan? What should you tell management? SE-280Dr. Mark L. Hornick
3 3 If you had this kind of information about your estimates, how would you use it? 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 Suppose your time projection said that a project would take 8 weeks. But, your prediction interval has a range of 3 weeks. How should you make your plan? What should you tell management? SE-280Dr. Mark L. Hornick
6 6 If you had this kind of information about your estimates, how would you use it? 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 Suppose your time projection said that a project would take 8 weeks. What if the range was 6 weeks? How should you make your plan? What should you tell management? SE-280Dr. Mark L. Hornick
70% limits(area) The prediction interval is based on the t distribution. Regression-projected value Range Lower prediction interval limit (LPI) Upper prediction interval limit (UPI) SE-280Dr. Mark L. Hornick
Prediction Interval Usage • Range within which data is likely to fall • Assuming variation is this estimate is similar to that in prior estimates • PSP uses 70% and 90% limits • Computes range in which actual value will likely fall • 70% of the time • 90% of the time • Helps to assess planning quality SE-280Dr. Mark L. Hornick
To get the prediction interval, we must calculate the range: Text, page 128; may have error in formula (n instead of d), depending on textbook revision. Note: this is for one independent variable.
For multiple regression, the range calculation is just extended a little.
The s ("sigma") value is computed in the following way. xi,j = previous independent variable values yi = previous dependent variable (estimate) values n = number of previous estimates m = number of independent variables d = n-(m+1) [degrees of freedom] bj = regression coefficients calculated from previous data Same for one independent variable. Alternate form:
-t t The range formula requires us to find the integration limit that yields the correct integral value. Two-sided integral value = p For a 70% interval,we want p = 0.70 Question: what integration limit “t” gives this value? For a 90% interval,we want p = 0.90 We have to search (try t values) in order to find out.
When thinking about searching for the desired integral value, it may be helpful to plot the integral of the t-distribution function. Hint: create a "function object" that calculates the two-sided p integral, given a specified t value.
The calculation needed is the reverse of that used in the significance calculation, since we are seeking "t" instead of "p". p For a specified "p" (integral) value, we want to find the corresponding "t" (integration limit). t How should we do the search? SE-280Dr. Mark L. Hornick
In the significance calculation, we calculated "p" for a given "t"; now we are seeking "t" that will give us a desired "p". p For a specified "p" (2-sided integral) value, we want to find the corresponding "t" (integration limit). t How should we do the search?
The textbook's suggests a state-machine approach that requires the function to be monotonic (pg. 246). p t The sign of the error (desired versus actual "p") tells you whether to increase or decrease the trial "t" value to get closer to the desired answer. The increase/decrease step size is halved when changing search direction.
An alternative search method brackets the answer and bisects the interval.
How does the interval bisection method work? error (+) p error (-) t