Estimating the accuracy of the approximation (surrogate)

Estimating the accuracy of the approximation (surrogate) • From assumption that error is due to normally distributed uncorrelated random variables, get estimate to error standard deviation (called standard error) • Standard measure of accuracy • Coefficient of multiple determination measures how much of variability in data is captured by approximation • Adjusted coefficient of multiple determination accounts for the fitting bias

Curve fit • noise=randn(1,30); x=1:1:30; y=x+noise • 3.908 2.825 4.379 2.942 4.5314 5.7275 8.098 …………………………………25.84 27.47 27.00 30.96 • [p,s]=polyfit(x,y,1); yfit=polyval(p,x); plot(x,y,'+',x,x,'r',x,yfit,'b') With dense data, functional form is clear. Fit serves to filter out noise

Example with y=0.1*x • noise=randn(1,30); x=1:1:30; y=0.1*x+noise; • xx=[ones(30,1),x']; [B,BINT,R,RINT,STATS] = regress(y',xx) • Stat 0.3016 12.0896 0.0017 1.7498

Estimating error in coefficients • Some coefficients are more accurately estimated than others • Standard error in coefficient is • t-statistic is ratio of coefficient to standard error, would like it to be at least 2 • Coefficients that are poorly estimated may be dropped to improve accuracy of predictions • Dropping one coefficients changes t-statistics for others • Need to iterate in dropping and adding coefficients

Regression in Excel (add-in data analysis)

Regression output

Output with y=0.1x

Example 3.2.1 • Given data • Use Microsoft Excel to fit linear and quadratic polynomials • Compare standard errors and t-statistics of coefficients

Linear fit

Quadratic fit

Graphical comparison .

Cross validation • Error estimates based on model assumptions are vulnerable • For polynomial response surface approximations assumptions are rarely satisfied • Cross validation divides data into ng groups • Fit the approximation to ng -1 groups, and use last group to estimate error. Repeat for each group • When each group consists of one point, error called PRESS (prediction error sum of squares) • Calculate error at each point and then presenting r.m.s error • Can be shown that • Can be used only if not ill-conditioned

Questions • The pairs (0,0), (1,1), (2,1) represent strain (millistrains) and stress (ksi) measurements. • Estimate Young modulus using the three commonly used error norms. • Estimate the error in Young modulus using cross validation

Estimating the accuracy of the approximation (surrogate)

Estimating the accuracy of the approximation (surrogate)

Presentation Transcript

Effective Planning for Health Care Decision-Making at the End of Life American Bar Association Revised 2007

Estimating the cost to Helmert heights of a lateral approximation of the topographical density distribution

The Quasiharmonic Approximation

Statistical issues in the validation of surrogate endpoints

The role of the form tutor

INCLUDING UNCERTAINTY MODELS FOR SURROGATE BASED GLOBAL DESIGN OPTIMIZATION The EGO algorithm

Two objectives of surrogate fitting

The Normal Approximation

The future of voice in Africa

Surrogate Endpoints: The Challenges are Greater than they Seem March 7, 2005

Accuracy of the Relativistic Distorted-Wave Approximation (RDW) A. D. Stauffer York University

The TSP : NP-Completeness Approximation and Hardness of Approximation

Estimating the age of

Get in the Game!

The Individualized Education Program: The Mantra of Special Education

The Normal Approximation

Surrogacy Nepal journal ethics in the world

Cost of Surrogate Mothers in India-Surrogate Mothers

The Life Style of Surrogate Mothers

The Life Style of Surrogate Mothers

Hit by the machine to achieve the desired accuracy in estimating the mean

Everything To Know About The Screening Process Of Surrogate Mothers