530 likes | 692 Views
Geogg124: Data assimilation. P. Lewis. What is Data Assimilation?. Optimal merging of models and data Models Expression of current understanding about process E.g. terrestrial C model Data Observations E.g. EO. Some basic stats. Gaussian PDF:. The uncertainty matrix.
E N D
Geogg124: Data assimilation P. Lewis
What is Data Assimilation? • Optimal merging of models and data • Models • Expression of current understanding about process • E.g. terrestrial C model • Data • Observations • E.g. EO
Some basic stats • Gaussian PDF:
Combining probabilities • Bayes theorum
Observation operator • Model of y: • The PDF of the observations is the PDF of the observations givenx • The observation PDF
Using Bayes theorum • So … we can improve on a prior estimate (xb) by multiplying by the observation PDF
Differentials of J • Minimisation: • Need to know the derivatives • J’ : Jacobian • Uncertainty: • Posterior uncertainty: • Curvature of cost function • Inverse of 2nd O derivative of J, J’’ • Hessian
Posterior uncertainty • Follows from previous that (for linear case)
Univariate example • If H=I (identity operator) • Univariate: • Equal variances:
Optimal estimate • Solve for univariate case: • Samples (xa, xb) (sigmaa, sigmab):
Finding solutions • DA methods • Variational (strong, weak constraint) • Sequential • MCMC (briefly)
Variational data assimilation: strong constraint • Cost function minimisation • Multiple obs constraints • Plus prior • Strong constraint: • Prediction must be a valid model state • ‘trust the model’
Variational data assimilation: weak constraint • Sometime want model as a ‘guide’ but don’t trust it. • So define model uncertainty • Use appropriate cost function for model • E.g. model
Dx: ~equivalent to convolution with Laplace function: smoother
Update step: 1 Calculate the residual (‘innovation’) Innovation uncertainty
Update step: 2 • Optimal Kalman gain
Variants • Extended Kalmanfilter • deal with non-linearity by linearsing the operators • Particle filter • Instead of linearising the operators, sample the distributions using Monte Carlo methods • Ensemble Kalmanfilter • Deal with non-linearities by running an ensemble of sample trajectories through the model. • MCMC • Use Markov chains to sample space. Clever rules such as MH. • MH: propose update of x->x’. Measure improvmenta • Accept is a>1or use a as probability of acceptance else
Applications • Carbon Cycle Data Assimilation System • DA system around BETHY model (+adjoint) • Deal with e.g. atmospheric conc. data. Through coupled atmospheric tracer model
EnKF of surface reflectance data into an ecosystem model • CCDAS has simple interface to observations • But difficult to make products (fAPAR) & use in model consistent • Difficult to track noise in satellite product • So, Quaife et al. (2008) attempt to link model state (LAI) to EO surface reflectance • using NL obs. op.
DA • EnKF • Used to affect leaf C (LAI) • EO data fn of ‘ancillary’ terms • Leaf chl., water etc. • Assumed constant (no uncertainty)
15 65 gC/m2/year Mean NEP for 2000-2002 Flux Tower 1 4.5 km Spatial average = 50.9 Std. dev. = 9.7 (gC/m2/year)
EO-LDAS • Earth Observation Land Data Assimilation • Focus on EO • Simple models only (regularisers, Dx, D2x …) • Observation operator • Semi-discrete (Gobron) + soil + ~PROSPECT • + adjoint!
Simple DA: weak prior + MERIS observation Reduces uncertainty Models observation well Solve for 7 biophysical parameters here Uncertainty on most are high From single meris observation
Predict MODIS from MERIS parameters Poor performance in extrapolation Due to correlations and large uncertainties