Towards a Decision-Centric Framework for Uncertainty Propagation and Data Assimilation

Towards a Decision-Centric Framework for Uncertainty Propagation and Data Assimilation Gabriel A. Terejanu Doctoral Dissertation Defense Department of Computer Science & Engineering University at Buffalo Committee Dr. Peter D. Scott (Chair) Dr. Jason J. Corso Dr. PuneetSingla

Introduction and Motivation for a Decision Centric Framework for Uncertainty Propagation Aleatory Uncertainty Propagation based on an Adaptive Gaussian Mixture Model Decision Centric Resource Allocation for Gaussian Mixture Models Hybrid Aleatory-Epistemic Uncertainty Propagation Conclusions and Future Work

Motivation Chernobyl disaster – Ukraine, April 26th 1986 Highly radioactive fallout plume drifted over large parts of Europe and Soviet Union. Chemical Biological Radiological Nuclear • Decision Makers • require accurate and relevant predictions of toxic cloud evolution in order to • deploy emergency responders • evacuate cities • shelter population • cache medical gear Source: Wikipedia.org

Example Mathematical Model Mathematical Model: puff dispersion model • Process Model • Initial Conditions • Measurement Model Source: SCIPUFF + Dipole Pride 26 [Terejanu08] INTEGRITY (Intrinsic) - Producer RELEVANCE (Extrinsic) - Consumer Inaccurate & Unreliable information is largely irrelevant to the decision maker “like beauty, what is truly information is largely in the eyes of the beholder” [Endsley01]

PROPOSAL Decision-Centric Framework FORECAST PRODUCER FORECAST CONSUMER/USER General Use Algorithms Specific Use INTEGRITY RELEVANCE Reconcile the two views into a Decision-Centric Framework which provides both a more accurate and relevant approximation to the uncertainty propagation.

Types of Uncertainty Stochastic Difference Equations (for Discrete-time dynamics) Stochastic Differential Equations (for Continuous-time dynamics) Aleatory Uncertainty: characterized by randomness with known probability distributions. Discrete-time measurement model Uncertain initial condition (example) Epistemic Uncertainty: used to model ignorance. Is the parameter in discussion random or not? If it is random what is the underlying probability distribution ? [Ferson96] Hybrid Behavior = Aleatory Uncertainty + Epistemic Uncertainty

Example Uncertainty Propagation • Model parameters are all known – only aleatory uncertainty • Model parameters are uncertain (epistemic uncertainty) • infinite sets of pdfs

1 Global View REALITY Fundamental Laws Qualitative Model based on sciences (no equations) Mathematical Model (no values for parameters) System Sensors Numerical Solution Model Conceptualization Verification 2 Noisy observations LABORATORY Output Calibrated Computational Model (estimated parameters and distributions) Validation System PRODUCER CONSUMER Sensors Calibration Verification and Validation The focus of the present dissertation 3 Approximate Uncertainty Propagation REALITY Noisy observations Validated Model System Data Assimilation Experts (parameters and initial conditions) Sensors Uncertainty Propagation and Data Assimilation 4 Approximate Probability Density Function Decision Maker Expected Utility Theory Utility Function Actions Decision Making

Proposed View Decision – Centric 3 Aleatory Uncertainty (eq. process noise, measurement noise) REALITY Novel aleatory uncertainty propagation algorithm using Gaussian Mixture Models Implementation of a parallel Particle Filter for Data Assimilation Application in a CBRN incident with Real Data Novel adaptive Gaussian Sum Filters for both discrete-time and continuous-time dynamical Derivations of the Unscented Kalman Smoother, both two-filter form and the RTS form, using the Weighted Statistical Linearization Novel decision-centric computational resource allocation algorithm for uncertainty propagation Novel epistemic uncertainty propagation using Polynomial Chaos expansion and the Bernstein Form Novel propagation of both epistemic and aleatory uncertainty through dynamical systems with stochastic forcing System Sensors Surrogate Model (eq. linearized model) Decision Sensitive Approximate Uncertainty Propagation Validated Model Noisy observations Data Assimilation Epistemic Uncertainty (parameters and initial conditions) Uncertainty Propagation Algorithm Decision-Centric Uncertainty Propagation and Data Assimilation Resource Allocation Mixed Aleatory and Epistemic Approximate Uncertainty Decision Making – Uncertainty Propagation Interaction Level 4 Ignorance Level Utility Function Constructed Probability Density Function & Ignorance Function Ignorance Averse Expected Utility Theory Decision Maker Utility Function Actions Decision Making

Exact Solution • For continuous-time dynamical systems: Fokker-Planck-Kolmogorovequation (FPKE) • Analytical solution exists only for stationary pdfs and they are restricted to a limited class of dynamical systems • For discrete-time dynamical systems: Chapman-Kolmogorov equation (CKE) • Difficult to solve and approximate in general nonlinear case • Positivity constraint • Normalization constraint • No fixed Solution Domain

Uncertainty Propagation Conventional Methods • Gaussian Closure Approach • Assume the underlying pdf to be Gaussian • Statistical linearization, Stochastic averaging • Monte Carlo approximation • Sample initial distribution • Propagate individual points through the exact nonlinear dynamics • Study the statistics of the propagated points

Gaussian Mixture Models • With sufficient number of Gaussian components, any continuous pdf may be approximated as closely as desired FPKE Linearized propagation True pdf + + = Component 1 w1 Component 2 w2 Component 3 w3 Approximation w1 + w2 + w3 = 1 • Assumption:covariances are small enough such that the linearizations become representative for the dynamics around the means [Sorenson71] • Easily violated: strong nonlinearities, computational constraints

Adaptive Gaussian Mixture Model • Pdf approximated by a finite sum of Gaussian densities • Exact solution given by FPKE • We want our Gaussian mixture to satisfy the FPKE • Compute FPKE error • Where:

Adaptive Gaussian Mixture Model cont. • Approximation – first forward difference FPKE Update • The error is linear in the new weights • New weights obtained by minimizing

Numerical Example • Quintic Oscillator • 11 Gaussian components • Coordinates of the means linearly spaced between [-5,-5] x [5,5] • Equal covariance matrices • Equally weighted • Propagation: 1000 sec • Stationary pdf:

Numerical Example cont. Conventional Exact Solution Adaptive GMM

Nonlinear Filtering for Data Assimilation Measurement Process noise noise True State Physical Physical Sensor Process Process Initial Guess of Measurement Process Model - the State Model Corrector (Bayes rule) • Filtering is the problem of finding the probability density function (pdf) of the states of a system at the current time given all the observations available up to the current time Reality Abstract

Extended Kalman Filter initial pdf forecast pdf posterior pdf EKF Propagation Step EKF Measurement Update

Classic Gaussian Sum Filter 0.3 0.3 0.3 0.5 0.5 0.3 0.2 0.2 0.4 =

Adaptive Gaussian Sum Filter 0.3 0.5 0.4 0.3 0.5 0.2 0.3 0.3 0.2 ≠ Update I: Continuous-time dynamical systems Updates the weights by constraining the Gaussian sum approximation to satisfy the Fokker-Planck equation Update II: Discrete-time nonlinear systems Weights to minimize the integral square difference between the true forecast pdf and its Gaussian sum approximation

Numerical Example • Lorenz system [Lorenz31] • Performance Measures

Numerical Results

Part I: Concluding Remarks • Two update schemes for the forecast weights are presented in order to obtain a better Gaussian sum approximation to the forecast pdf. • Continuous Dynamical Systems: minimize the FPKE error • Discrete Dynamical Systems: minimize the integral square difference between the true pdf and its approximation • Both methods result in a strictlyconvex quadratic programming problems - guaranteed to have a unique solution. • Functional representation of the pdf • Computational Complexity: • Useful in pure propagation, low SNR, low frequency of measurements, limited information provided by the measurement model (eq. quadratic model).

But … … even by improving the propagation method, chances are that we may still get a poor approximation to the forecast probability distribution where it matters in the decision making process. Eq. in the tails of the distribution. Why? Because we still have a finite representation.

Conventional Methodology… NO evacuation action Decision Maker CONSUMER PRODUCER 0.5 0.2 0.3 Uncertainty Propagation (Approximations) params 0.4 0.5 0.1 10,000 residents

Decision-Centric Approach EVACUATE action Decision Maker 10,000 residents Interaction Level (PSGC) 0.5 0.2 0.2 0.1 Uncertainty Propagation (Approximations) 0.0 params 0.4 0.5 0.1 0.0 10,000 residents

Decision-Centricvs Conventional Exact Solution Loss function Expected Loss FPKE Conventional Decision-centric Lin.prop. +weig.upd Lin.prop. +weig.upd

Progressive Selection of Gaussian Components Initial PDF • Sample the means of the new Gaussian components • and their covariance • Propagate moments until the decision time • Inflation coefficient based on the most distant component and inflate the loss function Td • New weights • (regression) Ti • Copy the weights

Progressive Selection of Gaussian Components Initial PDF • Sample the means of the new Gaussian components • and their covariance • Propagate moments until the decision time • Inflation coefficient based on the most distant component and inflate the loss function • New weights • (regression) Convergence – Evolution of the proposal pdf used to sample the means of the new Gaussian components • Copy the weights

Example TRUTH Uncertainty propagation: Loss function: EKF Decision time: Td = 8 sec Expected loss:

Results

CBRN scenario

Classic Reference Decision Centric

Part II: Concluding Remarks Interaction Level between decision maker and the prediction module – incorporate contextual information held by the decision maker Progressive Selection of Gaussian Components to supplement the initial uncertainty with new Gaussian components that are sensitive to the loss function at the decision time. The cost of the overall improvement is an increase in the number of Gaussian components. Significantly enhanced accuracy within the decision maker’s region of interest. The new probability density function addresses the region of interest and provides a better approximation overall, if any probability density mass is moving naturally towards the region of interest.

But … … the methods provided increase the accuracy and the relevance of the propagated probability density function when the prior, process noise and the model parameters are precisely known. This is also known as the Bayesian dogma of precision [Walley91]. These precise values are difficult to obtain in practice, due to the amount of information available, incomplete knowledge of the system or systematic underestimation of uncertainty which arises in the elicitation process.

Interval information [Ferson07] Plus-or-minus sign uncertainty reports Significant digits Intermittent measurements Non-detects / right censoring Data binning and rounding Missing data Gross ignorance

Theory of Evidence • Primitive function: the basic probability assignment (bpa) [Shafer76] • Frame of discernment • Power set • Focal element • Normalized belief structures

Theory of Evidence • Belief function, or the lower bound (total amount of support) • Plausibility function, or the upper bound (total amount of potential) • The precise probability • Degree of ignorance

Theory of Evidence • Pignistic transformationused once decisions have to be made [Smets90] 0.1 0.4 0.5 0.25 0.25 0.1 0.2 0.2 0.1 0.25 0.45 0.2 • Dempster’s rule of combination of two independent arguments

Objective Epistemic Uncertainty Aleatory Uncertainty Propagate both epistemic and aleatory uncertainty separately, and combine them only when a decision has to be made. Propose a two-level hierarchical framework. Dempster Shafer structure Level 2: Epistemic Uncertainty Level 1: Aleatory Uncertainty Precise probability distributions Dempster Shafer structures Certainty Total ignorance Lose information Assume more than known

DS structures on closed intervals • Probability boxes and DS structures on closed intervals [Ferson03] 1 4 m([1,4])=2/3 6 3 m([3,6])=1/3 Precise Probability Interval Uncertainty 0% NIDI = 60% 100%

Towards a Decision-Centric Framework for Uncertainty Propagation and Data Assimilation

Towards a Decision-Centric Framework for Uncertainty Propagation and Data Assimilation

Presentation Transcript

Object-oriented Software for Uncertainty Propagation

Decision-driven Framework for a National Transportation Data Program

Propagation of Uncertainty

Towards a Query Optimizer for Text-Centric Tasks

Ensemble Data Assimilation and Uncertainty Quantification

Data-Centric Security Framework

Uncertainty and Its Propagation Through Calculations

A High-Performance Framework for Earth Science Modeling and Data Assimilation

Towards a Framework for Organized Analysis

A Decision Framework for the

FLOOD PROPAGATION UNCERTAINTY

Towards a Query Optimizer for Text-Centric Tasks

A Data-Centric Framework for Mobile Target Tracking in Sensor Networks

Towards a Framework for QoE

WASTEMINZ Towards a national waste data framework

Propagation of Uncertainty: Multiplication and Division

A High-Performance Framework for Earth Science Modeling and Data Assimilation

Towards a Query Optimizer for Text-Centric Tasks

A unifying framework for hybrid data-assimilation schemes

Towards a Framework for Organized Analysis