560 likes | 747 Views
Data and modeling issues in population biology. Alan Hastings (UC Davis) and many colalborators Acknowledge support from NSF. Goals. Understand ecological principles or determine which processes are operating E.g., How important is competition? Make predictions
E N D
Data and modeling issues in population biology • Alan Hastings (UC Davis) and many colalborators • Acknowledge support from NSF
Goals • Understand ecological principles or determine which processes are operating • E.g., How important is competition? • Make predictions • What will the population of a species be in the future? • Management • Fisheries • Infectious diseases • Invasive species • Endangered species
Time series • Population biology typically follows populations through time (and sometimes space) • Data is of varying qualities Limited extent Measurement error Often cannot go back and get more data
Time series of total weekly measles notifications for 60 towns and cities in England and Wales, for the period 1944 to 1994; the vertical blue line represents the onset of mass vaccination around 1968. (Levin, Grenfell, Hastings, Perelson, Science 1997)
Purposes of time series analysis • Parameter estimation • For a ‘known model’, estimate the parameters • Determine importance of biological factors operating • Model identification • The? model with the ‘highest likelihood’ is chosen as the model that describes the system • Prediction • Management
Underlying modeling issue • Mechanistic models versus using general models • Linear time series analysis • Less of an issue • Nonlinear time series • Use general model • Use specific model • Use ‘mixed’ model
Modelling approaches • General model • E.g. cubic splines • Mechanistic model • E.g., SIR model • Mixed • TSIR • Know that a single infection removes a single susceptible, and know dynamics of I to R whereas S to I is more problematic
How ‘noise’ enters • Process noise • Environmental variability • Role of species or factors not included • Demographic factors • How mechanistic should this be? • Other species, or environment • Measurement error • How good are population estimates? • How mechanistic should this be?
True population at time t True population at time t + 1 Dynamics + ‘noise’ Observation process, possibly with error Observed population at time t Observed population at time t + 1 The time series
Use Kalman Filter Linear Dynamics + ‘noise’ True population at time t True population at time t + 1 Linear Observation process, possibly with error Observed population at time t Observed population at time t + 1
Observation error only Use model to generate the whole time series, minimize difference between every observation and every prediction True population at time t True population at time t + 1 Dynamics + ‘noise’ Observation process, possibly with error Observed population at time t Observed population at time t + 1
Process error only True population at time t True population at time t + 1 Dynamics + ‘noise’ Observation process, possibly with error Observed population at time t Observed population at time t + 1
Process error only True population at time t Observed = true population at time t + 1 Dynamics + ‘noise’ Noise Dynamics only Make one step ahead predictions only Minimize difference between one step ahead and observation Predicted population at time t + 1
Resample from ‘noise’ to demonstrate that observed dynamics result – essentially continual transients
Conclusions • Much more work needed • Mechanistic models can be used • Using mechanistic models can be important in highlighting ecological processes
Definition of state space model ‘true’ population dynamics noise noise Observation process Observed population
The time series True population at time t True population at time t + 1 Dynamics + ‘noise’ Observation process, possibly with error Observed population at time t Observed population at time t + 1
Likelihood is defined as probability of observation parameters Likelihood Probability defined iteratively Superscripts on y’s mean observations up to and including that time, subscripts denote observations only at that time
Begin iterative calculation of likelihood • Probability of first observation is found by summing the probabilities of all possible first states times probability of observation given state • Then adjust distribution of states to reflect first observation
The time series True population at time t True population at time t + 1 Dynamics + ‘noise’ Observation process, possibly with error Observed population at time t Observed population at time t + 1
Change all computations to computations of pdf’s • Omit details
Use Beverton-Holt and Ricker models with process noise linear on log scale
Assume observation noise is linear Noise structure could be more general
Dynamics and fitting • Beverton-Holt • Always stable • Use one set of parameter values • Ricker • Period doubles, etc • Stable equilibrium • Two cycle • Four cycle
Process noise and observation error combinations • Large process noise, small observation error • Small process noise, large observation error • Large process noise, large observation error • Generate 300 time series of length 20 for each of the 12 cases (3 error structures by 4 model structures)
Parameter estimation • For each case, use each method • NISS should handle large noise • LSPN (least squares process noise) • LSOE (least squares observation error)
Estimate of growth rate when data generated by one model, fit by anotherTop row, generated by BH, fit by Ricker, bottom row is reverse
Now, model identification • Generate data with either model, see which model has the highest likelihood
correct incorrect ‘true’ model incorrect correct