980 likes | 1.17k Views
Dirk Husmeier. Inferring gene regulatory networks from transcriptomic profiles. Biomathematics & Statistics Scotland. Overview. Introduction Limitations Methodology Application to morphogenesis Application to synthetic biology. Objective: reverse engineering regulatory networks.
E N D
Dirk Husmeier Inferring gene regulatory networks from transcriptomic profiles Biomathematics & Statistics Scotland
Overview • Introduction • Limitations • Methodology • Application to morphogenesis • Application to synthetic biology
Objective: reverse engineering regulatory networks From Sachs et al Science 2005
Model Parameters q Probability theory Likelihood
Mechanistic model: description with differential equations Concentrations Kinetic parameters q Rates
Model Parameters q Probability theory Likelihood
1) Practical problem: numerical optimization q 2) Conceptual problem: overfitting ML estimate increases on increasing the network complexity
Overfitting problem True pathway Poorer fit to the data Equal or better fit to the data Poorer fit to the data
Regularization E.g.: Bayesian information criterion (BIC) Regularization term Data misfit term Maximum likelihood parameters Number of parameters Number of data points
Likelihood BIC Complexity Complexity
Model selection: find the best pathway Select the model with the highest posterior probability: This requires an integration over the whole parameter space:
MCMC based schemes q Problem: excessive computational costs
Accuracy Mechanistic models DynamicBayesian networks Computational complexity
Marriage between graph theory and probability theory Friedman et al. (2000), J. Comp. Biol. 7, 601-620
Bayes net ODE model
Model Parameters q Bayesian networks: integral analytically tractable!
Identify the best network structure Ideal scenario: Large data sets, low noise
Uncertainty about the best network structure Limited number of experimental replications, high noise
Sample of high-scoring networks Feature extraction, e.g. marginal posterior probabilities of the edges
Sample of high-scoring networks Feature extraction, e.g. marginal posterior probabilities of the edges Uncertainty about edges High-confident edge High-confident non-edge
Sampling with MCMC Number of structures Number of nodes
Overview • Introduction • Limitations • Methodology • Application to morphogenesis • Application to synthetic biology
Model Parameters q Bayesian networks: integral analytically tractable!
Homogeneity assumption Parameters don’t change with time
Homogeneity assumption Parameters don’t change with time
Overview • Introduction • Limitations • Methodology • Application to morphogenesis • Application to synthetic biology
Our new model: heterogeneous dynamic Bayesian network. Here: 2 components
Changepoint model Parameters can change with time
Changepoint model Parameters can change with time
Our new model: heterogeneous dynamic Bayesian network. Here: 2 components
Our new model: heterogeneous dynamic Bayesian network. Here: 3 components
Extension of the model q Allocation vector h k Number of components (here: 3)
Analytically integrate out the parameters q Allocation vector h k Number of components (here: 3)
RJMCMC within Gibbs P(network structure | changepoints, data) P(changepoints | network structure, data) Birth, death, and relocation moves
Model extension So far:non-stationarity in the regulatory process
Model Parameters q Use prior knowledge!