850 likes | 1.06k Views
Reverse engineering of regulatory networks. Dirk Husmeier & Adriano Werhli. Systems biology Learning signalling pathways and regulatory networks from postgenomic data. Reverse Engineering of Regulatory Networks. Can we learn the network structure from postgenomic data themselves?
E N D
Reverse engineering of regulatory networks Dirk Husmeier & Adriano Werhli
Systems biology Learning signalling pathways and regulatory networks from postgenomic data
Reverse Engineering of Regulatory Networks • Can we learn the network structure from postgenomic data themselves? • Statistical methods to distinguish between • Direct correlations • Indirect correlations • Challenge: Distinguish between • Correlations • Causal interactions • Breaking symmetries with active interventions: • Gene knockouts (VIGs, RNAi)
Bayesian networks versus Graphical Gaussian models Directed versus undirected graphs Score based versus constrained based inference
Evaluation • On real experimental data, using the gold standard network from the literature • On synthetic data simulated from the gold-standard network
Evaluation: Raf signalling pathway • Cellular signalling network of 11 phosphorylated proteins and phospholipids in human immune systems cell • Deregulation carcinogenesis • Extensively studied in the literature gold standard network
Data • Laboratory data from cytometry experiments • Down-sampled to 100 measurements • Sample size indicative of microarray experiments
Evaluation • On real experimental data, using the gold standard network from the literature • On synthetic data simulated from the gold-standard network
Comparison with simulated data 2 Steady-state approximation
Evaluation 2: TP scores We set the threshold such that we obtained 5 spurious edges (5 FPs) and counted the corresponding number of true edges (TP count).
Conclusions 1 • BNs and GGMs outperform RNs, most notably on Gaussian data. • No significant difference between BNs and GGMs on observational data. • For interventional data, BNs clearly outperform GGMs and RNs, especially when taking the edge direction (DGE score) rather than just the skeleton (UGE score) into account.
Conclusions 2 Performance on synthetic data better than on real data: • Real data: more complex • Real interventions are not ideal • Errors in the gold-standard network
Reconstructing gene regulatory networks with Bayesian networks by combining microarray data with biological prior knowledge
Biological prior knowledge matrix Indicates some knowledge about the relationship between genes i and j Biological Prior Knowledge
Biological prior knowledge matrix Indicates some knowledge about the relationship between genes i and j Biological Prior Knowledge Define the energy of a Graph G
Energy of a network Prior distribution over networks
Energy of a network Rewriting the energy