Dynamic modelling of microarray data .

Dynamic modelling of microarray data. Martino Barenco Institute of Child Health / UCL

Outline Goal: predict targets of a known transcription factor in a complex response using dynamic models and time course microarray data. HVDM: Hidden Variable Dynamic Modelling Principle + Results (Genome Biology 2006) Techniques (R/Bioconductor implementation: rHVDM)

Sj 36 Sj 36 Bj 010 Bj 010 Dj 10.1 Dj 10.1 Gene expression model Transcript concentration Xj(t): degradationrate transcription rates Xj(t) transcription factor activity Bj=0 Sj=3 Dj=1 f(t)

Algorithm Principle: Training step: • Inputs: • Previous biological knowledge: known targets of the • transcription factor • - Expression values of those targets Output: - Transcription factor activity (the hidden variable) - Kinetic parameters for the training genes Screening step (for each single gene): • Input: • - Transcription factor activity • Expression profile of the gene Output: - Dependency status of the gene: target or not?

Training step (j: training genes) dX ( t ) j B S f ( t ) D X ( t ) = + - j j j j dt Screening step (j: individual gene being screened) dX ( t ) j B S f ( t ) D X ( t ) = + - j j j j dt

The p53 network Jun myb G2/M Arrest DNA Damage CHK2 bcl2 Survival Active p53 Fas Pidd DR5 p53 Active ATM ATM Death Receptor Bax p53AIF Puma MDM2 p19Arf Mitochondrial Apoptosis E2F1 Jun-B p21 Rb 14-3-3 Rb/E2F1 Cell Cycle G1/S Arrest CDK4 p73

Experimental setup • Human T cells (MOLT4/p53 wild-type) submitted to 5Gy irradiation. • mRNA harvested 2,4,6,8,10,12 hours after irradiation, and just before (0 hrs time point). • Affymetrix microarrays (HG-U133) were then run. • Experiment was run in triplicates.

Results of training step: activity profile of p53

Screening • Q: what are the other genes that are p53 activated? • Putative p53 targets must both:a) Fit the model wellb) Have a sensitivity coefficient Sj>0

P21: part of training set CD38: Uncovered by screening

Verification experiment siRNA knock down of p53: HVDM predictions:

Ingredients needed 1) ODE integrator: + parameter values 2) Model fitting: Find set of parameter values s.t. 3) Want to take measurement noise into the data into account 4) Specifically for the Bioconductor implementation: be reasonably quick

Formal solution: ODE integration - Want to estimate slope of at t=6 - Slope=weighted sum of time points around t=6 - i.e. the ODE is turned into a system of linear equations

2) Model fitting • Start with a “random” set of parameters: • Compute a solution: • Compare with data using a merit function: • Vary p systematically until a minimum value for M(p) is reached.

Fitting algorithms: • Originally used simplex-based method (Nelder-Mead) (GB paper) • Followed by a MCMC step to determine confidence intervals (GB paper) • rHVDM (Bioconductor) uses Levenberg-Marquardt (gradient-based). • By-product is the Hessian, which allows to compute confidence intervals.

Difference between MCMC and LM confidence intervals.

Importance of confidence intervals • Biological data is inherently noisy. Don’t want to assume that measurement are exact. • example: • Genes with a flat profile would be a good fit to the equation (Sj=0) • Essential to identify these situations to detect targets of the transcription factor

Parameter count reduction / identifiability Replace f(t) with Solution: Let Sp21=1 (removes “a”’’ ambiguity) and f(0)=0(removes “b’’ ambiguity)  parameter count is reduced by 2

Solution measure one of the kinetic parameters independently, integrate that in the fitting: Confidence intervals importance II Initial fitting:

Measurement error Algorithmic speed Parameter identifiability Parameter count reduction Confidence intervals

Acknowledgements Sonia Shah (Bloomsbury Centre for Bioinformatics) Dan Brewer (Institute of Cancer Research) Crispin Miller (Patterson Institute for Cancer Research) Daniela Tomescu (ICH) Mike Hubank (ICH) Robin Callard (ICH) Jaroslav Stark (CISBIC, Imperial College)

Dynamic modelling of microarray data .

Dynamic modelling of microarray data .

Presentation Transcript

Microarray Data Analysis

Microarray Data Analysis

Normalization of Microarray Data

Analysis of microarray data

Normalisation of Microarray Data

MICROARRAY DATA

Dynamic Causal Modelling

Analysis of Microarray Data

Microarray data analysis

Classification of Microarray data

Modelling dynamic information

Analysis of Microarray Data

Dynamic Flow Modelling

Dynamic Modelling (1)

Dynamic Causal Modelling

Microarray Data Analysis

Classification of Microarray Data

Dynamic Causal Modelling

Modelling Dynamic Web Data