570 likes | 705 Views
Predicting Gene Expression using Logic Modeling and Optimization Abhimanyu Krishna. New Challenges in the European Area: Young Scientist’s 1st International Baku Forum. Input Stimuli. Input Stimuli. p. B. p. p. A. A. p. A. B. C. C.
E N D
Predicting Gene Expression using Logic Modeling and Optimization Abhimanyu Krishna New Challenges in the European Area: Young Scientist’s 1st International Baku Forum
Input Stimuli Input Stimuli p B p p A A p A B C C Whatis Gene Expression? -> Regulation? -> Gene Regulatory Network? Introduction: R A R C TRB TRC Gene Regulatory Network reconstruction
Objective How to contextualize literature to our experimental conditions + Experimental expression data Literaturebased Gene Regulatory Network Missing expression values in grey
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Introduction: Biologicalprocessesrepresented as transitions in a landscape Networks of interactions Unstabletransientstate Stablestate Stablestate 4
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Whythesepredictions are not trivial? Noisy network reconstruction process
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Problem: Inconsistencybetween network and experimental expression data Solution: Contextualize the Network usingexperimental expression data 6
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Whyisthis an optimizationproblem?
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Whyisthis an optimizationproblem? Local consistency
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Whyisthis an optimizationproblem? Edgeremoval Local consistency
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Whyisthis an optimizationproblem? Global consistency Local consistency
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Whichproperty are wegoingto use in theoptimization? Network stability Unstabletransientstate Stablestate Stablestate 11
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 12
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 14
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 15
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 16
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 17
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 18
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 19
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” But the contribution of interactions to the network stabilityitis not linearlyindependent. The evaluation of one specificlinkishighlydependent of the links alreadyremoved or, in otherwords, the order of removal. We are going to capture interdependenciesbetween variables consideringsequentiallyboth the probability distribution of positive circuits and separatededges. Positive circuits are necessary condition to have severalfixed points Thomas R, Thieffry D, Kaufman M: DYNAMICAL BEHAVIOR OF BIOLOGICAL REGULATORY NETWORKS .1. BIOLOGICAL ROLE OF FEEDBACK LOOPS AND PRACTICAL USE OF THE CONCEPT OF THE LOOP-CHARACTERISTIC STATE.Bulletin of Mathematical Biology 1995, 57:247-276. Positive circuit Negative circuit Positive circuit 20
Iterative network pruning Positive Circuit 1 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 21
Iterative network pruning Positive Circuit 2 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 22
Iterative network pruning Positive Circuit 3 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 23
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Whichproperty are wegoingto use in theoptimization? Network stability
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Biological scope targeted by thisapproach: transitions between long term expression patterns or stable states Epithelial Mesenchymal Example: Epithelial-mesenchymaltransition
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Computingattractors in a discretedynamical system (Boolean) Based on logicfunctions and the assumption of only 2 possible gene states: active (ON or 1) and inactive (OFF or 0). Logicfunctions: Types of attractors: fixed points and limit cycles Fixed point The state of the node xiat time t+1 depends on the state of itsregulatorsat time t. Updatingscheme: Synchronous Limit cycle 26
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Consistencybetweenexpression data and networkstablestates 27
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Iterative network pruning Optimization of h(x) (objective function) h(x) = X1+X2+X3+X4+X5+x6 Xi = 0 or 1 Network topologyoptimizedusing an Estimation of Distribution Algorithm (EDA) Toyexample:
EDA: toyexample Initial population Next population Top 10 solutions
EDA: toyexample Initial population Next population Top 10 solutions
EDA: toy example Initial population Next population Top 10 solutions
EDA: toy example Initial population Next population Top 10 solutions
EDA: toy example Initial population Next population Top 10 solutions 0.7
EDA: toy example Initial population Next population Top 10 solutions 0.7 0.7
EDA: toy example Initial population Next population Top 10 solutions 0.7 0.7 0.6
EDA: toy example Initial population Next population Top 10 solutions 0.7 0.7 0.6 0.6
EDA: toy example Initial population Next population Top 10 solutions 0.7 0.7 0.6 0.6 0.8
EDA: toy example Initial population Next population Top 10 solutions 0.7 0.7 0.6 0.6 0.8 0.7
EDA: toy example Initial population Next population Top 10 solutions 0.7 0.7 0.6 0.6 0.8 0.7
EDA: toy example Initial population Next population Top 10 solutions 0.7 0.7 0.6 0.6 0.8 0.7 STOP CRITERIA
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 41
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 43
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 44
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 45
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 46
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 47
Iterative network pruning Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 48
“Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” But the contribution of interactions to the network stabilityitis not linearlyindependent. The evaluation of one specificlinkishighlydependent of the links alreadyremoved or, in otherwords, the order of removal. We are going to capture interdependenciesbetween variables consideringsequentiallyboth the probability distribution of positive circuits and separatededges. Positive circuits are necessary condition to have severalfixed points Thomas R, Thieffry D, Kaufman M: DYNAMICAL BEHAVIOR OF BIOLOGICAL REGULATORY NETWORKS .1. BIOLOGICAL ROLE OF FEEDBACK LOOPS AND PRACTICAL USE OF THE CONCEPT OF THE LOOP-CHARACTERISTIC STATE.Bulletin of Mathematical Biology 1995, 57:247-276. Positive circuit Negative circuit Positive circuit 49
Iterative network pruning Positive Circuit 1 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) with 50