400 likes | 648 Views
p’ = p ( pW AA + qW AS ) /. W. The Biological ESTEEM Project: Linear Algebra, Population Genetics, and Microsoft Excel. Anton E. Weisstein, Truman State University. BIO 2010: Transforming Undergraduate Education for Future Research Biologists National Research Council (2003).
E N D
p’ = p (pWAA + qWAS) / W The Biological ESTEEM Project:Linear Algebra, Population Genetics, and Microsoft Excel Anton E. Weisstein, Truman State University
BIO 2010:Transforming Undergraduate Education for Future Research BiologistsNational Research Council (2003) Recommendation #1: “Those selecting the new approaches should consider theimportance of mathematics...” Recommendation #2: “Concepts, examples, and techniquesfrom mathematics…should be included in biology courses. …Faculty in biology, mathematics, and physical sciences must work collaboratively to find ways ofintegrating mathematics…into life science courses…”
BIO 2010:Transforming Undergraduate Education for Future Research BiologistsNational Research Council (2003) Specific strategies: • A strong interdisciplinary curriculum that includes physical science, information technology, and math. • Meaningful laboratory experiences.
BiologicalTopics Spread of infectious diseases Tree growth Enzyme kinetics Population genetics
MathematicalTopics Graph theory Random walks Linear algebra Optimi-zation
Unpacking “ESTEEM” • Excel: ubiquitous, easy, flexible, non-intimidating • Exploratory: apply to real-world data; extend & improve • Experiential: students engage directly with the math
y = axb y = axb ? Black box: Hide the model Glass box: Study the model No box: Build the model! Three Boxes How do students interact with the mathematical model underlying the biology?
the software, w/proper attribution Users may freely Copyleft • download • use • modify • share More info available at Free Software Foundation website
Synthesizing and Applying Math Concepts Using Biological Cases 3. Survival of the Slightly Better: Exploring an Evolutionary Paradox with Linear Algebra 1. Intro to Population Genetics: Hardy-Weinberg Equilibrium and the Binomial Theorem 2. Evolutionary Analysis: Microevolution, Statistics, and Stability Analysis
IAIA Type A IA IAIB Type AB IAi Type A i IB IBi Type B ii Type O IBIB Type B Definitions Allele: One variant of a specific gene. Genotype: The set of alleles carried by an individual. Phenotype: The detectable manifestations of a specific genotype. Example: ABO blood type
Gametes (eggs & sperm) Life Cycle Adults (reproductively mature) Juveniles (reproductively immature) Zygotes (fertilized eggs)
Gametes (eggs & sperm) Life Cycle Adults (reproductively mature) Juveniles (reproductively immature) Zygotes (fertilized eggs)
Gametes (eggs & sperm) Life Cycle Adults (reproductively mature) Juveniles (reproductively immature) Zygotes (fertilized eggs)
Gametes (eggs & sperm) Life Cycle Adults (reproductively mature) Juveniles (reproductively immature) Zygotes (fertilized eggs)
Recursion Equations Let x = # AA adults; y = # Aa adults; z = # aa adults. Define p = # A gametes = x + y/2 ; q = # a gametes = y/2 + z . Determine expected # adults of each genotype in next generation. (For now, feel free to make any simplifying assumptions.)
Hardy-Weinberg Equilibrium Genotypes reach ratios p2 : 2pq : q2 in one generation, then stay there forever! Assumptions? • Gametes combine at random • All individuals have equal chance of survival • Each gen. a perfectly representative sample of the previous
Synthesizing and Applying Math Concepts Using Biological Cases 3. Survival of the Slightly Better: Exploring an Evolutionary Paradox with Linear Algebra 1. Intro to Population Genetics: Hardy-Weinberg Equilibrium and the Binomial Theorem 2. Evolutionary Analysis: Microevolution, Statistics, and Stability Analysis
The Case of the Sickled Cell • The S allele for sickle-cell anemia has a frequency of ~11% in some African populations. • Why is it so common? • If it provides a selective advantage, why isn’t its frequency 100%?
Definitions Reproductive fitness: The average number of offspring produced by an organism in a specific environment. Natural selection: An evolutionary mechanism that tends to increase the freq. of traits that increase an organism’s fitness. Examples: • Antibiotic resistance • Camouflage • Resistance to infectious diseases Source: Jeffrey Jeffords, DiveGallery.com
Selection and Sickle-Cell Alleles: A: “normal” hemoglobin S: sickle-cell hemoglobin Natural selection: Malaria susceptibility: ~90% survive to reproductive age Sickle-cell anemia: ~20% survive to reproductive age
Zygote p2 2pq q2 Juvenile p2 2pq q2 Adult p2WAA 2pqWAS q2WSS Normalization: W = p2WAA + 2pqWAS + q2WSS W W W p’ = p (pWAA + qWAS) / W Recursion Equations p = # A gametes; q = # S gametes. Life stage AA (W = 0.9) AS (W = 1.0) SS (W = 0.2)
p’ = p (pWAA + qWAS) / W Selection and Sickle-Cell Biological Question: • How will this population evolve over time? Mathematical Question: What are the equilibria for this recursion equation?
Solving for Equilibria Set p’ = p and solve: or Substitute q = 1 – p and factor: or or Nontrivial solution:
Stability Analysis:NatSelDiffEqns (Tim Comar, Benedictine College) Is q = 0.11 stable or unstable?
The Case of the Protective Protein • HIV docks with the CCR5 surface protein present on some cells of immune system • CCR5 32 allele partially protects against HIV infection Peterson 1999. JYI 2: ?
The Case of the Protective Protein • Based on genetic evidence, 32 arose ~700 years ago. • Present in ~10% of Caucasians; largely absent in other groups. Why? Hypothesis: May also have protected vs. plague and/or smallpox. Biological Question: How much selective advantage must 32 have given to become so common in only 700 years? Mathematical Question: For what fitness values does 700 years lie within the 95% CI of 32’s age?
Definitions Genetic drift: An evolutionary mechanism by which allele frequencies change due to chance alone, independent of those alleles’ effects on fitness. Examples: • Absence of blood type B in Native Americans • Northern elephant seal: virtually no genetic variation 100 years after near-extinction
pq 2N 1 2N ≈ N(p, ) p’ = B(2N, p) Modeling Genetic Drift Let N = population size (constant). Assume this pop. produces ∞ gametes: f(A) = p, f(B) =q . But only 2N of those gametes (chosen at random) combine to form the zygotes that develop into the next generation!
N = 2000 N = 200 N = 20 pq 2N 1 2N ≈ N(p, ) p’ = B(2N, p) Genetic Drift as a Random Walk • Largest fluctuations in small pops. • p = 0 and p = 1 are absorbing states
Synthesizing and Applying Math Concepts Using Biological Cases 3. Survival of the Slightly Better: Exploring an Evolutionary Paradox with Linear Algebra 1. Intro to Population Genetics: Hardy-Weinberg Equilibrium and the Binomial Theorem 2. Evolutionary Analysis: Microevolution, Statistics, and Stability Analysis
Sickle Cell Strikes Back! • In addition to the A and S alleles, there is also a C allele for hemoglobin! • C confers even stronger malaria resistance than AS but with no anemia! • But C is found only in a few isolated populations. Why might this happen? Extend previous analysis to 3 alleles: some surprising results!
Selection and Sickle-Cell Hemoglobin alleles: A, S, C Malaria susceptibility Malaria susceptibility Sickle-cell anemia Mild anemia Strong malaria resistance C is beneficial only when common!
p’ = p (pWAA + qWAS + rWAC) / q’ = q (pWAS + qWSS + rWSC) / r’ = r (pWAC + qWSC + rWCC) / W W W p = DA / D, q = DS / D, r = DC / D Selection and Sickle-Cell Recursion Equations: Equilibria: where DA = (WAS – WSS)(WAC – WCC) – (WAS – WSC)(WAC – WSC) DS = (WAS – WAA)(WSC – WCC) – (WAS – WAC)(WSC – WAC) DC = (WAC – WAA)(WSC – WSS) – (WAC – WAS)(WSC – WAS) D = DA + DS + DC
2 alleles: Landscape W(p) is a curve in R2 3 alleles: Landscape W(p, q, r) is a sheet in R3 Plotting the Adaptive Landscape Constraint: p + q + r = 1
where Stability Analysis Re-express W(p, q, r) as W(x, y) Calculate Hessian matrix: 3. Take the determinant and apply the 2nd derivative test:
Local maximum: C allele eliminated Saddle point Global maximum: only C allele present Survival of the Slightly Better:DeFinetti
Cases & Mathematics:Explicit Connections • Binomial & Normal Distributions • Combinatorics • Equilibria & Stability Analysis • Normalization • Recursion & Difference Eqns. • Stochasticity • Geometry of Curves & Solids • Matrix & Linear Algebra • Partial Derivatives