570 likes | 886 Views
Medical Genetics: Complex disorders. Lecturer: David Saffen. Ph.D. Laboratory for Molecular Neuropsychiatric Genetics Department of Cellular and Genetic Medicine School of Medicine, Fudan University saffen@fudan.edu.cn. Outline. Historical background Phenotypes in populations
E N D
Medical Genetics:Complex disorders Lecturer: David Saffen. Ph.D. Laboratory for Molecular Neuropsychiatric Genetics Department of Cellular and Genetic Medicine School of Medicine, Fudan University saffen@fudan.edu.cn
Outline • Historical background • Phenotypes in populations • Genes in populations • Mapping disease genes • E. Complex disorders
A.Historical background • Francis Galton: normal distributions of quantitative traits • Ronald A. Fisher: polygenic models for quantitative traits
Biometrics and Mendelian genetics Francis Galton was a pioneer in using statistical methods to quantify human traits and behaviors. For example, he recognized that the distribution of many traits such as height, weight, intelligence closely approximate the “normal” (aka “Gaussian”) distribution. He also recognized that inherited traits tended to move toward average values, a phenomenon he termed, “regression to the mean.” Most of Galton’s work on inheritance was carried out before the re-discovery of Mendel’s experiments. The paradigm under which Galton and other “biometricians” worked was that inheritance of human traits involved the mixing or blending of factors present in the parents. This picture is very different from that obtained from Mendel’s experiments, which implied that inherited traits are determined by discrete factors that remain unchanged from generation to generation. Sir Francis Galton 1822-1911 (English Victorian polymath; Cousin of Charles Darwin; biometician; eugenicist)
Normal distributions of quantitative traits and “regression to the mean” 45° slope Mean height SD = standard deviation
Unification of biometrics and Mendelian genetics RA Fisher was a 20th century genius who made fundamental contributions to the fields of statistics and biology. In statistics, he developed analysis of variance (ANOVA), the maximum likelihood method for estimating the values of parameters based on experimental data, permutation testing to estimate statistical significance (P-values), and exact tests for estimating statistical significance in small samples. In biology, Fisher (together with Sewall Wright and J.B.S. Haldane) is considered one of the founders of the “Modern Evolutionary Synthesis,” which unified Darwin’s theory of natural selection with Mendel- inspired concepts of modern genetics. Among many contributions, Fisher was the first to propose the idea of heterozygote advantage to explain the persistence of harmful genetic variants in certain populations. Sir Ronald A. Fisher (1890-1962) English statistician, evolutionary biologist, geneticist, eugenicist; Published:“The correlation Between relatives on the supposition of Mendelian Inheritance” in 1918.
The combined effects of multiple genes can produce normal distributions for quantitative traits
B. Phenotypes in populations • Familial aggregation of disease • Disease risk as a quantitative trait • Factors that obscure patterns of inheritance
Familial aggregation of disease • Relative risk • Concordance and allele sharing among relatives Prevalence of the disease among the relatives of an affected person Prevalence of the disease in the general population lr = 0.25 (2 shared alleles) + 0.5 (1 shared allele) + 0.25 (0 shared alleles) = 1 shared allele (average). One-of-two alleles shared = 50% shared alleles.
Correlations between risk of developing schizophrenia and the degree of relatedness among relatives (1/8) (1/4) (1/2)
Disease risk as a quantitative trait:Susceptibility and protective genes influence the liability (risk) of developing a polygenic disease
Factors that obscure patterns of disease inheritance • Phenocopy • Variable penetrance and expressivity • Locus heterogeneity • Allelic heterogeneity • Environmental influences
C. Genes in populations • DNA polymorphisms • Allele and genotypes frequencies • Ethnic differences in allele frequencies and susceptibility to disease • Out of Africa
DNA polymorphisms Single nucleotide Microsatellites Minisatellites polymorphisms (SNPs) Technically, a polymorphism is a genetic variant that is present in a population at a frequency of > 1%.
SNPs and haplotypes rs384756 (A > C) or (T > G) [0.67/0.33] AATGCCGATTCAGGGCTTAACG TTACGGCTAAGTCCCGAATTCG Chromosome 1 Chromosome 2 AATGCCGATTCCGGGCTTAACG TTACGGCTAAGGCCCGAATTCG SNP1 (A/C) SNP2 (G/T) SNP3 (C/A) SNP4 (T/C) Chromosome A G C T H1 A T C T H2 C T A C H3 C G A T H4 Haplotypes
Allele and genotype frequencies in populations: the Hardy-Wienberg equilibrium (HWE) Let: PA= population frequency of A-allele = p Pa = population frequency of a-allele = q PA + Pa = p + q = 1 AA 2Aa aa p2 + 2pq + q2 = 1 Assumptions: large population; no: new mutations, selection; migration
Frequency of DCCR5 alleles in Europe, the Middle East and India DCCR5 is a 32 bp deletion within the cytokine receptor, subtype 5 (CCR5) gene. (Individuals homozygous for DCCR5 are resistant to some strains of HIV virus.)
D. Mapping disorder genes • Linkage • Linkage analysis within pedigrees • Linkage analysis within populations • Whole exome or genome sequencing • The “architecture” of complex disorders
Linkage “Marker” locus Disease risk locus (a = liability allele) M d M D Recombination Chromosome 1 Chromosome 2 m d m D M d M D M D M d Gametes m d m D m d m D The alleles of chromosomal markers located in close proximity to a disease risk locus tend not to be separated by recombination and therefore co-assort with the risk and non-risk alleles. In the case above, the “m” marker allele co-assorts with the “a” risk allele. For this reason, the marker tends to be co-inherited with the disease.
Linkage analysis within pedigrees • Examines the coinheritance of a disease with chromosomal markers within multiple extended families • SNPs, mini- or “micro-satellites are often used as markers • Usually allows localization to only a general region of a particular chromosome (e.g. within several million bp) • Additional mapping is required to identify the disease gene • Relatively insensitive to allelic heterogeneity
Linkage-analysis within populations:case-control association studies • Depends upon “linkage-disequilibrium” between genetic markers and disease risk variants • Genetic markers are usually SNPs or CNVs. Commercially available genotyping platforms allow over 1 million SNPS to be examined for association with disease in each individual tested. • Sensitive to allelic heterogeneity, but allows localization of susceptibility alleles to within 10 – 100 kb.
Linkage-disequilibrium (LD) A B SNP1 (A/a) SNP2 (B/b) A b a B a b D’ = [PAB- PAPB]/DMAX PA = frequency of allele A in population PB = frequency of allele B in population PAB = frequency of association of alleles A and B DMAX = Maximal value of |PAB-PAPB| D’ = 0 when there is no LD; ±1 for complete association or dissociation.
Linkage-disequilibrium (D’) map of the tryptophan hydroxylase 2 (TPH2) gene
Whole exome sequencing and whole genome sequencing Lupski JR et al, Whole genome sequencing in a patient with Charcot-Marie-Toot Neuropathy, The New England Journal of Medicine362, 1181-1191, 2010
The “architecture” of complex diseases Common disease-common variant (CDCV) model vs Common disease-rare variant (CDRV) model Case-control association studies are an effective tool for identifying common susceptibility alleles of moderate effect. By contrast, large-scale DNA sequencing is more efficient for detecting rare genetic variants of large effect.
Sequenced genetic variants in an individual(Ref: Lupski JR et al, New England J. Medicine362, 2010) *In this table, ”SNPs” also includes small indels and other possible duplications or deletions. Total “SNPs” = 2,858,587 known + 561,719 novel. Note: this study also identified 234 CNVs ranging in size from 1690 to 1,627,813 bp; 220 of these overlap with known CNVs.
E. Selected complex disorders • Digenic retinitis pigmentosa • Venous thrombosis • Hirschsprung disease • Coronary artery disease • Alzheimer’s disease
Digenic retinitis pigmentosa Ref: Kajiwara K et al, Science264, 1994
Venous thrombosis (Risk influenced by two genes + “environmental” factor) Factor V Arg506Gln WT Leiden (Arg) (Gln) 3’-UTR SNP G20210A (rs1799963) A-allele increases levels of prothrombin mRNA. ~2.9% of Caucasians are heterozygous at this locus. Factor V Leiden (FVL) is more stable than wild type Factor V Risk for thrombosis is increased 7-fold for heterozygotes (80-fold for homozygotes) ~ 5% of Caucasians are heterozygous for FVL Note: OC use + rs1799963-A increases risk of cerebral vein thrombosis 30 - 150-fold! OC = oral contraceptives increase expression levels of Factor X and prothrombin a = activated form
Hirschsprung disease [HSCR](Congenital aganglionic megacolon) Incidence ~1/5000 children; males affected 2-4 times as frequently than females. Cause: incomplete development of sympathetic nervous system (myenteric plexis) in one or more segments of colon. Lack of these nerves prevents the colon from relaxing, resulting in intestinal blockage. To date, at least ten genes have been implicated in HSCR. Among these, the tyrosine receptor kinase RET has been identified as the major disease-causing gene. Common SNPs that disrupt the binding of transcription factors to an enhancer element located within the first intron of RET reduce RET mRNA expression and are highly associated with sporadic HSCR.
Coronary artery disease [CAD] Coronary artery disease kills about 450,000 every year in the US. Cast of coronary arteries (yellow = right; red = left arterial trees) Steps leading to CAD
Genetics of CAD Familial CAD Familial hypercholesterolemia: autosomal dominant disorder caused by inactivating mutations of the low-density lipoprotein receptor (LDLR) gene located at 19p13.2. Familial aggregation *Compared to general population; **After controlling for risk factor including diabetes, hypertension and smoking Idiopathic CAD Genetic risk factors: hypertension, obesity, diabetes mellitus (each a disease with complex genetic components) Non-genetic risk factors: Age, sex (male>female), smoking, physical inactivity, stress GWAs studies have identified candidate CAD risk genes that function within biological pathways related to serum lipid transport and metabolism, vasoactivity, blood coagulation, inflammatory and immune pathways, and arterial wall components
Alzheimer’s disease [AD] • Progressive, incurable neurodegenerative disease that leads to dementia and death • Symptoms usually appear after age of 65; early on-set forms also known • Familial AD (FAD): accounts for ~5% of cases; Sporadic AD (Late-onset AD: LOAD) • Currently thought to affect 5 M individuals in the US and 35 M individuals worldwide; this number may increase to > 115 M worldwide by 2050!
Brain pathology in AD (1) AD brainnormal brain
Brain pathology in AD (2) AD brainnormal brain
Brain pathology in AD (3)amyloid plaques and neurofibrillary tangles • Low-power: amyloid plaques, • (B) high-power: amyloid plaque, • (C) neurofibrillary tangles (NFT), silver stained • (D) electron micrograph of neurofibrillary tangles composed of hyperphosphorylated tau Co-staining of amyloid plaques & NFT
Pathways of amyloid protein precursor (APP) proteolysis Li H, Wolfe MS and Selkoe DJ, Structure17, 2009
miR-107 miR-9, mir29a/b-1 BACE1-AS RNA CALHM1 ATXN1 IL33 CHRM1 PRKCA APP APP ADAM9 ADAM10 ADAM17 α-secretase BACE1 BACE2 b-secretase PION TNFRSF1, 2 TNF , GSK3A C83 + sAPPa C99 + sAPPb γ-secretase PSEN1, PSEN2 PSENEN, NCSTN APH1A, APH1B g-secretase p3 + AICD Ab40/42+ AICD
degraded Ab (intracellular) degraded Ab (extracellular) degraded Ab (liver) LRP1 LRP2 LDLR CLU ABCA1 APOEe3 APOEe4 IDE NEP PLAU* C3 CR1* UCHL1 PARK2 SORL1 SORC1 PICALM* proteolysis proteolysis clearance secretion Ab (extracellular) Ab (intracellular) Ab deposition uptake serum sRAGE CST3 APOeE RAGE APOEe4 LRP1 FPRL1 CHRNA7 blood vessel (BBB) oligomers (intracellular) oligomers (extracellular) plaques (extracellular)
lipid homeostasis neuroinflammation LRP1 APOE4 blood pressure LDL CETP APP (endosomal) APP (surface) IL1,6, 8 TNF,TNK1 APOEe4 SORL1 SORC1 PICALM* LRP1B APOE2 cytokine secretion ACE APBA1 APAB2 PIN1 APBB1 activation of microglia blood vessel pathology VEGF IL33 RAGE MAPK1,3,14 PTPRC phagocytosis & complement-mediated clearance (in liver) C3 CR1* Ab burden (monomers, oligomers, plaques) MAPT GSK3A GSK3B CDK5 CDK5R1 GAB2 synapse dysfunction free-radical production PICALM* PIN1 CHRNA7 CHRNB2 CHRM1 neurofibrillary tangles (hyper-phosphorylated tau) cholinergic neurotransmission mitochondria dysfunction TFAM TNF TNK1 FPRL1 DAPK11 NGF BDNF WWC1 neuronal cell death AD
References and further reading RI Nussbaum, RR McInnes and HF Willard, “Thompson & Thompson Genetics in Medicine, Edition 7,” 2007, Saunders Elsevier, Philadelphia, PA; ISBN: 978-1-4160-3080-5 (Chapters 8 -10) T Strachan and A Read, “Human Molecular Genetics, 4th Edition,” 2011, Garland Science, New York, New York ISBN: 978-0-815-34149-9 (Chapters 3, 14 &15)
Appendix • Normal (Gaussian) distributions • Punnett squares for multiple loci (genes)