1 / 22

Chapter 2 2.The content of the genome

Chapter 2 2.The content of the genome. 王心宇 副教授 College of Life Sciences. 4.1 Introduction. The key question about the genome is how many genes it contains. We can think about the total number of genes at four levels, corresponding to successive stages in gene expression:

Download Presentation

Chapter 2 2.The content of the genome

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Chapter 2 2.The content of the genome 王心宇 副教授 College of Life Sciences

  2. 4.1Introduction • The key question about the genome is how many genes it contains. We can think about the total number of genes at four levels, corresponding to successive stages in gene expression: • The genome is the complete set of genes of an organism. It is defined by the complete DNA sequence. • The transcriptomeis the set of expressed genes. The transcriptome includes noncoding RNAs as well as mRNAs. • The proteome is the complete set of proteins. It can be used to refer to the set of proteins coded by the whole genome or produced in any particular cell or tissue.

  3. 4.2Genomes can be mapped by linkage, restriction cleavage, or DNA sequence Defining the contents of a genome essentially means making a map. We can think about mapping genes and genomes at several levels of resolution. 1. A genetic (or linkage) map identifies the distance between mutations in terms of recombination frequencies. 2. A linkage map can also be constructed by measuring recombination between sites in genomic DNA. 3. The ultimate map is to determine the sequence of the DNA. YAC, BAC, FOSMID, etc.

  4. 4.3Individual genomes show extensive variation • The coexistence of multiple alleles at a locus is called genetic polymorphism. • A change in a single nucleotide when alleles are compared is called a single nucleotide polymorphism (SNP). • A difference in restriction maps between two individuals is called a restriction fragment length polymorphism (RFLP).

  5. 4.3Individual genomes show extensive variation Figure 4.2A point mutation that affects a restriction site is detected by a difference in restriction fragments

  6. 4.3Individual genomes show extensive variation Figure 4.3Restriction site polymorphisms are inherited according to Mendelian rules. Four alleles for a restriction marker are found in all possible pairwise combinations, and segregate independently at each generation.

  7. 4.4RFLPs and SNPs can be used for genetic mapping Figure 4.4A restriction polymorphism can be used as a genetic marker to measure recombination distance from a phenotypic marker (such as eye color). The figure simplifies the situation by showing only the DNA bands corresponding to the allele of one genome in a diploid.

  8. 4.4RFLPs and SNPs can be used for genetic mapping • The identification of a RFLP that is linked to a disease has two important consequences: • It may offer a diagnostic procedure for detecting the disease. • It may lead to isolation of the gene.

  9. 4.4RFLPs and SNPs can be used for genetic mapping The existence of RFLPs provides the basis for a technique to establish unequivocal parent-progeny relationships. In cases where parentage is in doubt, a comparison of the RFLP map in a suitable chromosome region between potential parents and child allows absolute assignment of the relationship. The use of DNA restriction analysis to identify individuals has been called DNA fingerprinting.

  10. 4.5Why are genomes so large? The C-value is the total amount of DNA in the genome (per haploid set of chromosomes). The C-value paradox describes the lack of relationship between the DNA content (C-value) of an organism and its coding potential. Figure 4.6DNA content of the haploid genome is related to the morphological complexity of lower eukaryotes, but varies extensively among the higher eukaryotes. The range of DNA values within a phylum is indicated by the shaded area.

  11. 4.5Why are genomes so large? Figure 4.7The minimum genome size found in each phylum increases from prokaryotes to mammals.

  12. 4.5Why are genomes so large? Figure 4.8The genome sizes of some common experimental animals.

  13. 4.6Eukaryotic genomes contain both nonrepetitive and repetitive DNA sequences • We can divide the eukaryotic genome into three general types of sequences: • Nonrepetitive sequences are unique; the haploid genome contains only one copy of such a sequence. • Moderately repetitive sequences are found in multiple copies. The copies can be identical to one another or (more typically) can be seen to be related, although they are not identical. • Highly repetitive sequences are very short and are present in large numbers of copies, often organized as tandem repeats. These include very large blocks of material organized as satellite DNA and smaller blocks that are called minisatellites or microsatellites depending on their length.

  14. 4.6Eukaryotic genomes contain both nonrepetitive and repetitive DNA sequences Figure 4.9The proportions of different sequence components vary in eukaryotic genomes. The absolute content of nonrepetitive DNA increases with genome size, but reaches a plateau at ~2 × 109 bp.

  15. 4.7Genes can be isolated by the conservation of exons • Key Terms • A zoo blot describes the use of Southern blotting to test the ability of a DNA probe from one species to hybridize with the DNA from the genomes of a variety of other species. • Key concepts • Conservation of exons can be used as the basis for identifying coding regions by identifying fragments whose sequences are present in multiple organisms.

  16. 4.7Genes can be isolated by the conservation of exons Figure 2.17 A zoo blot with a probe from the human Y chromosomal gene zfy identifies cross-hybridizing fragments on the sex chromosomes of other mammals and birds. There is one reacting fragment on the Y chromosome and another on the X chromosome.

  17. 4.9The conservation of genome organization helps to identify genes Synteny describes a relationship between chromosomal regions of different species where homologous genes occur in the same order. Figure 4.13Mouse chromosome 1 has 21 segments of 1 - 25 Mb that are syntenic with regions corresponding to parts of 6 human chromosomes.

  18. 4.9The conservation of genome organization helps to identify genes • Homolog • ·A gene related to a second gene by descent from a common ancestral DNA sequence. The term, homolog, may apply to the relationship between genes separated by the event of speciation (see ortholog) or to the relationship betwen genes separated by the event of genetic duplication (see paralog). • Ortholog • ·Orthologs are genes in different species that evolved from a common ancestral gene by speciation. Normally, orthologs retain the same function in the course of evolution. Identification of orthologs is critical for reliable prediction of gene function in newly sequenced genomes. (See also Paralogs). • Paralog • ·Paralogs are genes related by duplication within a genome. Orthologs retain the same function in the course of evolution, whereas paralogs evolve new functions, even if these are related to the original one.

  19. 4.10Organelles have DNA The first evidence for the presence of genes outside the nucleus was provided by nonMendelian inheritance in plants

  20. 4.10Organelles have DNA • Maternal inheritance describes the preferential survival in the progeny of genetic markers provided by one parent. • Extranuclear genes reside outside the nucleus in organelles such as mitochondria and chloroplasts. • Cytoplasmic inheritance is a property of genes located in mitochondria or chloroplasts. • Mitochondria and chloroplasts have genomes that show nonMendelian inheritance. Typically they are maternally inherited. • Organelle genomes may undergo somatic segregation in plants. • Comparisons of mitochondrial DNA suggest that humans are descended from a single female who lived 200,000 years ago in Africa.

  21. 4.13Organelles evolved by endosymbiosis • Mitochondrial genomes are more closely related to bacterial genomes than to eukaryotic nuclear genomes. • Mitochondria probably originated when a eukaryotic cell "captured" a bacterium. • Integration of the mitochondrion has involved transfer of genetic information in both directions between it and the nucleus.

  22. 4.14Summary • The sequences comprising a eukaryotic genome can be classified in three groups: • nonrepetitive sequences are unique; • moderately repetitive sequences are dispersed and repeated a small number of times in the form of related but not identical copies;; • and highly repetitive sequences are short and usually repeated as tandem arrays. The proportions of the types of sequence are characteristic for each genome, although larger genomes tend to have a smaller proportion of nonrepetitive DNA. Almost 50% of the human genome consists of repetitive sequences, the vast majority corresponding to transposon sequences. Most structural genes are located in nonrepetitive DNA.

More Related