540 likes | 852 Views
Poxviruses and Adaptive Genome Evolution. Aoife McLysaght Dept. of Genetics Trinity College Dublin. Genome Evolution. Evolution of genome arrangement Evolution of genome content. Genome Evolution. Evolution of genome arrangement Gene order changes Inversions, translocations
E N D
Poxviruses and Adaptive Genome Evolution Aoife McLysaght Dept. of Genetics Trinity College Dublin
Genome Evolution • Evolution of genome arrangement • Evolution of genome content • .
Genome Evolution • Evolution of genome arrangement • Gene order changes • Inversions, translocations • Evolution of genome content • .
Genome Evolution • Evolution of genome arrangement • Gene order changes • Inversions, translocations • Evolution of genome content • Gene gain (sequence divergence, duplication, recombination, horizontal transfer) • Gene loss (deletion) • .
Genome Evolution • Evolution of genome arrangement • Gene order changes • Inversions, translocations • Evolution of genome content • Gene gain (sequence divergence, duplication, recombination, horizontal transfer) • Gene loss (deletion) • One or more genes per event
Genome Evolution • Translate knowledge from sequenced or model genomes to organism of interest • Positional cloning of genes • Use probes designed in one genome to detect a target in another genome • Improve model parameters for phylogenetic inference from genome arrangement
Genome Structure • Not just a bag of genes • Genome organisation contains information • Order of Hox genes corresponds to spatial pattern of gene expression • Clustering of housekeeping genes • By observation of ‘allowed’ changes gain understanding of genomic constraints and plasticity
Multiple Genome Comparison • Greater power to detect change • Precision • Can infer lineage in which change occurred • Detect direction and rate of change • More genomes also increase computational burden
Pox virus genomes • 20 completely sequenced genomes • 150-300kb containing ~200 genes
Poxviruses • Double-stranded DNA viruses, no RNA stage • Replicate in the host cytoplasm • Entomopox – insect infecting • Chordopox – vertebrate infecting • Orthopox – subset of chordopox which includes smallpox (variola) and vaccinia
Questions: • How are these genomes arranged? • How has genome content changed? • Is the rate of change constant?
Questions: • How are these genomes arranged? • How has genome content changed? • Is the rate of change constant? • Can we detect adaptive genome evolution?
Orthologue detection Significant sequence similarity • How significant? over a long stretch of the protein • How long?
Complete linkage • Single-link clustering • Our method
Complete linkage C A E B D
Single-link clustering C A E G F B D
C E A J F D B I C G E H D B
Orthologues • 4042 total proteins • 3384 proteins classified into 875 groups • 813 complete linkage • 521 groups of 1 member • 150 groups of 2 members • 204 ≥ 3 members
Poxvirus Phylogeny 34 orthologues present in all genomes
Poxvirus Phylogeny 34 orthologues present in all genomes
Orthopox phylogeny 92 orthologues present in all orthopox genomes
Counting gene gain and loss • Examine phylogenetic spread of a group of orthologues • Assign gene gain and loss events to branches in the phylogeny
Infer gene gain along the branch to the most recent common ancestor
Rate of Gene Gain • Tested for uniform rate of gene acquisition • Assume a molecular clock
Rate of Gene Gain • Tested for uniform rate of gene acquisition • Assume a molecular clock • Are gene acquisition events distributed randomly throughout the tree?
Rate of Gene Gain • Tested for uniform rate of gene acquisition • Assume a molecular clock • Are gene acquisition events distributed randomly throughout the tree? • Simulations
Significant deficit Significant excess
Increased Gene Gain in the Orthopox Lineage • Slower rate of amino acid substitution within this clade (leading to abberantly short branch lengths) • Takezaki relative rate test • Branch lengths from synonymous distances
Increased Gene Gain in the Orthopox Lineage • Slower rate of amino acid substitution within this clade (leading to abberantly short branch lengths) • Takezaki relative rate test • Branch lengths from synonymous distances • Increased rate of gene gain • Increased selection for the retention of gained genes
Sources of Gene Acquisition • Extensive sequence divergence • Recombination • Horizontal transfer
Horizontal Transfer • AMV-EPB_034 – inhibitor of apoptosis from Amsacta moorei entomopoxvirus (AMV-EPB) • GenBank sequence – inhibitor of apoptosis from Bombyx mori (silkworm) BLAST e-value 9e-81 • Amsacta moorei entomopoxvirus infects Amsacta moorei (Red Hairy Caterpillar) • Bombyx and Amsacta both OrderLepidoptera
Horizontal Transfer • AMV-EPB_034 – inhibitor of apoptosis from Amsacta moorei entomopoxvirus (AMV-EPB) • GenBank sequence – inhibitor of apoptosis from Bombyx mori (silkworm) BLAST e-value 9e-81 • Amsacta moorei entomopoxvirus infects Amsacta moorei (Red Hairy Caterpillar) • Bombyx and Amsacta both OrderLepidoptera • 62% of best non-viral GenBank hits are from same taxonomic Class as viral host
Gene loss modelling • Events are not independent • Depend on previous (in time) gain and loss events of the gene family • Requires a probabilistic model?
Adaptive Evolution • Selection for diversification • Positive selection • Characteristic of host-parasite co-evolution
Detection of Positive Selection • Two classes of DNA substitutions • Synonymous (DNA change without amino acid change) • Nonsynonymous (DNA change causing amino acid change) • Neutral – equal frequencies • Conservative selection – fewer nonsynonymous substitutions • Positive selection – more nonsynonymous substitutions
Detection of Positive Selection • Two classes of DNA substitutions • Synonymous (DNA change without amino acid change) • Nonsynonymous (DNA change causing amino acid change) • Neutral – equal frequencies • Conservative selection – fewer nonsynonymous substitutions • Positive selection – more nonsynonymous substitutions
Detection of Positive Selection • Two classes of DNA substitutions • Synonymous (DNA change without amino acid change) • Nonsynonymous (DNA change causing amino acid change) • Neutral – equal frequencies • Conservative selection – fewer nonsynonymous substitutions • Positive selection – more nonsynonymous substitutions
Detection of Positive Selection • Two classes of DNA substitutions • Synonymous (DNA change without amino acid change) • Nonsynonymous (DNA change causing amino acid change) • Neutral – equal frequencies • Conservative selection – fewer nonsynonymous substitutions • Positive selection – more nonsynonymous substitutions
Detection of Positive Selection • 204 groups of orthologues • Maximum liklihood test for positive selection (PAML) • Significantly higher frequency of nonsynonymous substitutions
Positive Selection on Pox Genes • Detected positive selection on 26 genes • Examples: • Membrane glycoprotein • Haemagluttinin • Immunoglobulin domain protein
Positive Selection on Pox Genes • 13 genes are unique to orthopox clade • Significantly more than expected (P < 0.05) • Disproportionate frequency of positive selection on genes gained within the orthopox lineage
Adaptive Genome Evolution? • Association of positive selection on protein sequences and increased rate of gene acquisition