1 / 27

Student users guide for the Yersinia pestis module

Student users guide for the Yersinia pestis module. Mauve. To work with the 5 Yersinia pestis alignment, you will need to follow a few steps #1) download progressive Mauve version 2.1.1 at http://gel.ahabs.wisc.edu/mauve/download.php

neylan
Download Presentation

Student users guide for the Yersinia pestis module

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Student users guide for the Yersinia pestis module

  2. Mauve To work with the 5 Yersinia pestis alignment, you will need to follow a few steps #1) download progressive Mauve version 2.1.1 at http://gel.ahabs.wisc.edu/mauve/download.php #2) download the compressed folder called Yersinia pestis alignment 5 genome.zip at http://gel.ahabs.wisc.edu/~baumler/ #3) extract (uncompress) all files from the folder

  3. In your start menu under programs go to Mauve 2.1.1, start up Mauve, notice there is a users guide in pdf form in this folder, this will contain useful information and commands to navigate Note: your computer may need to update Java, since mauve uses a Java platform for the alignment. You should see a window for Mauve appear

  4. Next double click on the uncompressed Yersinia pestis alignment 5 genome folder, it should contain the following 29 files, take the one (yersinia_pestis_alignment_5genomes), and drag and drop it into the mauve window It should start to say reading sequences here, and in a few seconds the alignment will appear, note computers with less than 512MB RAM may not be able to open the file

  5. Your alignment should look like this Organism name notice the first is CO92, the second is KIM,the third is 91001, the fourth is Antiqua, and the fifth is Nepal516 Using the up or down arrows, you can switch the position of the genomes

  6. Your tool bar is at the top on the left, the tools you will use are in the View pulldown, and also the buttons Returns the viewer back to home Search for features Zoom in/out, you can also hold down the ctrl button and use the arrows on the keyboard Move left or right, you will find this useful to center a region of interest in the middle of the screen prior to zooming in

  7. Top strand Bottom strand The colored blocks are called local colinear blocks (LCB’s), and represent regions of the genome that Mauve has identified as conserved, the lines connect the LCBS, notice that some are in different positions in the other genomes, some are inverted and appear on the bottom strand of the double stranded genome

  8. When you move you mouse over a region of one genome it will show a black box and also show the corresponding region (boxes) in the other four genomes, try scrolling left to right on one genome

  9. You may find it easier to view the 5 genome alignment without the connecting lines:on your keyboard press Shift L (pressing this again makes them reappear)

  10. When viewing the LCB’s, mauve displays regions that are highly conserved/identical as full color. Areas that are unique/variable to one genome appear in white, and represent unique islands

  11. Notice, that when you scroll (slowly) over a white region (island) the black boxes pause in the other genomes, then comes back once you have passed over the island and back into conserved regions

  12. If you would like to look at all five LCB’s, even though one is in a different position, scroll over one LCB and click the mouse button

  13. Lets use the zoom function, press the home button to restore the alignment to original view Now click on the green LCB in the top genome, and using the right button bring it to the center of the screen, now start to zoom in multiple times You will start to see the genes, scroll over one and pause, and a window will pop-up with the product annotation, so here you can view what genes are present in a single genome, and not in the others

  14. Now place you mouse over one of the genes, Click your mouse once on a gene, and a window will pop-up, scroll down and select View CDS in ERICdb This will open the page in the ERIC database for that gene, containing all of the annotations, you can look to see what is known about it and/or if it is involved in virulence (note you may be prompted to a log-in screen, click on the button that says “Enter ASAP”)

  15. Lets use the search feature to find the genes glpD, napA, and araC #1) Click on the search feature #2) Choose a genome or search all of the genomes #3) Type in a gene name (glpD) #4) Click on search

  16. Notice that it has found the glpD gene (highlighted in blue), and also a corresponding gene in each genome. You need to determine which of the five CDS’s produce the full-length functional protein Method #1: click on each gene and go to the view CDS in ERICdb, look at the length and if any are labeled as pseudogenes. If so look for a note that describes why it is thought to be a pseudogene

  17. Identifying mutations in glpD, napA, and araC cont. Method #2: from the feature page in ERIC Scroll down to the feature context part of the page This is a list of all features that are neighboring your gene in the genome, notice some are upstream, downstream, or contained within Notice that contained within your glpD gene there are polymorphic sites (otherwise known as SNP’s) For SNP analysis, you will use a new tool called “Snippy”

  18. In a new tab or web browser window go to http://asap.ahabs.wisc.edu/~cabot/aep/snippy.php It should look like this: Highlight and copy all feature ID’s for polymorphic sites from glpD and paste them into here and click submit feature ID’s

  19. In the middle of each region you will see the polymorphic site (in this case capitol G’s) and the corresponding base in each genome, note you are interested in variations in YPKIM, YPCO92, YP91001, YPNepal, and YpAntiqua. -in this case there is no difference in these 5 genomes in this analysis, scroll down and search the remaining polymorphic sites and see if there is any difference in the various polymorphic sites in the 5 genomes, if not it probably is a larger deletion or insertion event

  20. In your SNP analysis, you want to look for SNP’s that cause a change in the amino acid that it encodes for. In some cases the change results in a premature stop-codon, which may generate a truncated non-functional protein #1) note Snippy shows you if the SNP variation results in a amino acid change, in this case A (Alanine) to T (Threonine) #2) In this second SNP, the change resulted in a stop codon

  21. Using the DNA sequence obtained from the dental pulp from three corpses (found in the file called Ypestis corpse and CA88-4125YPE genes.doc), conduct a BlastN search within the ERIC database with each sequence against the 91001,Nepal, Kim, Antiqua, and CO92 genomes. For each of the three corpses, which serovar is most similar to the strains that caused the 1st and 2nd pandemics? From the ERIC home page you can select to run a Blast search here (http://www.ericbrc.org/)

  22. Paste the first nucleotide sequence from corpse #1 Select entire genomes Select the genomes to query, hold down the Ctrl key and select Y . pestis genomes 91001, Antiqua, CO92, KIM, and Nepal Finally click on the Submit Query button, repeat with the other two corpses sequences

  23. Next repeat the BlastN process using the gene sequences from a known North American ancestor (Y. pestis CA88-4125/YPE) for glpD, napA, and araC. Of the 5 genomes (91001, Antiqua, CO92, KIM, and Nepal) representing the three serovars, which is most similar to the known North American ancestor? Based on your analysis did Y. pestis arrive in North America via shipping routes over the Atlantic or Pacific? Atlantic? (Serovar Antiqua of African origin) Pacific? Serovar Orientalis or Mediaevalis of Asian origin Courtesy of education.usgs.gov

  24. One last feature you can use in Mauve To find an island that is in 4 out of 5 strains you will use the backbone view Press the home button first Then go to the View pull down select color scheme then backbone color

  25. Your alignment should look like this in backbone color, regions in all five appear in light purple color, there will be regions that are different colors that will correspond to 2, 3, 4 out of 5 genomes (you may have to zoom in a bit to see these regions) Look for a region in the lightest blue color that is present in CO92, KIM, Antiqua, and Nepal, but absent in the 91001 strain. Analyze the contents and determine if any of the genes may contribute to human infection of Y. pestis.

  26. Some genes from Y. pestis have been characterized, and are thought to be involved in virulence. Your instructor will provide you each with a virulence factor gene. Note some have gene names, while some do not, and in the latter case you will use the “locus tag” identifier. In this case to search for the gene in Mauve, #1) click on the search button #2) from the pull down bar choose locus tag #3) Enter the locus tag which consists of 3 capitol letters followed by 4 numbers(ie YPO1234) #4) click on the search button

  27. Blast search against all Y. pestis genomes including draft genomes From the left side of a feature page for a gene, (in this case hmsH in Y. pestis CO92), you can run BLASTN and BLASTP against all other Y. pestis genomes in ERIC(ASAP) You will be prompted to a Blast page, you will need to select the database you would like to query Then choose the genomes to query; for example check all of the Y. pestis genomes, and then click the search button

More Related