1 / 43

Plasmid Extraction and Insert Check: Illumina Sequencing and Data QC

This tutorial provides a step-by-step guide on plasmid extraction, insert check, and Illumina sequencing. It also covers data quality control, sequence editing, and BLAST analysis to improve the identity and possibilities of snails and parasites in DNA.

mlaster
Download Presentation

Plasmid Extraction and Insert Check: Illumina Sequencing and Data QC

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Today • HK • Plasmid extraction, insert check • *.ppt • Illumina sequencing • MBF visit/ GALAXY data QC • Sequence editing and BLAST data quality, improve, identity of snails

  2. SNAILAND PARASITES BIOLOGY DNA “identity, possibilities” phylogenetics CTAB/DNAzol CTAB/DNAzol Illumina (full) genome sequencing gel electrophoresis nanodrop spec PCRrDNA/mito Qubit Fluorometry Covaris fragmentation Ampure (fragment collection) Kapa DNA library preparation kit Pippin size selection QC Bioanalyzer, Qubit, qPCR Illumina run TA cloning, B/W screening electrophoresis Qiagen plasmid extraction Restriction digests direct sequencing M13 sequencing Sequence ID (BLAST) editing Galaxy QC Data file (MT) genome assembly Mitos, manual annotation Gene annotation Primer design, walking Phylogenetics GenBank submission

  3. BECAUSE OF TEMPLATE INDEPENDENT 3’ A-ADDITION TO AMPLICON MORE EFFICIENT THAN LIGASE

  4. PlasmidDesign BioTech No ligase

  5. Friday Each group Select 4 white colonies, sample by “scraping” with yellow tip Eject yellow tip with bacteria in 2.5 ml LB, 100mg/ml kanamycin 4x4 colonies EP1_18S, 3x12 colonies EP1_28S, 3x4 colonies EP3_18S 1,3,5,6 4,8,10 2,7,9SHARE COLONIES IF NEEDED OR….? NO colonies, troubleshoot, repeat!

  6. How to proceed? EXTRACT PLASMIDS CHECK FOR INSERTS

  7. *.ppt

  8. NextSeq Illumina run, 130 million, 2 x 150 (300nt) Paired End Reads Result: ≤ 3,900,000,000 nucleotides(inspect all nts @1/sec: 365/24/60/60 = ~123 years) Sequence quality: Trimming bad sequence + Filtering to remove adaptors, barcodes Assembly https://www.youtube.com/watch?annotation_id=annotation_1533942809&feature=iv&src_vid=HMyCqWhwB8E&v=fCd6B5HRaZ8

  9. Bring your LAPTOP Use W7 LAPTOP URL: http://emil.unm.edu/galaxy Login: UNM account id without @unm. pw:4546L2018 September 24th, Dr LiJing Bu

  10. DNA SANGER SEQUENCING “*.ABI” Sequence analysis?

  11. Forward primer sequencing reaction sequencing generates + strand ATCG ggaa 5’ 3’ dye blobs Reverse primer sequencing reaction CCTT tagc 5’ generates - strand 3’ dye blobs

  12. sequencing + strand sense coding primerR A 5’ primerF 3’ A - strand antisense non-coding primerR 3’ primerF 5’ sense GGAA A 5’ ATCG 3’ NEED TO KNOW WHY SEQUENCE THIS? A antisense CCTT 3’ TACG 5’

  13. sequencing forward + strand sense coding primerR A 5’ primerF 3’ A - strand antisense non-coding primerR 3’ primerF 5’ sense GGAA A 5’ ATCG 3’ Forward primer sequencing reaction ATCG 5’ 3’ A - strand antisense non-coding CCTT 3’ TACG 5’

  14. sequencing results + strand sense coding primerR A 5’ primerF 3’ A - strand antisense non-coding primerR 3’ primerF 5’ Forward primer sequencing reaction generates sense strand ATCG ggaa 5’ 3’ A - strand antisense non-coding CCTT 3’ TACG 5’

  15. sequencing reverse + strand sense coding primerR A 5’ primerF 3’ A - strand antisense non-coding primerR 3’ primerF 5’ sense GGAA A 5’ ATCG 3’ A antisense CCTT 3’ TACG 5’

  16. sequencing reverse + STRAND SENSE CODING GGAA A 5’ ATCG 3’ A - strand antisense non-coding cctt 3’ tagc 5’ Reverse primer sequencing reaction SENSE GGAA A 5’ ATCG 3’ CCTT 5’ 3’ A antisense CCTT 3’ TACG 5’

  17. sequencing results + STRAND SENSE CODING GGAA A 5’ ATCG 3’ A - strand antisense non-coding cctt 3’ tagc 5’ Reverse primer sequencing reaction SENSE GGAA A 5’ ATCG 3’ CCTT tagc 5’ generates - strand 3’

  18. sequencing results + STRAND SENSE CODING GGAA A 5’ ATCG 3’ A - strand antisense non-coding cctt 3’ tagc 5’ Reverse primer sequencing reaction SENSE GGAA A 5’ ATCG 3’ CCTT tagc 5’ generates - strand 3’ ABI File generated 5’-3’ Sequence is reverse complement of coding sequence !

  19. Forward primer sequencing reaction sequencing generates + strand ATCG ggaa 5’ 3’ Reverse primer sequencing reaction CCTT tagc 5’ generates - strand 3’ sequencing generates + strand ATCG ggaa 5’ 3’ reverse complement yields + (coding) sequence two confirmatory sets of sequence data

  20. sequencing results + STRAND SENSE CODING GGAA A 5’ ATCG 3’ A - strand antisense non-coding cctt 3’ tagc 5’ Reverse primer sequencing reaction SENSE GGAA A 5’ ATCG 3’ CCTT tagc 5’ generates - strand 3’ ABI File generated 5’-3’ TTCC cgat 3’ 5’ dnarts - setareneg Sequence is reverse complement of coding sequence !

  21. Sequence editing (H8) Full length of amplicon? Look for other primer Do we have all information? Edit, compare replicate reactions Compare computationally to other sequences BLAST AND Phylogenetic analysis for “best” identification

  22. So full length? Abrupt end Primer? Same for other sequencing reaction

  23. ? LCO1490: 5'-GGT CAA CAA ATC ATA AAG ATA TTG G -3’ HC02198: 5'-TAA ACT TCA GGG TGA CCA AAA AAT CA-3’ Primer is reverse complement at end of sequence run

  24. Sequence reverse complemented LCO1490: 5'-GGT CAA CAA ATC ATA AAG ATA TTG G -3’ HC02198: 5'-TAA ACT TCA GGG TGA CCA AAA AAT CA-3’ Primer is reverse complement at end of seqeunce run

  25. Sequence reverse complemented LCO1490: 5'-GGT CAA CAA ATC ATA AAG ATA TTG G -3’ HC02198: 5'-TAA ACT TCA GGG TGA CCA AAA AAT CA-3’ Primer is reverse complement at end of sequence run

  26. Fix “N” calls: what are these? Stay at regular spacing distance

  27. Fix “N” calls: what are these? Stay at regular spacing distance

  28. Dye Blobs? Edit under the blob

  29. A G C G T N A A NNNN Dye Blobs? Edit under the blob

  30. A G C G T N A A NNNN T T G A C G A G C A T T T T T A Dye Blobs? Edit under the blob

  31. BLAST

  32. BLAST

  33. Pseudosuccinea columella mitochondrial COI gene for cytochrome oxidase subunit I, partial cds, isolate: Pc.2 Sequence ID: gi|972806524|LC015521.1Length: 667Number of Matches: 2 Related Information Range 1: 132 to 656GenBankGraphics Next Match Previous Match Alignment statistics for match #1 Score Expect Identities Gaps Strand 569 bits(630) 2e-158 432/525(82%) 0/525(0%) Plus/Plus Query 126 TTTTGTTATAAtttttttCATANTTATACCAATAATANTTGGAGGGTNNGGAAATTGAAT 185 |||||||||||||||||||||| |||||||||||||| ||||||||| ||||||||||| Sbjct 132 TTTTGTTATAATTTTTTTCATAGTTATACCAATAATAATTGGAGGGTTTGGAAATTGAAT 191 Query 186 AGTTCCNCTTCTCATTGGNGCTCCNNATATAANATTTCCTCGNATAAATAATATANNANN 245 |||||| ||||||||||| ||||| |||||| ||||||||| |||||||||||| | Sbjct 192 AGTTCCACTTCTCATTGGTGCTCCAGATATAAGATTTCCTCGTATAAATAATATAAGATT 251 Query 246 TTGATTACTACCACCTTCNNTTATTCTCTTACTTTGNTCTNGAATANNANAAGGTGGGGN 305 |||||||||||||||||| |||||||||||||||| ||| ||||| | ||||||||| Sbjct 252 TTGATTACTACCACCTTCGTTTATTCTCTTACTTTGCTCTAGAATAGTAGAAGGTGGGGT 311 Query 306 AGGNACTGGATGAACAGTTTACCCACCATTGANTGGACCTATTGCTCATGGNGGATCTTC 365 ||| |||||||||||||||||||||||||||| |||||||||||||||||| |||||||| Sbjct 312 AGGTACTGGATGAACAGTTTACCCACCATTGAGTGGACCTATTGCTCATGGTGGATCTTC 371 Query 366 TGNNGANNNNNCTATNNTNTCNNTNCNTNTANCCGGNTNATNNNNGATTTTAGGANNNNN 425 || || |||| | || | | | || |||| | || |||||||||| Sbjct 372 TGTTGATTTAGCTATTTTTTCTTTACATTTAGCCGGTTTATCCAGGATTTTAGGAGCAAT 431 Query 426 NNATTTTATTACTACnntttttAATATACNATCTCCNGGNATTACATTANANCNAANAAN 485 ||||||||||||| |||||||||||| |||||| || ||||||||| | | || || Sbjct 432 TAATTTTATTACTACAATTTTTAATATACGATCTCCAGGTATTACATTAGAACGAATAAG 491 Query 486 ATNATTTGTATGATCNGNATTAGTTACNGCTTNTNNNCTTCTTTTATCTNTNCNNNTACT 545 || |||||||||||| | ||||||||| |||| | |||||||||||| | | |||| Sbjct 492 ATTATTTGTATGATCTGTATTAGTTACAGCTTTTTTACTTCTTTTATCTTTACCAGTACT 551 Query 546 TGCAGGGGCAATTACNANGNTTTTNACANATCGAAATTTTNNTACNACTTNTTTTGATCC 605 ||||||||||||||| | | |||| ||| ||||||||||| ||| |||| ||||||||| Sbjct 552 TGCAGGGGCAATTACAATGCTTTTAACAGATCGAAATTTTAATACCACTTTTTTTGATCC 611 Query 606 TGCTGGAGGTGGNGATNNNATTTTATATCAACNTNNATTCTGATT 650 |||||||||||| ||| ||||||||||||| | ||||||||| Sbjct 612 TGCTGGAGGTGGTGATCCTATTTTATATCAACATTTATTCTGATT 656 Similar but not same Need to do phylogenetics.

More Related