
How BLAST Works: Role of Word Size, T, & Scoring Matrix in Seeding

Explore the impact of word size, scoring matrix, and extension on the BLAST algorithm through examples and analysis. Learn to optimize query seeding and evaluation for efficient sequence alignment.

mitchem

Presentation Transcript


  1. How BLAST Works: Role of Word Size, T, & Scoring Matrix in Seeding

     Query:             METGAAAALGMALAAGLGALGAAIGDGICTSKLLEGVARQPEARGQLMTLMFISVGLIESIPIIAVVVAFMLMGKIA
     Database entry #1: MEVGAAAAIATGLAVGLGALGAAVGDGICTGKAIESIARQPEAKGTIQTTMFISVGLIESIPIIAVVLAFMLFGKLG
     Database entry #2: MEIVLGMTAIAVALLIGMGALGTAIGFGLLGGKFLEGAARQPEMAPMLQVKMFIVAGLLDAVTMIGVGIALFMLFTNPLGAML
     Database entry #3: MDMSLQVLGNLNGLTAVAVALLISLPALGTAIGFGVLGGKYLEGVARQPELGGMLLGRMFIVAAFVDAFAAISIAIGFLVLYANPLAIPGLAETAQKVIGS

     • Assume a word size of 3, T = 999, & the BLOSUM62 matrix.
     • (1) How many word hits to database entries #1, #2, & #3?
     • (2) Does entry #3 look better if we lower the word size to 2?
     • (3) Does entry #3 look better if T = 10?
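
One way to explore the word-hit questions on this slide is to count the positions where the query and a database entry share an identical word. The sketch below assumes that T = 999 is meant to rule out neighborhood words, so that only identical words count as hits; the function name `word_hits` is hypothetical. It does not cover the T = 10 case, which would require scoring every candidate word against the full BLOSUM62 matrix.

```python
from collections import defaultdict

QUERY = "METGAAAALGMALAAGLGALGAAIGDGICTSKLLEGVARQPEARGQLMTLMFISVGLIESIPIIAVVVAFMLMGKIA"
ENTRY1 = "MEVGAAAAIATGLAVGLGALGAAVGDGICTGKAIESIARQPEAKGTIQTTMFISVGLIESIPIIAVVLAFMLFGKLG"
ENTRY2 = "MEIVLGMTAIAVALLIGMGALGTAIGFGLLGGKFLEGAARQPEMAPMLQVKMFIVAGLLDAVTMIGVGIALFMLFTNPLGAML"
ENTRY3 = ("MDMSLQVLGNLNGLTAVAVALLISLPALGTAIGFGVLGGKYLEGVARQPELGGMLLGRMFIVAAFVDAFAAISIAIGFLVLY"
          "ANPLAIPGLAETAQKVIGS")


def word_hits(query, subject, word_size):
    """Return (query_pos, subject_pos) pairs, 0-based, where identical words start.

    Assumption: only exact word matches are hits (no neighborhood words).
    """
    # Index every word of the query by its starting position.
    index = defaultdict(list)
    for i in range(len(query) - word_size + 1):
        index[query[i:i + word_size]].append(i)

    # Scan the subject and look each of its words up in the query index.
    hits = []
    for j in range(len(subject) - word_size + 1):
        for i in index.get(subject[j:j + word_size], ()):
            hits.append((i, j))
    return hits


if __name__ == "__main__":
    for w in (3, 2):
        for name, entry in (("#1", ENTRY1), ("#2", ENTRY2), ("#3", ENTRY3)):
            print(f"word size {w}, entry {name}: {len(word_hits(QUERY, entry, w))} hits")
```

Note that this counts every matching position pair, including hits that fall on different diagonals; whether those should count as useful seeds is part of what the exercise asks you to judge.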

  2. How BLAST Works: Role of X in Extension

     Query:             METGAAAALGMALAAGLGALGAAIGDGICTSKLLEGVARQPEARGQLMTLMFISVGLIESIPIIAVVVAFMLMGKIA
     Database entry #1: MEVGAAAAIATGLAVGLGALGAAVGDGICTGKAIESIARQPEAKGTIQTTMFISVGLIESIPIIAVVLAFMLFGKLG
     Database entry #2: MEIVLGMTAIAVALLIGMGALGTAIGFGLLGGKFLEGAARQPEMAPMLQVKMFIVAGLLDAVTMIGVGIALFMLFTNPLGAML
     Database entry #3: MDMSLQVLGNLNGLTAVAVALLISLPALGTAIGFGVLGGKYLEGVARQPELGGMLLGRMFIVAAFVDAFAAISIAIGFLVLYANPLAIPGLAETAQKVIGS

     • (1) Try to extend off of the seed shown in red using X = 5.
     • (2) Try to extend off of the seed shown in red using X = 2.
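
The effect of X on extension can be sketched with a small ungapped X-drop routine. This is a minimal illustration, not BLAST's actual implementation: the function name `extend_right`, the toy sequences, and the toy scoring function (+4 for a match, -2 for a mismatch) are all assumptions standing in for the slide's seed and the BLOSUM62 matrix. It also assumes extension stops once the running score falls more than X below the best score seen so far.

```python
def extend_right(query, subject, q_start, s_start, score, x_drop):
    """Ungapped extension to the right of a seed.

    q_start / s_start: index of the first residue after the seed in each sequence.
    score: function (a, b) -> int giving the substitution score.
    Returns (best_gain, best_length): the best score added by the extension
    and how many residue pairs that best extension covers.
    """
    running = 0      # score of the extension so far
    best = 0         # best extension score seen so far
    best_len = 0     # length at which the best score was reached
    k = 0
    while q_start + k < len(query) and s_start + k < len(subject):
        running += score(query[q_start + k], subject[s_start + k])
        k += 1
        if running > best:
            best, best_len = running, k
        elif best - running > x_drop:
            break    # dropped more than X below the best: give up
    return best, best_len


def toy_score(a, b):
    """Hypothetical stand-in for a substitution matrix."""
    return 4 if a == b else -2


if __name__ == "__main__":
    q, s = "AAAXXAAA", "AAAYYAAA"
    print(extend_right(q, s, 0, 0, toy_score, x_drop=5))  # (20, 8)
    print(extend_right(q, s, 0, 0, toy_score, x_drop=2))  # (12, 3)
```

With X = 5 the extension survives the two mismatches (a drop of 4) and reaches the matching residues beyond them; with X = 2 it stops at the second mismatch and reports only the first three pairs.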

  3. How BLAST Works: Role of Scoring Matrix in Evaluation

     Query  8   ALGMALAAGLGALGAAIGDGICTSKLLEGVARQPEARGQLMTLMFISVGLIESIPIIAVV  67
                A+ +AL  G+GALG AIG G+   K LEG ARQPE    L   MFI  GL++++ +I V
     Sbjct  9   AIAVALLIGMGALGTAIGFGLLGGKFLEGAARQPEMAPMLQVKMFIVAGLLDAVTMIGVG  68

     Query  68  VA-FML  72
                +A FML
     Sbjct  69  IALFML  74

     • (1) Score the alignment above with the BLOSUM62 matrix and gap penalties (Existence: 15, Extension: 2).
     • (2) How would the score change if you only scored up to the end of the first line?
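
Scoring an alignment like the one above means summing a matrix score for each aligned residue pair and subtracting a penalty for each gap. The sketch below scores only the short second block of the alignment. It assumes the convention that a gap of length k costs existence + k × extension, and it includes only the five BLOSUM62 entries that block needs (transcribed from the standard table; check them against a published copy). The function name `score_alignment` is hypothetical.

```python
# Only the BLOSUM62 entries needed for the second alignment block.
BLOSUM62_SUBSET = {
    ("V", "I"): 3,
    ("A", "A"): 4,
    ("F", "F"): 6,
    ("M", "M"): 5,
    ("L", "L"): 4,
}


def score_alignment(query_row, subject_row, matrix, gap_existence, gap_extension):
    """Score two aligned rows of equal length ('-' marks a gap).

    Assumption: a gap of length k costs gap_existence + k * gap_extension.
    """
    total = 0
    prev_gap_side = None  # "q" or "s" while inside a gap, else None
    for a, b in zip(query_row, subject_row):
        gap_side = "q" if a == "-" else "s" if b == "-" else None
        if gap_side is not None:
            if gap_side != prev_gap_side:
                total -= gap_existence   # a new gap opens
            total -= gap_extension       # charged for every gap position
        else:
            # The matrix is symmetric, so look the pair up in either order.
            total += matrix[(a, b)] if (a, b) in matrix else matrix[(b, a)]
        prev_gap_side = gap_side
    return total


if __name__ == "__main__":
    # Second block: 3 + 4 - (15 + 2) + 6 + 5 + 4
    print(score_alignment("VA-FML", "IALFML", BLOSUM62_SUBSET, 15, 2))  # 5
```

Scoring the full first line the same way requires the complete BLOSUM62 matrix in place of the five-entry subset.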

  4. How BLAST Works: Try It For Yourself

     METGAAAALGMALAAGLGALGAAIGDGICTSKLLEGVARQPEARGQLMTLMFISVGLIESIPIIAVVVAFMLMGKIA

     • BLAST against the nr database of proteins.
     • BLAST against just the microbial genome (proteins) database.
     • BLAST against just the Desulfotomaculum reducens genome.
