
Reporting Protein Identifications from MS/MS Results

Reporting Protein Identifications from MS/MS Results. Brian C. Searle Proteome Software Inc. Portland, Oregon USA Brian.Searle@ProteomeSoftware.com. Creative Commons Attribution. Outline. Assigning Proteins from Peptide IDs Correcting for One-Hit-Wonders Protein False Discovery Rates?


Presentation Transcript


  1. Reporting Protein Identifications from MS/MS Results Brian C. Searle Proteome Software Inc. Portland, Oregon USA Brian.Searle@ProteomeSoftware.com Creative Commons Attribution

  2. Outline • Assigning Proteins from Peptide IDs • Correcting for One-Hit-Wonders • Protein False Discovery Rates? • Correcting for Shared Peptides • Publication Standards

  3. Outline • Assigning Proteins from Peptide IDs • Correcting for One-Hit-Wonders • Protein False Discovery Rates? • Correcting for Shared Peptides • Publication Standards

  4. Just to Review: F: possibly correct; R: clearly wrong. Elias JE, Gygi SP. Nat Methods. 2007 Mar;4(3):207-14.

  5. Just to Review:

  6. Just to Review:

  7. Just to Review: ?

  8. …Well, Maybe

  9. Peptides AEPTIR, IDVCIVLLQHK, NTGDR → Protein

  10. AEPTIR 85%, IDVCIVLLQHK 65%, NTGDR 25% → Protein ??%

  11. FDRs for Whole Datasets vs Individual Peptides • Cumulative FDRs only estimate the validity of a data set • Probabilities (or instantaneous FDRs) estimate the validity of a peptide of interest

  12. One Possible Approach • Instantaneous False Discovery Rate Many Others: • PeptideProphet (TPP, Scaffold) • Percolator • Spectral Energies • RAId De Novo

  13. Just to Review:

  14. Just to Review: (score bins: -2 to -1, -1 to 0, 0 to 1, 1 to 2, 2 to 3, 3 to 4, 4 to 5)

  15. Histogram of Decoy Matches (series: “2x Decoy”, “Correct”; y-axis: # of Matches; x-axis: Ion Score – Identity Score)

  16. Histogram of Decoy Matches (series: “2x Decoy”, “Correct”; y-axis: # of Matches; x-axis: Ion Score – Identity Score)

  17. Curve Fit Distributions (series: “2x Decoy”, “Correct”; y-axis: # of Matches; x-axis: Ion Score – Identity Score) Choi H, Ghosh D, Nesvizhskii AI. J Proteome Res. 2008 Jan;7(1):286-92.

  18. Instantaneous FDR Method (series: “2x Decoy”, “Correct”; y-axis: # of Matches; x-axis: Ion Score – Identity Score) Choi H, Ghosh D, Nesvizhskii AI. J Proteome Res. 2008 Jan;7(1):286-92.
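
The difference between a cumulative and an instantaneous FDR can be sketched from binned histogram counts. This is only a minimal illustration, not the curve-fitting method of the cited paper: it assumes a concatenated target-decoy search in which the number of incorrect matches in a bin is estimated as twice the decoy count (the "2x Decoy" series), and the bin counts and function names are hypothetical.

```python
def instantaneous_fdr(target_counts, decoy_counts):
    """Per-bin FDR estimate: 2 * decoy / (target + decoy), capped at 1.

    Assumes incorrect matches land on target and decoy sequences equally
    often, so the incorrect matches in a bin number about 2 * decoy.
    """
    fdrs = []
    for target, decoy in zip(target_counts, decoy_counts):
        total = target + decoy
        # An empty bin carries no information, so report None for it.
        fdrs.append(min(1.0, 2 * decoy / total) if total else None)
    return fdrs


def cumulative_fdr(target_counts, decoy_counts, first_bin):
    """FDR estimate for all matches in bins first_bin and above."""
    target = sum(target_counts[first_bin:])
    decoy = sum(decoy_counts[first_bin:])
    total = target + decoy
    return min(1.0, 2 * decoy / total) if total else None


# Hypothetical counts for score bins -2..-1, -1..0, ..., 4..5.
target_counts = [50, 60, 40, 30, 45, 60, 50]
decoy_counts = [50, 55, 30, 10, 5, 1, 0]

print(instantaneous_fdr(target_counts, decoy_counts))
print(cumulative_fdr(target_counts, decoy_counts, first_bin=3))
```

The cumulative value describes the whole set above a threshold, while the per-bin value describes a match at one particular score, so a match just above the threshold can have a much higher instantaneous FDR than the set it belongs to.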

  19. AEPTIR 85%, IDVCIVLLQHK 65%, NTGDR 25% → Protein ??%

  20. AEPTIR (15%), IDVCIVLLQHK (35%), NTGDR (75%) → Protein (??%) Feng J, Naiman DQ, Cooper B. Anal Chem. 2007 May 15;79(10):3901-11.

  21. AEPTIR (15%), IDVCIVLLQHK (35%), NTGDR (75%) → Protein (4%); 0.15 * 0.35 * 0.75 = 0.04 Feng J, Naiman DQ, Cooper B. Anal Chem. 2007 May 15;79(10):3901-11.

  22. AEPTIR 85%, IDVCIVLLQHK 65%, NTGDR 25% → Protein 96%; 0.15 * 0.35 * 0.75 = 0.04 Feng J, Naiman DQ, Cooper B. Anal Chem. 2007 May 15;79(10):3901-11.
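
The arithmetic on slides 20 to 22 can be written out as a short sketch. It assumes the peptide identifications are independent, so the chance that every peptide is wrong is the product of the individual error probabilities; the function name is hypothetical.

```python
from math import prod


def protein_probability(peptide_probs):
    """Probability that at least one peptide identification is correct.

    Assumes independent peptide identifications: the protein is wrong
    only if every one of its peptides is wrong.
    """
    return 1.0 - prod(1.0 - p for p in peptide_probs)


peptides = {"AEPTIR": 0.85, "IDVCIVLLQHK": 0.65, "NTGDR": 0.25}
p = protein_probability(peptides.values())
print(f"{p:.2f}")  # 0.96, since 0.15 * 0.35 * 0.75 is about 0.04
```

A single peptide returns its own probability unchanged, which is why one-hit wonders get no benefit from this combination.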

  23. If only it were so easy!

  24. Peptide 1 Peptide 2 Peptide 3 Peptide 4 Peptide 5 Peptide 6 Peptide 7 Peptide 8 Peptide 9 Peptide 10 80% Peptides

  25. Peptide 1 Correct Protein A Peptide 2 Peptide 3 Correct Protein B Peptide 4 Peptide 5 Peptide 6 Peptide 7 Peptide 8 Peptide 9 Peptide 10 80% Peptides

  26. Peptide 1 Correct Protein A Peptide 2 Peptide 3 Correct Protein B Peptide 4 Peptide 5 Incorrect Protein C Peptide 6 Peptide 7 Incorrect Protein D Peptide 8 Peptide 9 Peptide 10 80% Peptides 50% Proteins
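
The gap between 80% correct peptides and 50% correct proteins can be reproduced with a small count. The grouping below is an assumption made for illustration (the slide layout does not fix which peptide belongs to which protein): correct peptides cluster on two proteins, while each incorrect peptide points at its own protein.

```python
# Hypothetical assignment: (peptide, protein, peptide_is_correct).
hits = [
    ("Peptide 1", "Protein A", True),
    ("Peptide 2", "Protein A", True),
    ("Peptide 3", "Protein A", True),
    ("Peptide 4", "Protein A", True),
    ("Peptide 5", "Protein B", True),
    ("Peptide 6", "Protein B", True),
    ("Peptide 7", "Protein B", True),
    ("Peptide 8", "Protein B", True),
    ("Peptide 9", "Protein C", False),
    ("Peptide 10", "Protein D", False),
]


def peptide_accuracy(hits):
    """Fraction of peptide identifications that are correct."""
    return sum(ok for _, _, ok in hits) / len(hits)


def protein_accuracy(hits):
    """Fraction of proteins supported by at least one correct peptide."""
    correct = {}
    for _, protein, ok in hits:
        correct[protein] = correct.get(protein, False) or ok
    return sum(correct.values()) / len(correct)


print(peptide_accuracy(hits))  # 0.8
print(protein_accuracy(hits))  # 0.5
```

Correct peptides pile up on the same few proteins, whereas random wrong matches scatter across the database, so errors are concentrated in single-peptide proteins.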

  27. One hit wonders are dubious at best

  28. Outline • Assigning Proteins from Peptide IDs • Correcting for One-Hit-Wonders • Protein False Discovery Rates? • Correcting for Shared Peptides • Publication Standards

  29. Actual Probability Computed Probability Nesvizhskii, A. I.; Keller, A. et al. Anal. Chem. 75, 4646-4658

  30. UNDER estimation Actual Probability OVER estimation Computed Probability Nesvizhskii, A. I.; Keller, A. et al. Anal. Chem. 75, 4646-4658

  31. UNDER estimation Actual Probability OVER estimation Computed Probability Nesvizhskii, A. I.; Keller, A. et al. Anal. Chem. 75, 4646-4658

  32. What if we could score one-hit-wonderness? Nesvizhskii, A. I.; Keller, A. et al. Anal. Chem. 75, 4646-4658

  33. Combining different peptides • Quantify as a score: If different peptides agree: Good! If peptides are one-hit-wonders: Bad! Nesvizhskii, A. I.; Keller, A. et al. Anal. Chem. 75, 4646-4658

  34. Combining different peptides • Quantify as a score: If different peptides agree: Good! If peptides are one-hit-wonders: Bad! • Peptide agreement score: Nesvizhskii, A. I.; Keller, A. et al. Anal. Chem. 75, 4646-4658

  35. Combining different peptides • Quantify as a score: If different peptides agree: Good! If peptides are one-hit-wonders: Bad! • Peptide agreement score: NSP score for peptide (k) is the sum of other agreeing peptides (not k) Nesvizhskii, A. I.; Keller, A. et al. Anal. Chem. 75, 4646-4658
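
The agreement score described on slide 35 can be sketched as follows. This is a minimal reading of the slide, assuming that "agreeing peptides" means the other peptides assigned to the same protein and that each contributes its own probability to the sum; the function name and input layout are hypothetical.

```python
from collections import defaultdict


def nsp_scores(peptide_ids):
    """Return {(protein, peptide): NSP} for (protein, peptide, prob) rows.

    The NSP of peptide k is the summed probability of the other
    peptides (not k) assigned to the same protein.
    """
    by_protein = defaultdict(list)
    for protein, peptide, prob in peptide_ids:
        by_protein[protein].append((peptide, prob))

    scores = {}
    for protein, members in by_protein.items():
        for k, (peptide, _) in enumerate(members):
            scores[(protein, peptide)] = sum(
                prob for j, (_, prob) in enumerate(members) if j != k
            )
    return scores


ids = [
    ("Protein X", "AEPTIR", 0.85),
    ("Protein X", "IDVCIVLLQHK", 0.65),
    ("Protein X", "NTGDR", 0.25),
    ("Protein Y", "LONELYPEPTIDEK", 0.90),  # hypothetical one-hit wonder
]
for key, score in nsp_scores(ids).items():
    print(key, round(score, 2))
```

A one-hit wonder always scores zero no matter how confident its single peptide is, which is what lets the score separate it from multi-hit proteins.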

  36. Protein Prophet Distributions One-hit Wonders Multi-hit Proteins

  37. Protein Prophet Distributions

  38. Protein Prophet Distributions

  39. Protein Prophet Distributions multi-hit proteins (increase prob) in between (keep same) one hit wonders (decrease prob)
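
One way to picture the adjustment on slide 39 is a Bayesian reweighting of each peptide probability by how typical its NSP score is of correct versus incorrect identifications. The sketch below is an assumption-laden illustration, not the ProteinProphet implementation: the likelihood ratios are made-up numbers standing in for the ratio of the two distributions at a given NSP value.

```python
def adjust_for_nsp(prob, likelihood_ratio):
    """Reweight a peptide probability by an NSP likelihood ratio.

    likelihood_ratio = P(NSP | correct) / P(NSP | incorrect) and must be
    positive. A ratio above 1 raises the probability, a ratio of 1
    leaves it unchanged, and a ratio below 1 lowers it.
    """
    weighted = prob * likelihood_ratio
    return weighted / (weighted + (1.0 - prob))


# Hypothetical likelihood ratios for three NSP regions.
ratios = {"multi-hit": 3.0, "in between": 1.0, "one-hit wonder": 0.25}
for label, ratio in ratios.items():
    print(label, round(adjust_for_nsp(0.80, ratio), 3))
```

With these illustrative ratios, an 80% peptide rises when it sits on a multi-hit protein, stays at 80% in the middle region, and falls when it is a one-hit wonder.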

  40. UNDER estimation Actual Probability OVER estimation Computed Probability Nesvizhskii, A. I.; Keller, A. et al. Anal. Chem. 75, 4646-4658

  41. with NSP Actual Probability without NSP Computed Probability Nesvizhskii, A. I.; Keller, A. et al. Anal. Chem. 75, 4646-4658

  42. Brian, I hate math. What do I do?

  43. Option 1: Throw Out One-Hit-Wonders Advantages: Easy, works! Disadvantages: Loss of sensitivity!

  44. Option 2: Use Multiple Filters Filter 1 - Protein Mode • ≥2 peptides/protein • moderate spectrum threshold Filter 2 - Peptide Mode • 1 peptide/protein • high spectrum threshold
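
The two filters can be combined into a single acceptance rule, sketched below. The threshold values (0.80 and 0.95) and the function name are hypothetical; the slide only says "moderate" and "high", so they would need tuning for a real data set.

```python
from collections import defaultdict


def filter_proteins(peptide_ids, moderate=0.80, high=0.95):
    """Accept proteins passing either filter.

    Protein mode: at least 2 distinct peptides at the moderate threshold.
    Peptide mode: at least 1 peptide at the high threshold.
    peptide_ids is an iterable of (protein, peptide, probability).
    """
    best = defaultdict(dict)
    for protein, peptide, prob in peptide_ids:
        # Keep only the best-scoring spectrum for each distinct peptide.
        best[protein][peptide] = max(best[protein].get(peptide, 0.0), prob)

    accepted = set()
    for protein, peptides in best.items():
        n_moderate = sum(1 for p in peptides.values() if p >= moderate)
        n_high = sum(1 for p in peptides.values() if p >= high)
        if n_moderate >= 2 or n_high >= 1:
            accepted.add(protein)
    return accepted


ids = [
    ("Protein A", "PEPTIDEONEK", 0.85),
    ("Protein A", "PEPTIDETWOK", 0.82),
    ("Protein B", "PEPTIDETHREEK", 0.99),
    ("Protein C", "PEPTIDEFOURK", 0.85),
]
print(sorted(filter_proteins(ids)))  # ['Protein A', 'Protein B']
```

Protein C is rejected because its only peptide clears the moderate threshold but not the high one, which is the one-hit-wonder case the stricter peptide-mode filter is meant to screen.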

  45. Option 2: Use Multiple Filters Advantages: More sensitive! Disadvantages: Pretty arbitrary!

  46. Option 3: • Assigning Proteins from Peptide IDs • Correcting for One-Hit-Wonders • Protein False Discovery Rates? • Correcting for Shared Peptides • Publication Standards

  47. Protein FDRs only accurate with >100 Proteins (plot: Uncertainty in Protein FDR; y-axis: Error In FDR Estimation, 1% marked; x-axis: Number of Confidently IDed Proteins)

  48. Histogram of Decoy PROTEIN Matches “2x Decoy” # Protein Identifications “Correct” Protein Score
