200 likes | 225 Views
This proposal outlines a new strategy for protein identification in proteomics by reducing database size and automating data transfer for faster and more reliable results. The aim is to streamline data analysis and expedite peptide identification during experiments, ultimately facilitating comprehensive protein characterization. The proposed method involves utilizing local protein databases, developing a peptide filter, and automating data transfer between different mass spectrometry stages.
E N D
A New Strategy of Protein Identification in Proteomics Xinmin Yin CS Dept. Ball State Univ.
Contents • Background. • Problems. • Proposal. • Conclusion.
What’s Proteomics ? Use of quantitative protein-level measurement of gene expression to characterize biological process and to decipher the mechanisms of gene expression control.
What’s Proteomics ? DNA transcription RNA translation PROTEIN
Current Research in Proteomics Quantitative measurement of proteins expressed in a cell * protein identification * protein quantification
Experimental Flow Chart of Protein Identification cell mass & sequence proteins peptides data processing
Schematic of MALDI MS/MS Laser Detector1 E1 MS1 E2 Peptides MS2 Detector2 Matrix Assisted Laser-induced Desorption Ionization
Problem of Scaling up MALDI MS/MS to Proteomics Number of peptides is huge Human genome (~60,000 proteins) 2.6 X 106 peptides It will take THREE MONTHS to identify all proteins in a human cell.
My Proposal To develop a new data analysis method • Avoid using MS2 if a peptide can be identified in MS1. • Reduce size of peptide database during data analysis
Facts of MALDI MS/MS Experiment • 40% peptides are unique in mass when mass accuracy of MALDI is better than 10ppm. • Analyzing MS1 spectrum is much easier than analyzing MS2 spectrum. • It takes about 1 to 2 min for analysis of a single MS2 spectrum through database.
My Proposal • Using local protein database for data analysis. • Developing a peptide filter. • Automating data transfer from MS1 to MS2
Local Protein Database PURPOSE: 1. Make peptide database small according to experimental methods 2. Do online data processing • Download protein database from public sites. • Digest proteins in computer. • Select peptides according experimental methods, and store them in memory. • Calculate peptide chemical modification.
Generating Peptide Database Protein Database Peptide Database Enzyme Digestion Each protein can produce 30 ~ 100 peptides
Peptide Filter PURPOSE: 1. Do online data processing 2. Reduce peptides database size 2. Reduce number of peptides passed to MS2 • Identify peptide by mass alone. • Sort identified peptides according to proteins. • Delete rest peptides of the identified protein from peptide database. • Pass unidentified mass information to MS2.
Identifying Peptide by Mass Alone Mth Mexp Npep M + + M = Mth – Mexp If (M < ) Npep++ Npep: number of mass matched peptides : mass accuracy of MS1
Reducing Size of Peptide Database If (Npep == 1) Peptide is unique; Deleting peptides from the same protein Else Send peptide to MS2 for sequence information.
F L O W C H A R T
Program Design Computer Language: C++ Classes: Protein Peptide Linked list
Conclusion 1. My program will generate a list of peptides from the protein database. 2. Peptide identification is carried out while doing experiment. 3. Data analysis time will be significantly reduced. 4. This program will make identification of proteins from a cell possible and reliable.