60 likes | 146 Views
Generalized Protein Parsimony. Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown University Medical Center. Traditional Protein Parsimony. Select the smallest set of proteins that cover all identified peptides . Sensible principle, implies
E N D
Generalized Protein Parsimony Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown University Medical Center
Traditional Protein Parsimony • Select the smallest set of proteins that cover all identified peptides. • Sensible principle, implies • Eliminate equivalent/subset proteins • Unique peptides force proteins into solution • Bad consequences for FDR filtered ids • Impact most tools, even probability based ones
Must ignore some PSMs • Improving peptide identification sensitivitymakes things worse! • False PSMs don't cluster PSMs PSMs 2x Proteins 10%
Must ignore some PSMs • A single additionalpeptideshould not force proteins into the solution
Generalized Protein Parsimony • Weight peptides by number of PSMs • Constrainunique peptides per protein • Maximize covered peptides (PSMs) • Can match filtering FDR to uncovered PSMs • Readily solved by branch-and-bound • Reduces to traditional protein parsimony
Match FDR to uncovered PSMs Traditional Parsimony at 1% FDR: 1085 (609 2+-Unique) Proteins