QA/QC of SELDI Data in Clinical Studies. 6-8-2006 CAMDA. Simon Lin Northwestern University. CDC Toni Whistler Suzanne Vernon Northwestern Pan Du Warren Kibbe Simon Lin. Duke Radiology Ned Patz Mike Campa Duke Bioinformatics Patrick McConnell Rich Haney Sal Mungal.
QA/QC of SELDI Data in Clinical Studies 6-8-2006 CAMDA Simon Lin Northwestern University
Acknowledgements • CDC: Toni Whistler, Suzanne Vernon • Northwestern: Pan Du, Warren Kibbe, Simon Lin • Duke Radiology: Ned Patz, Mike Campa • Duke Bioinformatics: Patrick McConnell, Rich Haney, Sal Mungal
Agenda • Challenges in clinical proteomics • Hypothesis: QA/QC is the key • Potential biomarkers of CFS • Future: Online QC
“The standard procedure of using SELDI-TOF mass spectra to construct a classifier is WRONG.” - Dr. Brian Luke, NCI, “The proper construction of SELDI-TOF-based classifiers for early disease detection”, CHI Proteomics Conference Brochure, 2006
“Mass spectrometers can be TEMPERAMENTAL.” - Coombes et al., Nature Biotechnology, 23: 291-292, 2005
Evidence (I) Gusev et al., Analytical Chemistry 67: 1034, 1995
Evidence (II) - Image from Invitrogen.co.jp
Evidence (III) • Same biological sample • Technical replicates • m/z: 5.0K to 8.5K • CAMDA’06 QC serum
Hypothesis Removing spectra of poor quality will improve our capability to detect biomarkers.
How to measure quality • Classification confidence • Correlation coefficient: r² • Principal component analysis • Signal-to-noise ratio (SNR) • After-the-fact: QA • On-the-spot: QC
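As an illustration of the correlation-coefficient measure listed above, the following is a minimal sketch of computing r² between two technical replicates. The function name `r_squared` and the intensity values are made up for the example; the slides do not say how r² was computed in the study.

```python
def r_squared(x, y):
    """Squared Pearson correlation between two equal-length intensity vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length vectors with at least 2 points")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a constant vector has no defined correlation")
    return (sxy * sxy) / (sxx * syy)


# Hypothetical intensities on a shared m/z grid (not real SELDI data).
replicate_a = [1.0, 2.0, 8.0, 3.0, 1.0]
replicate_b = [2.0, 4.0, 16.0, 6.0, 2.0]  # replicate_a scaled by 2
replicate_c = [5.0, 1.0, 2.0, 7.0, 3.0]   # unrelated pattern

print(r_squared(replicate_a, replicate_b))
print(r_squared(replicate_a, replicate_c))
```

A replicate that differs only by an overall scale factor still gives r² close to 1, so r² alone cannot flag intensity scaling problems. It needs at least two spectra of the same sample, which is why it is an after-the-fact check.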
Why Wavelets • Can be directly applied to raw data • Multiscale analysis: noise, signal, baseline
Wavelets • A data projection method: from raw data space to wavelet space (cf. Fourier transform) • A multi-resolution analysis method: fine vs. coarse scales
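To make the projection from raw data space to wavelet space concrete, here is a small sketch of a multi-level discrete wavelet transform. It uses the Haar wavelet because it is the simplest to write by hand; the slides do not say which wavelet family was used, and the input values are invented.

```python
import math


def haar_step(values):
    """One level of the orthonormal Haar DWT on an even-length list."""
    if len(values) % 2 != 0:
        raise ValueError("length must be even")
    root2 = math.sqrt(2.0)
    approx = [(values[i] + values[i + 1]) / root2 for i in range(0, len(values), 2)]
    detail = [(values[i] - values[i + 1]) / root2 for i in range(0, len(values), 2)]
    return approx, detail


def haar_dwt(values, levels):
    """Return (coarsest approximation, details ordered from finest to coarsest)."""
    approx = list(values)
    details = []
    for _ in range(levels):
        approx, detail = haar_step(approx)
        details.append(detail)
    return approx, details


raw = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]  # hypothetical intensities
approx, details = haar_dwt(raw, levels=2)

print("finest-scale details :", details[0])
print("coarser-scale details:", details[1])
print("coarse approximation :", approx)
```

The finest-scale details capture point-to-point fluctuations, while the coarse approximation follows the slowly varying trend. This is the sense in which a single transform lets different scales be examined separately.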
Estimating SNR • Global method: partition the measurements into noise, signal, and baseline • Local method: for each peak, estimate the SNR (Figure panels: raw data, signal, noise)
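The global method can be sketched roughly as follows. This is not the authors' implementation. It assumes a Haar wavelet, takes the noise level from the median absolute finest-scale detail coefficient divided by 0.6745 (a common robust estimator), and uses the median intensity as a crude flat baseline. All function names and the synthetic spectra are hypothetical.

```python
import math
import random
import statistics


def finest_haar_details(values):
    """Finest-scale Haar detail coefficients (scaled pairwise differences)."""
    usable = len(values) - len(values) % 2  # ignore a trailing odd sample
    root2 = math.sqrt(2.0)
    return [(values[i] - values[i + 1]) / root2 for i in range(0, usable, 2)]


def noise_sigma(values):
    """Robust noise estimate: median absolute detail coefficient / 0.6745."""
    magnitudes = [abs(d) for d in finest_haar_details(values)]
    return statistics.median(magnitudes) / 0.6745


def global_snr(values):
    """Tallest peak above a crude constant baseline, divided by the noise level."""
    sigma = noise_sigma(values)
    if sigma == 0:
        raise ValueError("noise estimate is zero, so SNR is undefined")
    baseline = statistics.median(values)  # assumption: flat baseline
    return (max(values) - baseline) / sigma


def synthetic_spectrum(noise_sd, seed, n=512):
    """Made-up test data: flat baseline of 10 plus one Gaussian peak of height 50."""
    rng = random.Random(seed)
    return [
        10.0
        + 50.0 * math.exp(-((i - n / 2) ** 2) / (2 * 4.0 ** 2))
        + rng.gauss(0.0, noise_sd)
        for i in range(n)
    ]


clean = synthetic_spectrum(noise_sd=1.0, seed=0)
noisy = synthetic_spectrum(noise_sd=10.0, seed=0)
print(round(global_snr(clean), 1), round(global_snr(noisy), 1))
```

Real spectra have a curved baseline, so a coarse-scale wavelet approximation would be a more natural baseline estimate than the median used here.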
Raw spectrum → QA step: DWT-based SNR estimation → Baseline removal and normalization → Spectrum alignment → CWT-based peak detection → Classification and other data analysis; Biomarker identification by statistical tests
Improved biomarker detection • QA cutoff: SNR > 5
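Applying the cutoff is a simple filter once each spectrum has an SNR value. The sketch below assumes the SNR values are already computed. The spectrum identifiers and numbers are invented for illustration, and only the threshold of 5 comes from the slide.

```python
SNR_CUTOFF = 5.0  # threshold stated on the slide


def split_by_quality(snr_by_spectrum, cutoff=SNR_CUTOFF):
    """Separate spectra into those that pass QA (SNR > cutoff) and those that fail."""
    kept = {name: snr for name, snr in snr_by_spectrum.items() if snr > cutoff}
    rejected = {name: snr for name, snr in snr_by_spectrum.items() if snr <= cutoff}
    return kept, rejected


# Hypothetical SNR values, not results from the study.
example = {
    "spectrum_01": 12.3,
    "spectrum_02": 4.1,
    "spectrum_03": 5.0,  # exactly at the cutoff, so it fails the strict test
    "spectrum_04": 7.8,
}

kept, rejected = split_by_quality(example)
print("kept:", sorted(kept))
print("rejected:", sorted(rejected))
```

Only the kept spectra would go on to baseline removal, alignment, and peak detection.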
Advantages of SNR • Complementary to outlier-resistant statistics • Online QC • Simple • Can be done in real time
Conclusions • “Mass spectrometers can be TEMPERAMENTAL.” - Coombes et al., Nature Biotechnology, 23: 291-292, 2005 • “PROVISIONAL nature of conclusions of most serum proteomics studies.” • ONLINE QUALITY CONTROL
Ben Bolstad, PLM Image Hall of Fame http://plmimagegallery.bmbolstad.com
Must-read papers • The ovarian cancer controversy • Wavelet smoothing
Proteomics Challenges Hilario et al., Mass Spectrometry Reviews, 25: 409-449, 2006