550 likes | 691 Views
Problems in Biological Imaging: Opportunities for Signal Processing. Jelena Kovačević bimagicLab Center for Bioimage Informatics Department of Biomedical Engineering Department of Electrical and Computer Engineering Carnegie Mellon University. Cast of Characters. Issues. Tools. Framework. ?.
E N D
Problems in Biological Imaging: Opportunities for Signal Processing Jelena Kovačević bimagicLabCenter for Bioimage InformaticsDepartment of Biomedical EngineeringDepartment of Electrical and Computer EngineeringCarnegie Mellon University
Issues Tools Framework ? What can we do? Tasks The Roadmap Revolution in biology
Revolution in Biology • Focus in biology • Vertical to horizontal approach • “Omics”: genomics, proteomics, … • Fluorescence microscopy • Hugely successful • Allows for live-cell imaging • Fluorescent markers, starting with GFP • Allows for collection of high-dimensional data sets • 2D images and 3D volumes • At multiple time instants • Multiple channels • Analysis and interpretation • Cumbersome, nonreproducible, error prone
PSF h A/D Denoising RestorationDenoising + Deconvolution Deconvolution RegistrationMosaicing SegmentationTracking AnalysisModeling Goal • Imaging in systems biology • Use informatics to • acquire, store, manipulate and share large bioimaging databases • Leads to • automated, efficient and robust processing • Need • Host of sophisticated tools from many areas
Revolution in biology The Roadmap Issues Noise levels and types Lack of ground truth Large deviations Low definition and contrast Wide range of time-frequency features
Noise Levels and Types • Shift towards noninvasive • Data collected farther from the source • Signals typically corrupted by high levels of noise • Weak biosignals • Standard SP techniques not used but even those will not work well with such signals • Types of noise • Electrical, neuronal, … • Modeling of noise a problem
Lack of Ground Truth • Shift towards noninvasive • No access to ground truth
Large Deviations • Humans and/or animals as ``customers'‘ • Wide range of states considered ``normal'‘ • Looking for is a range rather than a single state • Large deviations from the range of normal states may characterize what we are looking for delayed abnormal normal
Low Definition and Contrast • Images typically have low contrast and are poorly defined • Lack of consistent edges
Wide Range of Time- and Frequency-Localized Features • Bioimages • Global behaviors together with spikes and transients • Puts time-frequency tools to the test • “Speckled” nature---stochastic representation
Issues Revolution in biology The Roadmap Framework • Continuous-domain image processing • From continuous to discrete domain • Discrete-domain image processing
Continuous-Domain Image Processing PSF h • Specimen (object) vs image of it (projection) • LSI systems • Impulse response of the microscope: PSF • Fourier view • FT or FS A/D Denoising RestorationDenoising + Deconvolution Deconvolution RegistrationMosaicing SegmentationTracking AnalysisModeling
Resolution in microscopy Filtering before sampling Sources of uncertainty From Continuous to Discrete PSF h A/D Denoising RestorationDenoising + Deconvolution Deconvolution RegistrationMosaicing SegmentationTracking AnalysisModeling
Discrete-Domain Image Processing PSF h • LSI system, digital filtering • Consider the signal as • Infinite signal with finite number of nonzero coefficients • Finite signal • Fourier view • DTFT • DFT A/D Denoising RestorationDenoising + Deconvolution Deconvolution RegistrationMosaicing SegmentationTracking AnalysisModeling
Issues Revolution in biology Tools Framework The Roadmap Signal and image representations Fourier analysis Gabor analysis Multiresolution analysis Data-driven representation and analysis
“Holy Grail” of signal analysis/processing Understand the “blob”-like structure of the energy distribution in the time-frequency space Design a representation reflecting that Dirac basis ER WP f Actin FT STFT WT t Signal Representations
Data Driven Representation & Analysis • Use representations based on training data and automated learning approaches • Wavelet packets • PCA & variations • ICA • …
Estimation Framework • Random variations introduced by system noise, artifacts, uncertainty originating from the biological phenomena lead to statistical methods • Seek the solution optimal in some probabilistic sense • Optimality criterion • MSE, often depends on unknown parameters • Bayesian framework, MAP estimators
Issues Revolution in biology Tools Framework Tasks The Roadmap • Acquisition • Deblurring, denoising, restoration • Registration and mosaicing • Segmentation, tracing and tracking • Classification and clustering • Modeling
Acquisition • Issues in acquisition of fluorescence microscope images • Increase resolution • Total data acquisition is reduced, speeding up image acquisition • Allows a higher frame rate (increased temporal resolution) • Allows us to spend more time acquiring the regions of interest (which gives increased spatial resolution) • Acquire for longer periods • Acquisition process damages both the signal (photobleaching) and the cell (phototoxicity) • Efficient acquisition reduces the total amount of data acquired, thus reducing damage to the cell • This allows us to observe cellular processes for longer periods • Intelligent acquisition • Acquire only where and when needed adaptivity • Model driven (microscope model & data model)
Reconstruction Efficient Acquisition Modeling Knowledge Extraction Model-Driven Acquisition • Acquisition • Grid acquisition • MR adaptive acquisition • Markov Random Fields • Example-based enhancement • Reconstruction • Simple interpolation methods • Wavelet reconstruction • Model-based reconstruction
Problem Why acquire in areas of low fluorescence? Acquire only when and where needed Measure of success Problem dependent Here: Strive to maintain the achieved classification accuracy Approach Mimic “Battleship” Accuracy Compression Ratio MR Acquisition [Merryman & Kovačević, 2005]
2. IntelligentAcquisition No Model satisfactory? 1. Model Building Model Yes as well as design intelligent acquisition systems based on those models Develop a mathematical framework and algorithms to build accurate models of fluorescence microscope data sets Efficient Acquisition and Learning of Fluorescence Microscope Data Models Use all the input from the microscope to model the data set 2. Choose acquisition regions that allow us to construct an accurate model in the shortest amount of time
Efficient Acquisition and Learning of Fluorescence Microscope Data Models [Jackson, Murphy & Kovačević, 2007] • Predict the distribution of fluorescence in the subsequent frame and acquire accordingly • Predict likelihood of object moving to any given position • Acquire those positions with the highest likelihood • Too small an acquisition region may not find the object • Too large an acquisition region is inefficient • Motion models • Three motion models commonly observed in practice • Random walk • Constant velocity • Constant acceleration
Efficient Acquisition and Learning of Fluorescence Microscope Data Models • Learning the motion model • Prediction: Based on current beliefs about motion model, find likelihood of each object appearing at any given pixel in the subsequent frame • Acquisition: Acquire the pixels that have the highest overall likelihood of containing an object • Observation: Observe the actual location of each object, if found • Update: Use this information to update our beliefs about the motion models for each object
Efficient Acquisition and Learning of Fluorescence Microscope Data Models • Known motion model • Single object, random walk of known variance • Probability distribution of it appearing in any given location in the subsequent frame • Acquisition regions capture the locations where the object is expected with the highest probabilities
Efficient Acquisition and Learning of Fluorescence Microscope Data Models • Known motion model • If the object is detected, repeat, centering the new acquisition region at the object’s most recent location • If the object is not detected, estimate where it is • Probability distribution given that the object was not in the acquisition region
Efficient Acquisition and Learning of Fluorescence Microscope Data Models • Known motion model • Predict this object’s location in the next frame • Probability distribution • 1D case: choose two disconnected acquisition regions • 2D case: choose to acquire between the two black circles
Deblurring, Denoising & Restoration • Microscope images contain artifacts • Blurring caused by a PSF • Noise from the electronics of digitization • Deblurring/deconvolution • Widefield microscopy • Effect of depth • Denoising • Deconvolution + Denoising = Restoration
Registration & Mosaicing • Registration • Find spatial relationship and alignment between images • Mosaicing • Used when fine resolution is needed within a global view • Stitching together pieces of an image • Usually requires registration, given overlapping pieces
Segmentation, Tracing & Tracking • Segmentation • Methods used: thresholding and watershed • Edge-based, region-based, combination • Active contours • Tracing • Mostly tracing of axons • Typical, path following approaches • Fail in the presence of noise • Tracking • Molecular dynamics and cell migration • Tracking of objects over time
Separate objects of interest from each other and the background Fundamental step in microscopy Hand segmentation Not reproducible Not tight Piecewise linear Cannot compute statistics Time-consuming Current standard Watershed segmentation Segmentation
Active Contour Segmentation • Active contour algorithms • Contour comparable to an elastic string • Moved under external and internal forces • External: derived from the image (edges) • Internal: geometric properties of the contour (curvature) • Level Set method: A way to track the contour as it evolves • Positive inside the contour (mountain) • Negative outside the contour (valley) • Zero on the contour, C embedded at its zero (sea) level Fc< 0 < 0 > 0 n = 0 Fc> 0
STACS • Combines energy minimization approach with statistical modeling • Model matching • Pixels inside and outside the contour follow different statistical models • Modified STACs for fluorescence microscopy images • No edge information • No obvious shape information • Segmentation driven by statistics of the image and contour smoothness • MSTACS: Our level-set evolution equation • Topology needs to be preserved TPSTACS
Successful Problem Extremely slow Solution MRSTACS TPSTACS: Results [Coulot, Kirschner, Chebira, Moura, Kovačević, Osuna & Murphy, 2006] Hand-segmented TPTACS
h ↓2 h ↓2 g ↓2 h ↓2 g ↓2 horizontal g ↓2 vertical MRSTACS • Decompose image to L levels • Smoothing renders cell easier to discern • Detect cells using morphological operations • Get coarse version of contour (TPSTACS) • Refine contour iteratively faster segmentation • Coarse result < 3 sec • Fine result < 30 min 2D Filter bankLevel 1 decomposition
A Critical Review of Active Contours • Flexible • Can be tuned to be accurate • Adapt to topological changes in the image • But… • Tuning of parameters is involved • Updating the level set function – inefficient • What is the ‘contour’ in a digital image? • Discrete topological rules – external constraints can cause abruptness • Multiresolution – how do we reconstruct the level set function? • New math needed
A slight blur Original Image Enough to discern the cell boundary Too much blur – Edges rounded Active Mask Framework: No Contours • Fluorescence microscope images speckled in nature • Estimate densities of bright pixels in local neighborhood at different scales • Recast computation of force as a transformation • No need for the time consuming extension function • For image f, transform T is • Windowing function and scale factor a • Different conditions (cell lines, resolution, etc.) Different and a • TPSTACS: Rectangular , a = 1 and suitable operands
Active Masks: Results • Success • Initialization: Level set function is identically zero • Iterations: 3 • Time taken: 6.5 sec per iteration HeLa cells – Total protein image HeLa cells – Membrane protein image
Active Masks • Pros • Framework suited to digital images • Can be made specific with the choice of suitable forces, windows and scale factors • Performance not critically dependent on initialization • Easy and fast to compute • Translation, dilation and rotation invariance can be preserved • Cons • Topology preservations hard • Multiple active mask framework
Multiple Active Masks • Initialization • Random initialization with M»M0 masks, where M0 = expected number of objects in the image • Evolution: driven by distributor functions • Can incorporate multiresolution/multiscale • Convergence • Experimentally • Working on a proof
Results of STACS on Different Modalities Yeast DIC Cardiac MRI: Endocardium and epicardium • True Positive • False Positive • False Negative Brain fMRI Axial Coronal Saggital
Classification Problems in Bioimaging • Determination of protein subcellular location patterns[Chebira, Barbotin, Jackson, Merryman, Srinivasa, Murphy & Kovačević, 2007] • Detection of developmental stages in Drosophila embryos[Kellogg, Chebira, Goyal, Cuadra, Zappe, Minden & Kovačević, 2007] • Classification of histological stem-cell teratomas[Ozolek, Castro, Jenkinson, Chebira,, Kovačević, Navara, Sukhwani, Orwig, Ben-Yehudah & Schatten, 2007] • Fingerprint recognition [Hennings, Thornton, Kovačević & Kumar , 2005] [Chebira, Coelho, Sandryhalia, Lin, Jenkinson, MacSleyne, Hoffman, Cuadra, Jackson, Püschel & Kovačević , 2007] Develop an automated system capable of fast, robust and accurate classification
FeatureExtraction FE Classification C Generic Classification System MR WeightingAlgorithm shorthand MR W Multiresolution Classification • Hypothesis: Better classification accuracy obtained if we use the space-frequency information lying in the MR subspaces • Compute features in the MR-decomposed subspaces (subbands) instead • Would like to use wavelet packets • Do not have an obvious cost measure • Do it implicitly instead
FE C MR W MR Block • Grow full tree to L levels • Use all nodes • MR Bases • DWT • DFT • DCT • … • MR Frames • SWT • DT-CWT • DD-DWT • Our design: LTFT
Lapped Tight Frame Transforms • Build MR transforms for these problems • Not many nonredundant ones exist • Seed them from higher-dimensional bases
FE FE C C MR MR W W Feature Extraction and Classifier • Feature Extraction • New Haralick texture features (T3, 26 features) • Morphological features (M, 16 features) • Zernike features (Z, 49 features) • Classifier • Neural networks • No hidden layers
FE C MR W Weighting Procedure • Local decisions • Decision vectors for each subband of each training image containing C numbers • Goal: combine local decisions into a global one • Algorithms • Open form (iterative) • Closed form (analytical) • Per data set • Per class • Pruning criteria
MR significantly outperforms NMR MRF outperform MRB Per-Dataset CF slightly outperforms OF Trend is flat→ T3 set enough Determination of PSL Patterns: Results