1 / 49

Joachim Frank

Joachim Frank. 1. The Quest for High(er) Resolution 2. SNR and SSNR Estimates for EM Noise Processes. HHMI, Health Research, Inc., Wadsworth Center Starting April 1, 2008: Department of Biochemistry and Molecular Biophysics

wolfe
Download Presentation

Joachim Frank

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Joachim Frank 1. The Quest for High(er) Resolution 2. SNR and SSNR Estimates for EM Noise Processes HHMI, Health Research, Inc., Wadsworth Center Starting April 1, 2008: Department of Biochemistry and Molecular Biophysics and Department of Biology, Columbia University

  2. Before going into the subject matter, a few introductory slides on single-particle reconstruction. (“Single particle” here means “single, unattached” – the number of particles or molecules is normally huge) For details on this technique, see J. Frank, Three-Dimensional Electron Microscopy of Macromolecular Assemblies, Oxford University Press 2006

  3. This is the kind of data we are dealing with in “single-particle reconstruction”

  4. Once a reference is established, orientation determination can be done by 3D projection matching

  5. Projection matching normally starts with 83 projections on the half-sphere

  6. Angular Refinement, by Iterative 3D Projection Matching Subsequently, angular refinement is achieved by iteration, with smaller and smaller angular increments.

  7. (refinement by defocus groups)

  8. Part of the Bill Baxter’s ctfgroup interface: grouping by defocus Defocus groups are used for expediency, but they need to be narrow enough so as to avoid blurring of high-res information.

  9. Reconstruction by Defocus Groups At the very end, the reconstructions from all the defocus groups are merged in a CTF correction step (by Wiener Filtering)

  10. Cryo-EM 11.5 Å resolution; 73,000 particles Gabashvili et al. (2000) Cell The processing method described is now standard. It was used to obtain this reconstruction of the E. coli ribosome

  11. Math Problem, in a Nutshell, at Current Resolutions (~10 Å) Computation of 3D density map: 100,000 x (200 x 200) measurements with SNR ~0.1 – 0.4 # of particles # of pixels (projections) 200 x 200 x 200 + 5 x 100,000 unknowns # of voxels # of parameters (in 3D density map) (shifts, angles) What happens if we want to go to 3 Å resolution?

  12. Resolution measures & criteria -- Fourier ring/shell correlation k = spatial frequency Δk = ring width or shell thickness

  13. Conservative; SNR =1 R. Henderson, X-ray Too optimistic Resolution criteria based on FSC

  14. Extrapolation to 3 Å -- Millions of particles? We’re facing an obstacle!

  15. What can be controlled/improved experimentally? versus What can be controlled/improved by image processing with a given dataset? (or: how do we want to spend our time?)

  16. Processing by defocus groups, or variations in height These envelope functions show how the final resolution is affected by the width of the defocus range, or by variations in height of particle within the ice

  17. Question of Window Size Reconstruction of uncorrected data is done under the assumption that all information is present. But part of the information lies outside the particle, because of the finite width of the point spread function! Radius of point spread function has to be accommodated in the window. Window size = particle diameter + diameter of the point-spread function = point-spread function associated with the CTF

  18. Exploration of reconstruction strategy“High-resolution project” Use small dataset (50,000) to optimize processing, with the idea to switch to larger dataset (130,000) Parameters of image processing: • Sampling (strategy to switch from coarse to fine, to save CPU) • Window size (to avoid CTF effects) • Angular spacing (how to reduce) • Amplitude correction in each step of refinement vs. at the very end Final parameters emerging from this exploration: angular step 0.5 degrees, angular search range 2 degrees 7 iterations of refinement: 920 hours on a 48-node cluster Regular window size OK Sampling (decimation) can be switched mid-way from coarse to fine

  19. Resolution measurement issues • Apply soft mask to reconstruction to get true resolution! • Evidence for dependence of resolution R vs. log(N) • Is lin-log dependence general? • Is it allowed to extrapolate from half to full dataset? • Verification of molecular detail: phosphorus atoms show up as bumps

  20. “Clutter” outside

  21. To be replaced – first two curves are incorrect! Other curves OK.

  22. Final reconstructiion, at 7.5 A:

  23. Progression of resolution, from the beginning to the end of the project.

  24. EF-Tu E. coli 70S•aa-tRNA•EF-Tu•GDP•kir at 7.5 Å 130,000 particles 7.5 Å (FSC=0.5)

  25. P A E EF-Tu

  26. X-ray missing helix Protein S2 Except for a missing helix at the periphery, protein S2 is exactly as it appears in the X-ray structure.

  27. Extrapolation of FSC resolution to full set 130,000 65,000 Resolution is a linear function of Log(Number of particles) -- Ribosome.

  28. GroEL (Stagg et al.) Resolution is a linear function of Log(Number of particles) -- GroEL.

  29. 6.7 Å

  30. 6.7 Å (LeBarron et al., in prep.) 10 Å (Valle et al., NSB 2003)

  31. Cutoff Cryo-EM X-ray Cryo-EM X-ray 6.7 Å 5.0 Å 6.7 Å 5.0 Å Phosphorus atoms recognized as bumps!

  32. Definition of EF-Tu domains (stereo)

  33. SNR and SSNR estimation • Why? • We need better noise models for phantoms, to fine-tune classification • William Baxter

  34. Given: two images representing independent realizations of a signal embedded in noise. Align the two images, then compute the cross-correlation coefficient. Estimate the SNR α from the cross-correlation coefficient by α = ρ/(1-ρ) ρ: α: 0 0 1 ∞ 0.5 1 (Frank and Al-Ali, Nature 1975)

  35. Compare the images of two molecules that were identified, by 3D projection matching, as having the same orientation. • These images differ by (i) structural noise, (ii) shot noise, and (iii) digitization noise. • How to disentangle the three different noise portions?

  36. Structural noise: the irreproducible portion of the object. • Conformational changes • Superimposed structure of substrate (ice, thin carbon)

  37. Experimental design: • Three experiments: (i) take images of two molecules deemed in the same orientation (as determined by projection matching) (ii) take two images of the same field (iii) digitize the same image of a molecule twice

  38. These are the kinds of images we are using in the SNR estimation

  39. Projection matching: two particles falling into the same orientation bin represent the same molecule structure plus “structural noise” Using projection matching to a template, we can find molecules that nominally have the same orientation, and can bye compared in the SNR estimation.

  40. SNR estimation by double experiments Molecules in same view Same field imaged 2x Same micrograph digitized 2x

  41. Formula for the case of two noise subprocesses: obtaining SNR of subprocess #I from measured SNRs of complete process (αcomp) and subprocess #2 (αsub2): 1 αsub1 = -------------------------------- (1+1/ αcomp )/(1+1/αsub2)-1

  42. Results of Estimation: • SNR for digitization noise: 27 [for ZI microdensitometer -- will be different for different scanners] • SNR for shot noise: 0.09 • [Is in the range measured before, e.g. 0.1 (Fu et al. JSB 2007)] • Note: for CCD recording, digitization and shot noise cannot be separated. • SNR for structural noise: ~2 [Reasonable, since the “noise” contains all irreproducible components: conformational differences, thin carbon]

  43. SSNR Estimates, using the same formula in Fourier space along rings double scan double exposure double structure The curves show how the SNR is distributed as a function of spatial frequency

  44. SSNR for “true” film-recorded Shot Noise (averaged over 4 defoci)

  45. SSNR for “true” Structural Noise (averaged over 3 defoci)

  46. Outlook • Atomic resolution soon in reach. How soon is soon? Asymmetric structures much more tough than those with symmetry. • Efficient classification is key! • Extrapolation of resolution might be possible in general – does lin-log rule always hold? • Develop strategy with subsets is a good idea!

  47. People in my group who have contributed to this research: • William Baxter (SNR) • Robert A. Grassucci (HR) • Jamie LeBarron (HR) • Haixiao Gao (HR) • Tapu Shaikh (HR) • Jayati Sengupta (HR) • Funding: HHMI, NIH, NCRR HR = high-resolution project SNR = SNR and SSNR estimation

More Related