410 likes | 800 Views
Interferometric Imaging in CASA An Introduction for ALMA reduction. Steven T. Myers. National Radio Astronomy Observatory Socorro, NM. Radio Interferometric Imaging in theory…. Classic Radio Interferometer. e.g. The EVLA or ALMA a bunch (27 to 50) antennas connected together
E N D
Interferometric Imaging in CASAAn Introduction for ALMA reduction Steven T. Myers National Radio Astronomy Observatory Socorro, NM
Radio Interferometric Imaging in theory…
Classic Radio Interferometer • e.g. The EVLA or ALMA • a bunch (27 to 50) antennas connected together • independent elements Earth rotation synthesis
baseline vector B Interferometer Baselines • Baseline vector B in “aperture plane” • coherent signal applied to interferometer would produce plane-wave interference “fringe” on sky with angular period l/B q = λ /B interferometer naturally decomposes sky into plane waves!
each point on aperture gets correlated with each point on other aperture autocorrelations measure uv spacings inside D/l The Aperture Plane • Correlate wavefronts in plane of apertures (Fourier transform of sky) • dish optics sum aperture plane at focus • visibility is cross-correlation of wavefronts of the 2 apertures visibility contains range of baselines from closest to furthest parts of apertures interferometer cannot measure “zero-spacing” w/o autocorrelations
aperture xcor width 2D/λ From sky to uv-plane The uv-plane is the Fourier Transform of the tangent plane to the sky F-1 2.5o baseline u = B/λ ℓ = 2p|u|= 2p|B|/λ F Sky Plane x = (x,y) Fourier Plane u = (u,v)
Example: VLA observes Jupiter • A 6cm VLA observation of Jupiter: Fourier transform of nearly symmetric planetary disk bad data
Reconstruction of the Sky • Visibilities and the Sky v = AF-1 s + n • A known instrumental response, but is not invertible • instrumental noise n is a random variable • The issues: • unknown random noise n • convolution due to size of A in uv domain • incomplete sampling of uv-plane by visibilities • Maximum Likelihood - Optimal Map mMLE = ( ATN-1A)-1ATN-1v = R-1dd = Hv No inversion R singular (at best ill-conditioned) for fully sampled s
The Dirty Map • Grid onto sampled uv-plane d = Hv = Hs + nd • H should be close to HMLE, e.g. H = ATN-1 : A ~ A • AT should sample onto suitable grid in uv-plane • reminder: need only be approximate for gridding • Invert onto sky “dirty image” d = Fd = R s + ndR = FRF-1 • image is “dirty” as it contains artifacts • convolution by “point spread function” (columns of R) • multiplication by response function (diagonal of R) • noise
VLA point-spread function (PSF) • The VLA is an example of a sparsely filled array • uv-plane gaps are treated as zeroes, cause “sidelobes” in PSF • many solutions for sky that fit data, “dirty image” is principal solution • must use “deconvolution” techniques to “clean” image snapshot uv-coverage PSF “dirty beam” “dirty” image “clean” image Example: VLA 30s snapshot discovery data for gravitational lens CLASS B1608+656 (Myers et al. 1995, ApJL, 447, L5-L8)
Image, uv, and Data Spaces • image plane uv-plane visibilities • operators F , H , A handle these transformations • not all operators have inverses (H and A do not) • example: model image m • first transform sky model to uv-plane m = F-1 m • then project onto the visibilities (data space) vm = Am = A F-1 m • form residual dvm = v-vm = A ( s - m ) + n • finding “best model” will involve minimizing this residual
Classic Deconvolution • uv-plane CLEAN algorithm (“major cycles”) • iterate on residual images removing point models • initial residual data, and model: dv0 = v m0 = 0 • form dirty image: d0 = FHdv0 • locate peak and residual and put fraction f into model dm1 = f M d0 mask M : 1 at max, else 0 • increment model: m1 = m0 + dm1 • form cumulative visibilities and residual v1 = Am1 = AF-1 m1dv1 = v – v1 • form new dirty residual image: d1 = FHdv1 • and repeat until final residual image dfis noise-like
image-plane CLEAN • CLEAN in the image plane (minor cycles) • initial model: m0 = 0 • form dirty image: d0 = FHdv0 • locate peak and residual and put fraction f into model dm1 = f M d0 mask M : 1 at max, else 0 • increment model: m1 = m0 + dm1 • subtract from dirty image to make dirty residual image d1 = d0 – Rdm1 R ~ R (the PSF) • and repeat until final residual image dfis noise-like • size of kernel R (Hogbom=full-size, Clark=quarter size) • specified by psfmode in CASA
CLEAN variations • Cotton-Schwab (CS) CLEAN • break into major (uv-plane) & minor (image-plane) cycles • in CASA initiated by imagermode=‘csclean’ • some options ( ‘mosaic’, ‘mfs’ nterms>1) hardwired cs • clean in minor cycles to a threshold where max residual is some fraction of starting max residual • in CASA will be cyclefactor x max psf sidelobe • more major cycles = more accurate cleaning but slow • poor PSF, simple image structure = lower cyclefactor (<1) • good PSF, complex image structure = raise cyclefactor (>1) • purpose: correct errors from gridding & minor cycles • the transform back from model visibilities is as accurate as we wish to make it
CLEAN Example • Jupiter 6cm – interactive cleaning in CASA as imaging proceeds, include regions containing emission in mask breaks between interactive cycles are major cycles
Gridding kernels in CLEAN • This is a choice: d = Hv • Default kernel: H0 = N-1 • includes only noise (inverse variance) weighting • Optimal (mosaic) imaging: H = ATN-1 • uses aperture function (A ~ A) • will grid mosiacs onto same uv-plane • can correct for known pointing errors • Frequency synthesis kernel (MFS) • add kernels corresponding to Taylor expansion terms • G0k= 1 G1k= ln(nk/n0) for each channel k
Mosaicing in the sky and uv planes offset & add phase gradients
Mosaicing kernel • the usual equation (aperture and offset term) • note that we assume “phased up” at each pointing xk ! • kernel B and its transform (Fourier shift theorem) • B = F-1BF is the “mosaicing” kernel (A-projection) • if offset xk is unknown, then this is a “pointing error” • offset for R and L polarizations is the “squint” term
Mosaicing considerations • In CASA, imagermode=‘mosaic’ • ftmachine=‘mosaic’ uses A-projection kernel • grids to single uv-plane, optimally weights fields, most efficient • can be some aliasing, keep mosaic to inner quarter of imsize • ftmachine=‘ft’ does standard image-plane shift+add • takes much longer, not recommended • mosaic uses csclean, watch cyclefactor • uses approximate single PSF for entire mosaic • will “correct” in major cycles • uses POINTING table when present (FIELD for phases) • Outputs • standard: .image, .model, .psf • .flux (PB plus weights plus extra PB from A-convolution) • .flux.pbcoverage (just the PB effects) • e.g. to correct to flux on-sky use .image/.flux
MEM and CLEAN • CLEAN • algorithm: find peak in residual image; add fraction to model; form new residual data & residual image; iterate • performance: good on compact emission, difficult for extended • Maximum Entropy Method (MEM) • algorithm: for pixel values p : maximize entropy -S p ln p ; minimize c2(p) to encourage “smooth” extended emission • performance: complicated, suppresses spiky emission, but fast • CLEAN and MEM use point (pixel) basis • complete basis – unique representation of image
Sparse Approximation Imaging • Problem: find a model to represent the sky as efficiently as possible, subject to the data constraints and within the noise uncertainty, possibly also subject to prior constraints. • some problems (like ours) cannot be efficiently reconstructed using orthonormal bases (like pixels or Fourier modes) • extensive literature on this! • use non-orthogonal bases: multiscale (e.g. Gaussians) • choose dictionary of model elements (atoms) • efficiency: find a representation that uses the fewest number of atoms
Example: MEM versus MS-CLEAN Restored Residual Error Maximum Entropy MS Clean
The Future of Multiscale Methods • Algorithms • mostly iterative, starting from a blank model • “greedy” methods make locally optimal choices at each step • MS-CLEAN is a greedy algorithm in this class! • dictionary of points and Gaussians on different scales • is essentially a “Matching Pursuit” (MP) algorithm (e.g. Tropp 2004) • Key Research Area in CASA • new arrays are designed for high dynamic range & fidelity • we need efficient, robust, and accurate multiscale methods • integrate multiscale (MS) with multispectral (MFS)
Calibration and Imaging • Some effects corrupt the visibilities • most are on a per-antenna basis, other per-baseline • antenna based effects can be “self-calibrated” out • The Measurement Equation (ME) • the Jones matrices J contain the corruptions to V • there are different corruption terms to the J • gain G, pol leakage D, ionosphere F, parallactic angle P • can be direction dependent, additive RFI & correlator errors
Calibration in Image Plane • Calibration errors show up as artifacts in image plane: • given an approximate model for the image we can solve for the errors “self-calibration” through iteration Before Correction After Correction
Pointing Corrections • Example of direction-dependent errors: • VLA antennas have ~10’’ pointing residual • affects high-dynamic range imaging (>104) • also “squint” between R and L beams • Work by Sanjay Bhatnagar (NRAO) • Simulation of 1.4GHz EVLA observations • Residual images • Before correction: Peak 250Jy, RMS 15Jy • After correction: Peak 5Jy, RMS 1Jy • Can incorporate into standard self-cal • Computational cost ok for now • See EVLA Memos 100 & 84 • Implementing in CASA, testing underway
Polarization: Vis to Stokes in XY basis • for antennas with parallactic angles fi and fj • and for identical parallactic angles f between antennas: Linear Feeds: linear polarization in all hands, circular only in cross-hands
Polarization Leakage • Primary on-axis effect is “leakage” of one polarization into the measurement of the other (e.g. X Y) : X X + diX Y • but, direction dependence due to polarization beam! • Calibrate out on-axis leakage and put direction dependence in “beam” • example: expand XY basis with on-axis leakage • remember XX,YY contain IQU and XY,YX contain QUV • ideally the “d”s are ~1-2% but can be worse, should be stable
Primary Beam: full field polarization • VLA primary beams • Beam squint due to off-axis system • Instrumental polarization off-axis • AZ-EL telescopes • Instrumental polarization patterns rotate on sky with parallactic angle • Limits polarization imaging • Limits Stokes I dynamic range (via second order terms) • must implement during imaging Green contours: Stokes I 3dB, 6dB, black contours: fractional polarization 1% and up, vectors: polarization position angle, raster: Stokes V
I V Q U Simulations on a complex model • VLA simulation of ~ 1 Jy point sources + large source with complex polarization (“Hydra A”) • Long integration with full range of parallactic angles • equivalent to weak 1.4GHz source observed with EVLA • Antenna primary beam model by W. Brisken • See EVLA memo 62
I V Q U 1-D Symmetric Beam • dynamic range limited by errors from incorrect approximate primary beam Dynamic Range ~200 using symmetric beam model
I V Q U 2-D Polarized Beam • need to use accurate polarized beam to reach high fidelity and dynamic range Dynamic Range ~104 using 2-D beam model
CASA CLEAN Imaging Details • MFS (mode mfs) • nterm=2 compute spectral index, 3 for curvature etc. • needed for bandwidths ~5% or more (S/N dependent) • tt0 average intensity, tt1 alpha*tt0, alpha images output • takes at least nterms longer (image size dependent) • Multiscale (set non-zero multiscale list) • scales are in units of pixels • usually set to be multiples of synth. beam size • e.g. for 3x oversampling of beam: • multiscale = [0,3,6,12,24] • takes at least nscales longer (x nterms?) • can be tricky to get to work right
CASA CLEAN logistics • clean can be restarted from current state • if imagename used before and files exist and same size • will first recompute residuals from model • user can input some starting conditions • previous mask (e.g. from previous clean) • previous modelimage • key toggles • mode: mfs, channel, velocity, frequency • imagermode: ‘’, csclean, mosaic • psfmode: clark, hogbom • gridmode: advanced stuff, eventually a-projection
Spectral Cube Considerations • mode: channel, velocity, frequency • sets the grid of planes in output image cube • will apply doppler corrections on the fly • can set frame info, restfreq, etc. • data: taken in sky frequency (terrestrial) frame • velocity: include doppler shifts from: • Earth rotation: few km/s (diurnal) • Earth orbit: 30 km/s (annual) • Earth/Sun motion w.r.t. LSR (e.g. LSRK, LSRD) • maybe galactic rotation to extragalactic frames • these shifts are time dependent • “Doppler Tracking/Setting” adjust sky freq while observing • can shift and regrid data before imaging : cvel
Continuum Subtraction • If you have a viable continuum model • e.g. via using mode mfs • can use uvsub to remove from data before imaging • BUT: must have really good accurate model (not likely) • In practice, subtract “continuum” in data • use task uvcontsub or uvcontsub2 • particularly if strong short-baseline emission • these fit polynomial to each vis spectrum and subtract • uvcontsub2 can cross spw boundaries • need to know line-free channel ranges • if uvcontsub need to fit in each spw • “continuum” not generally imageable • e.g. no closure !!!
Odds and Ends • Oversampling PSF • at least 2.5x, I like 3-4x (whichever rounds well), some use 5x, no strong impact (but can make imaging longer) • Boxing • helps guide clean where to pick out real emission, important when psf sidelobes high/complex and/or when image has complicated structure • if initial calibration poor, shallow careful clean with boxing will get better model for selfcal (don’t burn in artifacts) • some cases using csclean (w/cyclefactor) you can “turn it loose”, but be careful before doing this • we are working on autoclean / autoboxing
Flotsam and Jetsam • What, no clean components? • CASA clean increments the .model image • does not keep track of each iteration component • does not report total cleaned flux until end • does not keep separate multiscale components • might later store cc’s as component lists, but not now • you can mask/edit .model image • you can supply initial “model” using modelimage • this can come from anywhere! e.g. single dish • Stopping clean • when you get residual that are noiselike :) or a mess :( • ideally you should get within 2x thermal • if not, probably calibration errors, bad data, or dyn. range issues
Fin • Final words • the proof of the quality of the data is in the final imaging, you generally cannot guess this from metrics on the calibrators (particularly when selfcal is possible) • Anything else? • Documentation • inline help <task> • online cookbook (massive but in depth) • casaguides