1 / 16

Correlation in the MWA

Correlation in the MWA. US VLBI Technical Coordination Meeting Roger Cappallo 2007.5.14. MWA Introduction. Low frequency (80-300 MHz) array in outback of Western Australia 500 dual pol. dipole tiles spread across 1.5 km of desert

Download Presentation

Correlation in the MWA

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Correlation in the MWA US VLBI Technical Coordination Meeting Roger Cappallo 2007.5.14

  2. MWA Introduction • Low frequency (80-300 MHz) array in outback of Western Australia • 500 dual pol. dipole tiles spread across 1.5 km of desert • Analog signals sampled at 640 MHz, broken into coarse (~1.3 MHz) channels, of which 31 MHz are transmitted to correlator

  3. coarseF fineF PFB: fine channels, reorder, route X, short med sum rotate, long sum

  4. Large-N Considerations • Correlator design based on ideas developed independently at CSIRO and MIT • Joint white paper (Bunton, Cappallo, Morales, 2005) applying ideas to SKAMP and MWA • Large N correlator has two central problems: computation and routing • How to bring >500K signal pairs together for multiply and add? • Answer: Replicate (500x!) signals at as low a level as possible, in hierarchical fashion

  5. Signal Replication • In order of increasing cost: • local traces in FPGA (x16) • chip-wide traces in FPGA (x8) • multiple traces on PC board (x4) • across a backplane • off-board signals (e.g. multicast packets)

  6. Temporal Mismatch • FPGA multiplier rate ~250 MHz • Data sample rate is 10 KHz • Factor of 25000 mismatch! • Answer: Time multiplex over multiple (256) station pairs, and multiple (96) frequency channels

  7. Correlator Requirements • Complex cross-multiply and accumulate data from 524800 signal pairs • Each pair comprises 3072 channels with 10 KHz bandwidth • 10 KHz bandwidth  30 Km wavelength, thus array is  / 20 within a channel, regardless of direction! • Max fringe-rate of 0.109 Hz for 1.5 km baseline at 300 MHz would allow dump rate of 2 s (which is v.1 int. period); 0.5 s used for solar & transients, longer baselines, minimize coherence loss • No fringe rotation or delay compensation necessary in hardware!

  8. System Dataflow

  9. Numerology • 1 correlator board has 8 SX-35’s, each with 136 cells, which can process a total of 278528 signal pairs • Each pair of correlator boards processes 96 channels (0.967 MHz) • 32 board pairs required for all 3072 channels (30.94 MHz) • Requires five 23” or six19” AdvancedTCA shelves

  10. Two boards cover all baselines • 2boards: m andn • CMACchips 0..7 • axis of symmetry along hypotenuse • reverse input order to get lower diagonal half

  11. Cells mapped onto SX-35 chip • Separate groups of 256 antennas to X and to Y • Uses 136 of 192 available DSP slices

  12. Correlator cell • 16 X & 16 Y 8 bit input values in distributed RAM, for a single point in time • complex 4+4 bit multiply encoded into single 36 bit hardware multiply • 18 bit adder implemented in local fabric • short-term sums ping pong in block RAM: 2 comp x 18 bit x 256 prod x 2 buffers = 18 Kb

  13. Data Ordering - tnf96t512a1024 • Within a cell 2 sets of 16 antenna samples for each time point are cross multiplied and added into 256 short term accumulators • Process is repeated for 512 t points, when accum is dumped to LTA and cleared • Above done for each of 96 channels, then repeated for next time block, etc.

  14. Voltage Beamformer • Not needed for 32T system, so detailed development has not yet begun • For each frequency channel, need to form a linear combination of all 1024 antennas: V = an x gn • 16 dual polarization beams are formed (but treated internally as 32 single polarization beams) • Computational load ~1.0 TCMAC/s • only 6% of correlator load of 16.2 TCMAC/s • possibly will be done in routing chips (FX-20’s)

  15. Beamformer Gains • If every beam, baseline, and channel gain were independently specified, it would require 1 TB/s of coefficient data! But… • Complex gains gn are a product of • instrumental gain - slow t variation, gi(,) • ionospheric phase - medium t variation, gs(,), f-2 freq dependence • geometric phase - rapid f variation (linear), gg(,) • Gain flow from RTS to beamformer must be efficient, taking advantage of characteristics of each term • e.g. geometric phase could be specified as 2 numbers: phase at low f end, and increment per channel

More Related