1 / 1

Software Correlators

Using GPGPUs as Correlators. Dominic Ford, Jongsoo Kim & Paul Alexander. Introduction. Antennas. Antennas. Antennas.

amina
Download Presentation

Software Correlators

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Using GPGPUs as Correlators Dominic Ford, Jongsoo Kim & Paul Alexander Introduction Antennas Antennas Antennas Cross correlators are a highly computationally intensive part of any radio interferometer. Although the number of operations per data sample is small, correlation occurs before time integration and so the sample rate is very high – it must be at least twice the radio-frequency bandwidth of the telescope. The number of operations is proportional to both the bandwidth of the telescope, and the number of baselines being correlated. For SKA1, and operation rate of many petaflops will be required in the correlator alone. Time-domain FFT Cross correlation High data rate. Low complexity. Time integration Lower sample rate. High complexity. Spatial FFT and Imaging Figure 1: A schematic FX correlator. Software Correlators At present, FPGA and/or ASIC based correlators are widely used. These offer highly transistor-efficient solutions as logic units are hard-wired to solve the particular correlation problem required. However, this efficiency comes at the expense of flexibility. Software correlators, running on general-purpose CPUs or GPUs offer the ability to easily add antennas, change the number of baselines correlated, or tune the spectral resolution of the telescope as required. The specification of the telescope can be scaled with time as more computing power becomes available. Several implementations already exist. LOFAR uses an IBM Bluegene supercomputer as its correlator. An MPI-based correlator for use on x86 compute clusters has been developed by Deller et al. (2007) at Swinburne and is in use at the LBA. Figure 3: The MWA has already trialled a GPU-based correlator (Wayth et al. 2007). Figure 2: LOFAR uses an IBM Bluegene/P as its correlator (Romein et al. 2010). Prototype systems NVIDIA Tesla cards In Cambridge, we are developing a CUDA-based correlator to run on NVIDIA GPUs. We hope to deploy a small-scale trial system on the Arcminute Microkelvin Imager (AMI). Graphics Processing Units (GPUs) may offer the computational power needed in a correlator for the SKA. Today, NVIDIA’s Tesla cards already offer 100s GFLOPs per card, and a maximum data rate of 64 Gbit/s through the card’s 16-lane PCI-Express-2 connection. They easily surpass more traditional CPUs in cost per FLOP and power consumption per FLOP. This will allow us to optimise our use of the memory architecture of the Tesla cards and the distribution of data between cards. It will also allow us to assess the performance which could be achieved in a correlator for SKA1. Figure 4: An NVIDIA C1060 Tesla card, offering a theoretical peak performance of 933 GFLOP/s in single precision. The correlation problem is sufficiently parallel that such highly-parallel architectures can be readily used. Moreover, GPU architectures are optimised for performing the multiply-and-add operations that we require. For telescopes with fewer than around 300 antennas, a GPU correlator would be throttled by the speed of PCI-Express data transport, but for larger systems such as SKA1, they are an attractive option. Figure 5: The memory model of an NVIDIA Tesla card. Email your comments to: D.Ford@mrao.cam.ac.uk

More Related