1 / 15

A Streaming Sensor Challenge Problem for Ubiquitous High Performance Computing

A Streaming Sensor Challenge Problem for Ubiquitous High Performance Computing. 2011 High Performance Embedded Computing Workshop 21 September. Daniel P. Campbell, Daniel A. Cook, Brian P. Mulvaney dan.campbell@gtri.gatech.edu.

sinjin
Download Presentation

A Streaming Sensor Challenge Problem for Ubiquitous High Performance Computing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Streaming Sensor Challenge Problem for Ubiquitous High Performance Computing 2011 High Performance Embedded Computing Workshop 21 September Daniel P. Campbell, Daniel A. Cook, Brian P. Mulvaney dan.campbell@gtri.gatech.edu This work was supported by DARPA under contract HR0011-10-C-0145. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressly or implied, of the Defense Advanced Research Projects Agency or the U.S. Government.

  2. Ubiquitous High Performance Computing • DARPA program to develop next generation of energy-efficient high throughput computers • Nominal goals: 1 petaflop, 50 gigaflops/watt Linpack, 1 cabinet, 57KW whole system • Modular and scalable: embedded to cabinet • Complete goals TBD based on Defense needs

  3. Ubiquitous High Performance Computing: CHASM • OBJECTIVE • Drive the focus of UHPC architecture designs toward scalable capability on defense applications. • SUMMARY • The CHASM team is designing challenge problems that embody the computing needs of 2018 defense missions, and developing new execution models that can support million-way parallelism and maintain performance with frequent faults. • PARTNERS • GT College of Computing • GT Electrical and Computer Engineering • Louisiana State University • Lawrence Livermore National Laboratory • Oak Ridge National Laboratory

  4. CP #1 – Streaming Sensor Sensor0 Sensor1 Sensor2 Sensorn-1 “We are going to find ourselves in the not too distant future swimming in sensors and drowning in data.” KnowledgeGeneration Front-end Processing Back-end Processing M UX ActionableKnowledge Dynamic DigitalSensor Data . .. Sensor Data Input Subsystem UHPC Processing System Large volumes of data must be distilled to manageable, actionable knowledge in order to benefit the warfighter

  5. Sensor0 Overview - SSCP Front-end Processing Back Projection Back Projection KnowledgeGeneration Coherent Change Detection CFAR Digital Spotlight Back-end Processing Back Projection Back Projection Difference Image Detections P Images P=2+ • Inputs: • Sensor collections • For each of many pulses: • Sensor position • Measured returns at many time instances • Grid of image point locations Outputs: Location of each detection

  6. Why Backprojection? • FFT-based reconstruction techniques exist • Require either linear or circular collections • Only modest deviations can be compensated • Requires extra steps to get georeferenced imagery • Images only onto planar surface • Procedure • Attempt to fly a perfectly straight line • Compensate for unwanted motion • Form image using Fourier-based method backpropagation • Register and interpolate image onto map coordinates • Flexibility • Can image directly onto map coordinates without postprocessing • Expanded operating envelope • Can image in adverse environmental conditions during maneuvers

  7. Digital Spotlight P1 x S1 (Spatial) P1 x S1 (Frequency) FFT Range Shift Elwise Multiply: e-2πΔfi/K FFT P1 x S1 (Frequency) P1 x S1 (Spatial) Range Gate Cross-Range Downsample (Decimating FIR) P1 x S2 (Spatial) P2x S2 (Spatial)

  8. Image Formation Back Projection For i = {Ix} For j = {Iy} For k = {P3} R = range to i,j S = sample at (k,R) S *= eaR Out(i,j) += S P3x S3 (Spatial) FFT and Oversampled IFFT P3x S4 (Spatial) Ix x Iy 8

  9. Digital Spotlight Loading Impact 9

  10. CCD(1) - Affine Registration Xo = Z1 + Z2Xi + Z3Yi + Z4Xi2 + Z5Yi2 + Z6XiYi M x M Complex Image Find Global Affine Transform AZ = b Solve for Z 2D Correlate x Nc (2Sc-1x2Sc-1) 2D FFT Elwisemult 2D IFFT RcxRcMax Mag Control Points: Ncx4 (Locations, Offsets) A: Ncx6 [1 x y x2x2xy] (x,y = Control point locations) b: Ncx2 (each row =final x,y position) Warp & Sample C = AZ Bilinear sample @ C Z = Warp Weights: 6x2 C: Warped loc A(i,:) = [1 x y y x2 y2xy] i = 1..M2 One row per pixel Z: 6x2 • Rc = ~2*% x M • Round up to NPOT • Sc = Rc x r • Scf = 16* x r • r = 4* • Nc = ceil(M/Rc)2 • *Heuristics

  11. CCD(2) – Thin Spline Registration • Rf = 2^ceil(log2(Rc)/2) • Scf = Rf x 4* • Nf = ceil(M/Df)2 • Df = 200* (easy) • ~1000* (hard) • Rn = 5* (~1.5m) • *Heuristic M x M Complex Single Precision Image Form L L = [K P; P’ O] K(I,j) = U(|Xi-Xj|) U(n) = n2ln(n) P(i,:) = [1 xiyi] O = 0 Xk:CtrlPt. Locs 2D Correlate x Nf (2Scf-1x2Scf-1) 2D FFT Elwisemult 2D IFFT RfxRfMax Mag Control Points: Nfx4 (Locations, Offsets) L: (Nf+3)x(Nf+3) Y: (Nf+3)x2 Warp & Sample DXx,y = WNf + WNf-1x + WNf-2y + SWiU(|Pi-(x,y)|) Pi: location of ith control point Bilinear sample @ Xx,y Find Weights LW = Y Each row of Y: control point + offset Rows Nf+1 -> Nf+3 are zeroes Solve for W W = Warp Weights: (Nf+3) x 2 X = pixel location

  12. CCD(3) – Coherence / CFAR • Coherence • For each pixel: NcorxNcor 2D neighborhood correlation between current frame and reference frame • Constant False Alarm Rate (CFAR): • For each pixel, calculate whether this pixel’s value is lower than Tcfar% of pixels in NcfarxNcfar neighborhood. If so, emit detection

  13. CP #1 – Scenario Parameters

  14. CP #1 – Scenario Loading

  15. SSCP Status • Reference implementation completed & available to UHPC participants • Includes challenge problem & data generator • Accelerated / Parallel implementations under development • Investigating addition of EO/IR & Fusion • Adjustments to registration process under investigation • Adjustments to load-influencing parameters under consideration

More Related