20 likes | 153 Views
CMAQ Parallel Performance. CMAQ Sept 2003 release, cb4_ae3_aq chemical mechanism 118 x 118 xy grid (2 km), 21 z-layers, 84 species 24-hour run, output every 2 hrs HP Linux cluster, 3.0 GHz Xeon (dual-proc), 4 Gb/node, Myrinet, PVFS
E N D
CMAQ Parallel Performance • CMAQ Sept 2003 release, cb4_ae3_aq chemical mechanism • 118 x 118 xy grid (2 km), 21 z-layers, 84 species • 24-hour run, output every 2 hrs • HP Linux cluster, 3.0 GHz Xeon (dual-proc), 4 Gb/node, Myrinet, PVFS • New version on 1 proc: 1 day in 6.7 hrs 1 year in 102 days • New version on 32 procs: 1 day in 26 min 1 year in 6.8 days Original full Computation Original write Modified write Modified full
Performance Improvement Ideas May affect serial or parallel performance: Aerosol: alternative to numerical quadrature Chemistry: try damped Newton-Raphson solver Vdiff: array orderings for cache effects Xadv/Yadv: alternate formulation of PPM for reduced communication I/O: binary vs NetCDF (10x) multiple procs read & write pre- and post-processing of files 6.7 hrs 40 min