260 likes | 403 Views
RRTMGP : A High-Performance Broadband Radiation Code for the Next Decade A 3-year project funded by the Office of Naval Research Eli Mlawer AER David Berthiaume AER Robert Pincus Univ. of Colorado Brian Eaton NCAR Ming Liu ONR Mike Iacono AER. Overview of Talk.
E N D
RRTMGP: A High-Performance Broadband Radiation Code for the Next Decade A 3-year project funded by the Office of Naval Research Eli Mlawer AER David Berthiaume AER Robert Pincus Univ. of Colorado Brian Eaton NCAR Ming Liu ONR Mike Iacono AER
Overview of Talk • Motivation for Project • Basics of radiation calculations • Radiation calculations in GCMs • Overview of AER radiation codes, validation • RRTMG – Background, accuracy, GCM Implementations • RRTMG – Computational issues • Introduction to Other Talks
Motivation for project • Climate/weather prediction models need physics parameterizations • Trend toward greater complexity in GCMs - e.g. greater temporal, spatial, vertical resolution • Accuracy and computational efficiency are linked • Radiative processes important • means by which planet stays in long-term equilibrium with the universe • differential distribution of absorbed solar radiation drives equator-to-pole temperature gradient • small radiative imbalance due to increasing GGHs drives modern-era warming • Radiative processes complex • Fast parameterizations developed, but still can take ~30% of computational time in GCMs • Even with that, radiation package is not called at every time step e.g. NAVGEM: SW code called every 2 hrs, LW code every 2 hrs per 6 grid cells Conclusion: Radiation is a key bottleneck in predictive modeling. Goal: Develop next generation radiation code for multi-core, vector- and cache-based computational architectures.
Basics of Monochromatic Radiative Transfer (1) Rin(ν) Rout(ν) Single layer (P, T) Gases (e.g. H2O, CO2) B(ν,T)E(ν) T(ν) B(ν,T)E(ν) T(ν) Rin(ν) Rout(ν) E(ν) is layer emissivity at this frequency; B(ν,T) is the Planck function at ν and T T(ν) is layer transmissivity at this frequency 1 – E(ν) = T(ν) = exp(-τ(ν)) τ(ν) is the layer optical depth at this frequency (assumes no scattering)
Basics of Monochromatic Radiative Transfer (2) Zenith-sky optical depths (Univ. of Oxford) τ(ν) = WCO2 * kCO2(P,T) + WH2O * kH2O(P,T) + ... W is number of molecules in layer; k is absorption coefficient at P and T
Basics of Monochromatic Radiative Transfer (3) CO2 band H2O Lines H2O Continuum Ozone band H2O band Measured downwelling radiances measured at surface (Oklahoma, 7/22/01) Differences between measured and calculated radiances (model circa 1999) Differences between measured and calculated radiances (current model)
Basics of Monochromatic Radiative Transfer (4) • By the 1990’s, validations with spectral radiation measurements with line-by-line models had provided confidence in model quality • U.S. Department of Energy Atmospheric Radiation Measurement (ARM) program instrumental • dedicated to the collection of high-quality observations of geophysical properties at a number of ground sites and utilizing these observational data sets to improve physical parameterizations in climate models. • initial emphasis was on understanding the spectrally detailed distribution of radiation at the Earth’s surface and how this distribution depends on the state of the atmosphere. • Comparisons of high-spectral-resolution radiometric measurements with LBLRTM led to significant improvements in the spectroscopy and other physics underlying such models. Conclusion: model is accurate under a wide range of atmospheric conditions • Key point: we know “the answer” for clear-sky RT • Development of RRTMG at AER was funded by ARM (1995-2000)
RRTMG – Basics of correlated-k method (1) • Rearrange absorption coefficients (k’s) in ascending order (i.e. a k-distribution) • - defines a mapping from ν- to g-space • Break into sub-intervals (‘g-points’), compute average k for each, store in LUTs
RRTMG – Specifics (2) • Some considerations • Full spectrum broken up into bands based on absorbing species in region • - 16 in LW, 14 in SW • k-distributions for different layers in same column will not generally be spectrally correlated (i.e. ν- to g-space mapping not fixed) • - Monochromatic RT equations not obeyed, contributes to model’s error budget • Other spectrally dependent values (e.g. Planck function) included by applying ν- to-g mapping • LUTs needed for a wide range of P, T, and ratios of abundances of key species (η) • Per g-point: • troposphere: 13 P’s x 5 T x 9 ratios for bands with two ‘key’ species; otherwise 13 P’s x 5 T’s • stratosphere: 47 P’s x 5 T x 5 ratios for bands with two ‘key’ species; otherwise 47 P’s x 5 T’s • For each layer, use LUTs to interpolate in ln P, T, and (possibly) η • Tuning occasionally needed
RRTMG – Interpolating in binary species param. η(3) Different interpolation methods used for outer 1/8’s and inner ¾ of η-space
RRTMG – Relationship with LBLRTM(4) Validations through Radiative Closure Studies Line-by-line modeling Parameterizations Spectral Measurements • Satellite – e.g. AIRS, TES, IASI • Aircraft – e.g. HIS • Ground – e.g. AERI, TCCON • Satellite – e.g. CERES • Ground – e.g. ARM LBLRTM Optical depths Broadband Measurements RRTMG OSS ApplicationApplication Radiances Fluxes and and Jacobians Heating Rates
RRTMG – Relationship with LBLRTM (5) Absorption line parameters (HITRAN) Continuum absorption models Cross-sections
RRTMG – Spectral Bands (6) Selected Spectral Bands in RRTMG (troposphere)
RRTMG - Accuracy (7) ‘Effective’ Accuracy Equivalent to LBLRTM
RRTMG - Accuracy (8) CIRC RT Intercomparison Percentage errors in calculated flux for each participating model RRTM – Code #1 RRTMG – Code #2
RRTMG – Cloud Approach (9) • The Monte-Carlo Independent Column Approximation provides an efficient and unbiased method to handle cloud inhomogeneity and vertical correspondence. • Each g-point is matched with a single element from the pdf of cloud amounts and vertical overlap
RRTMG – Moving Toward Parallelization (11) • RT in a GCM: repetitive, independent calculations. • For each time step: • number of ‘columns’ = grid cells (~104) X g-points (~200) • number of optical depths = ‘columns’ X layers (~70) • RRTMG: developed in 1990s for CPUs • has many conditional branches aimed at minimizing the number of floating point operations • is not vectorized • is generous in its use of memory copying • Conclusion: it’s time to bring RRTMG into the modern computational era
RRTMGP – Our Team (1) • AER • Eli Mlawer: PI, lead developer of RRTMG, PI for NASA RRTMGPU project • David Berthiaume: developer of RRTMGPU, advanced programming skills • Mike Iacono: great experience wrt RRTMG implementation in atmospheric models • University of Colorado • Robert Pincus: developer of Psrad and McICA, worked with many modeling centers to improve treatment of radiation in atmospheric models • National Center for Atmospheric Research (NCAR) • Brian Eaton: significant experience wrt infrastructure and interfaces for physics packages, dynamical cores, and physics-dynamics coupling in CAM/CESM • Naval Research Laboratory • Ming Liu: substantial experience with physics packages and code structure in NAVGEM
RRTMGP – Project Schedule/Milestones (2) • Year 1 • Redesign RRTMG to be more flexible and easier to optimize • - Refactor RRTMG to be more modular, using the structure of PSrad as a guide • - Focus on optimality across vector- and cache-based architectures, including the use of high-performance libraries where possible. When possible, GPU and MIC drop-in versions of routines will be created. • - Vectorize all routines across columns, with no preference as to vertical direction • - Design code to allow calculation of full broadband irradiance or selected sets of subintervals • - Redesign to be done in consultation with collaborators at NRL and NCAR • • Begin recoding of RRTMGP • • Develop large test suite of clear and cloudy cases • Year 2 • Complete recoding of RRTMGP • Revise gas optics scheme -- modified interpolation scheme that is more uniform across bands • • Thorough testing and validation on a variety of platforms • Year 3 • • Work with NRL and CESM collaborators to test performance of RRTMGP across the full range of platforms on which their respective GCMs are run
RRTMGP – Elements of the Development (3) • Rebuilding RRTMG for the computers of the next decade • Redesign, refactoring, parallelization • • Revised interfaces • • Uniform table interpolations for optical depths • Allow spectral sampling • Complier directives for OpenACC and OpenMP • Fast math routines • CMAKE build system • Validation and benchmarking will be key. • Implementation and performance testing in CESM and NAVGEM. • Need to coordinate with NASA project to improve physics (regenerate k’swith updated spectroscopy, fix known issues, add LW scattering)
N is a large number = nlat X nlong (~104) or = nlat X nlong X ‘spectral’ dimension of RT code (~106) GCM Schematic Model Physics (1) Next time step ……… N independent calculations Atmospheric State COUND Project: Acceleration of RRTMG with GPU Technology Radiation Code ……… Radiative Fluxes Heating Rates Model Physics (2) Timing statistics for porting of cldprmc subroutine to GPU Overhead not included; speed-up shown here may not be representative of final speed-up of RRTMG Calculation of radiative fluxes, heating rates in a GCM can take ~30% of time Individual radiation calculations in a GCM are independent – ideal for GPU Steps in redevelopment of RRTMG to run on GPU – first LW code, then SW Create timing/accuracy benchmarking code Restructure so that each component is parallelized Pass global data to GPU; implement kernel functions Port each component/subroutine to GPU - Main RT code running on GPU; subroutine ‘cldprmc’ completed Optimize memory throughput; evaluate – repeat as needed Deliver to collaborators at GMAO (Suarez/Oreopoulos) Work with GSFC to evaluate performance of GEOS-5 with GPU/RRTMG With GPU/RRTMG implemented, GEOS-5 can exploit the time savings by introducing other new physics packages
Improvement in Accuracy of LBLRTM Two AERI Measurements
LBLRTM: Improvements to CO2 Spectroscopy Mid-Upper Trop Mid-Upper Trop Mid-Upper Trop Mid-Upper Trop Previous version (2006) • Q branch line coupling • HITRAN 2000 CO2 parameters Mean residuals from 36 AIRS ARM TWP nighttime cases using Tobin et al. best estimate sonde profiles CO2 v3 CO2 v2 Latest version (2011) • P, Q and R line coupling • Niro et al. [2005] • Widths, line coupling coeffs • Lamouroux et al. [2010] • Tashkun positions, intensities • Flaud et al. [2003] • Updated CO2 and H2O continua • Mlawer et al., [2012] Improved agreement (Obs - Calc) and consistency across spectral bands! (Input profiles supplied by L. Strow and S. Hannon).
An example related to flexibility • A solution for cloud-scale models • Monte Carlo Spectral Integration (Pincus and Stevens 2008) approximates G ~ 100 calculations every N time steps with G’ ~ 1 calculations every time step (slide from Robert Pincus)