Exploring Alternative Approaches to CFD

Exploring Alternative Approaches to CFD Dimitri Mavriplis Dept.of Mechanical Engineering University of Wyoming Laramie, WY

Workshop Statement • Barriers to Progressand Technology Drivers • Misguided Government and Industry Policies • NASA: No base R&D budget, mission/vehicle emphasis • Boeing: Lack of interest in technology/internal expertise • Dell Computer Corp. • Multiple Scales • Anisotropy, Solver Stiffness, Mesh Refinement, Turbulence • Software Complexity • MDO, multiphase flow front tracking, education

Technology Drivers • High-Order Accuracy (Discontinuous Galerkin) • Attempt to mitigate multiple-scale issue • Large gains possible through more efficient approximations (temporal and spatial) • Requires efficient solvers to realize gains • More complex software • Low-Order Accuracy: Simpler Models (Lattice Boltzmann) • Simpler software • More complex physics: Mesoscales • Also requires efficient solvers

CFD Accuracy for Drag Prediction (AIAA workshop results) • Current simulations suffer from : • Discretization errors • Physical Modeling errors • Apparently, one does not dominate the other (still)

Apparently, one does not dominate the other (still) • Grid convergence (or lack thereof) from: AIAA-2003-3400: CFD Sensitivity Analysis of a Drag Prediction Wing/Body Transport Configuration, E.M. Lee-Rausch, P. G. Buning, J. H Morrison, M. A. Park, C. L. Rumsey and D. J Mavriplis

Higher-Order Methods • Simple asymptotic arguments indicate benefit of higher-order discretizations • Most beneficial for: • High accuracy requirements • Smooth functions

Motivation • Higher-order methods successes • Acoustics • Large Eddy Simulation (structured grids) • Other areas • High-order methods not demonstrated in: • Aerodynamics, Hydrodynamics • Unstructured mesh LES • Industrial CFD • Cost effectiveness not demonstrated: • Cost of discretization • Efficient solution of complex discrete equations

Motivation • Discretizations well developed • Spectral Methods, Spectral Elements • Streamwise Upwind Petrov Galerkin (SUPG) • Discontinuous Galerkin • Most implementations employ explicit or semi-implicit time stepping • e.g. Multi-Stage Runge Kutta ( ) • Need efficient solvers for: • Steady-State Problems • Time-Implicit Problems ( )

General Idea • Use element Jacobi as Smoother for p-multigrid solver (AIAA-2003-3989: Atkins, Helenbrook and Mavriplis) Solution of linear (2D) wave equation using 2 grid p-MG system for p=3 (Mavriplis and Atkins, 2002)

Multigrid Solver for Euler Equations • Develop efficient solvers (O(N)) for steady-state and time-implicit high-order spatial discretizations • Discontinuous Galerkin • Well suited for hyperbolic problems • Compact-element-based stencil • Use of Riemann solver at inter-element boundaries • Reduces to 1st order finite-volume at p=0 • Natural extension of FV unstructured mesh techniques • Closely related to spectral element methods

Discontinuous Galerkin (DG) Mass Matrix Spatial (convective or Stiffness) Matrix Element Based-Matrix Element-Boundary (Edge) Matrix

Steady-State Solver • Kijui=0 (Ignore Mass matrix) • Block form of Kij: • Eij = Block Diagonals (coupling of all modes within an element) • Fij = 3 Block Off-Diagonals (coupling between neighboring elements) Solve iteratively as: Eij (ui n+1 – uin ) = Kij uin

Steady-State Solver: Element Jacobi Solve iteratively as: Eij (ui n+1– uin) = Kij uin Duin+1 = E-1ij Kij uin Obtain E-1ij by Gaussian Elimination (LU Decomposition) 10X10 for p=3 on triangles

DG for Euler Equations • Mach = 0.5 over 10% sin bump • Cubic basis functions (p=3), 4406 elements

Entropy as Measure of Error • S  0.0 for exact solution • S is smaller for higher order accuracy

Error Convergence for Bump Case • P=1: Final Slope: 1.26 • P=2: Final Slope: 2.8 • P=3: Final Slope: 3.2 • Based on L2 norm of entropy • H-convergence • 382 elements • 1500 elements • 2800 elements • 4406 elements

Element Jacobi Convergence • P-Independent Convergence • H-dependence

Improving Convergence H-Dependence • Requires implicitness between grid elements • Multigrid methods based on use of coarser meshes for accelerating solution on fine mesh

Spectral Multigrid • Form coarse “grids” by reducing order of approximation on same grid • Simple implementation using hierarchical basis functions • When reach 1st order, agglomerate (h-coarsen) grid levels • Perform element Jacobi on each MG level

Hierarchical Basis Functions • Low order basis functions are subset of higher order basis functions • Low order expansion (linear in 2D): • U= a1F1 + a2F2 + a3F3 • Higher order (quadratic in 2D) • U=a1F1 + a2F2 + a3F3 + a4F4 + a5F5 + a6F6 • To project high order solution onto low order space: • Set a4=0, a5=0, a6=0

Hierarchical Basis on Triangles • Linear (p=1): F1=z1, F2=z2, F3=z3 • Quadratic (p=2): • F4=-0.5z1z2, F5=-0.5z2z3, F6=z3z1 • Cubic (p=3): F7=1/12z1z2(z1-z2), F8=1/12z2z3(z2-z3), F9=1/12z3z1(z3-z1), F10=z1z2z3

Spectral Multigrid • Fine/Coarse Grids contain same elements • Transfer operators almost trivial for hierarchical basis functions • Restriction: Fine to Coarse • Transfer low order (resolvable) modes to coarse level exactly • Omit higher order modes • Prolongation: Coarse to Fine • Transfer low order modes exactly • Zero out higher order modes

Euler Equations • System of Non-Linear Equations • Element Jacobi and Spectral MG Implemented for Euler Equations • FAS Multigrid (non-linear MG) • Possible tradeoffs using Linear MG • Requires storing entire Jacobian • Reduced cpu time (freeze Jacobian) • Bump and Airfoil Test Cases • Includes Curved Surface Elements • Necessary for maintaining accuracy

Element Jacobi Convergence • P-Independent Convergence • H-dependence

Multigrid Convergence • Nearly h-independent

Future Multigrid Work • Reduced complexity of element Jacobi • Develop implicit time integration scheme • High-order time discretization • Multigrid driven implicit solver • H-P Adaptivity in Multigrid Context • Towards a General Multi-resolution Framework

Using Alternate Models: The Lattice Boltzmann Approach • Lattice Boltzmann methods increasingly popular • Origins from Lattice Gas Automata (LGA) methods • LBM abandons particle formulation in favor of particle distribution function approach • Mesoscale Method (neither macro, nor molecular) • Shown to converge to Navier-Stokes equations in macroscopic limit • Facilitates physical modeling (and software complexity) • Front capturing in place of fitting • Physics incorporated at mesoscale level (collision term)

Front Capturing with LBM • c/o M. Krafczyk, TU Braunschweig, Germany

Governing Equations of Fluid Dynamics • Statistical Mechanics Description (Meso-Scale) • Boltzmann Equation (BGK approximation) • f=f(x, ,t): Distribution function in velocity space • = speed of gas molecules • Linear convection terms • Non-Linear Collision (source) term (local)

Lattice Boltzmann • 9 velocity model: • Velocities allow exact jump to neighboring grid point • Proven mathematically to reduce to Navier-Stokes equations in asymptotic limit • Obtain macroscopic values as:

2 Step LBM Implementation • Implemented as Collision step followed by advection step (reminiscent of LGA method) • Collision step is entirely local : is a non-linear function of the • Advection: Shift in memory with no flops!

Motivation • Formally: 2nd order accurate in space • Formally: 1st order accurate in time • Remarkable delivered accuracy in many cases • Numerically efficient • Local collision term • Linear convection term: shift operator with no flops! • Corresponds to explicit time-stepping approach • Notoriously inefficient solution strategy

LBE for 2D Driven Cavity • Streamlines for 129 x 129 grid (Re=100)

Solution of Steady-State LBE • 16,000 time steps to machine zero on 33x33 grid

Solution of Steady-State LBE • 16,000 time steps to machine zero on 33x33 grid • 4 to 5 times slower on 129x129 grid

Solution of Steady-State LBE • 16,000 time steps to machine zero on 33x33 grid • 4 to 5 times slower on 129x129 grid • Even slower on 513x513 grid

Motivation • Develop efficient (optimal) solution strategies for LBE • LBE simplicity + Asymptotic efficiency • Steady-state problems • Decouple space and time • Preserve exact LBE spatial discretization • Obtain identical final result with faster solution method

Steady-State LBE Discretization • Drop time-level index n: • In residual form: • Linearize and solve with Newton’s method: • Iterative inversion of Jacobian matrix using Jacobi, GG, Multigrid…

Linear Multigrid Driven Newton Scheme • Efficiency of linear system • 150 multigrid cycles • Quadratic convergence of non-linear system

Linear vs. Non-Linear Approach • Linearized LBE is expensive • Block 9x9 matrices for collision term • Looses efficient 2 step non-linear evaluation

Operation Count • Linear system iteration (Jacobi): • 9x9 matrix-vector product: 2x9x9= 162 ops • Other terms : 63 ops • Total: 225 ops • Non-Linear LBE Time Step: • Macro variables: 18 ops • Collision: 75 ops • Advection: 0 ops • Total : 93 ops

Non-Linear Multigrid Approach • Use LBE time-step to drive non-linear multigrid algorithm (non-linear equivalent of Jacobi) • Jacobi has poor smoothing properties • Implement non-linear under-relaxed Jacobi in 3 steps:

Non-Linear Multigrid (FAS)

Non-Linear Multigrid Approach • On coarse grids, need to include defect-correction : • Implement as modification to under-relaxation stage • Preserve first two steps • Implemented by calling existing LBE code on all grid levels

Multigrid Cost • 1 MG cycle = 24 LBE steps • 4 pre-smoothing and 4 post-smoothing steps (factor 8) • Work of coarse grids (W-cycle in 2D: factor 2) • Remaining factor: 1.5: Under-relaxation, intergrid transfers • Memory Usage (factor 3) • 2 field arrays on fine grid (f + residuals) • Remainder is coarse grid variables (factor 1.5)

Solution of Steady-State LBE • 16,000 time steps to machine zero on 33x33 grid • 4 to 5 times slower on 129x129 grid • Even slower on 513x513 grid

Non-Linear Multigrid Solution of LBE • 2 to 3 orders of magnitude faster in terms of cycles • Multigrid cycle is 24 times cost of LBE time-step

Non-Linear Multigrid Solution of LBE • Work unit: CPU time for 1 LBE time-step on relevant grid • MG 1 to 2 orders of magnitude more efficient • Benefit increases with finer grids • Asymptotic property of MG: O(N)

Conclusions • Steady-state LBE can be solved efficiently O(N) using multigrid • Implemented by calling existing LBE code on each grid level • Plus defect correction and under-relaxation step • Better smoothers are available at the cost of decreased modularity • Linearizing LBE is a poor strategy • Destroys favorable operation count

Future Challenges • For steady-state cases, apply known PDE strategies • Adaptive meshing • Adjoint solution techniques for MDO sensitivities • Simple (but costly) linearization • Extend steady-state solver to time-implicit solver • Exact convection (shift to neighbor) is lost • Effect on temporal accuracy (?)

Exploring Alternative Approaches to CFD