Add support for generating a D3Q19 kernel
Note that this required almost no changes beyond generalizing the cell
indexing and adding the symbolic formulation of a D3Q19 BGK collision step.
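
To illustrate what "generalizing cell indexing" can look like, here is a minimal sketch that is not taken from the script itself. The helper names `velocity_set` and `cell_index` and the row-major layout are assumptions made for illustration only. The same two functions cover both D2Q9 and D3Q19.

```python
from itertools import product

def velocity_set(d, max_nonzero):
    """Lattice velocities in {-1, 0, 1}^d with at most `max_nonzero`
    nonzero components (hypothetical helper, not from the commit)."""
    return [c for c in product((-1, 0, 1), repeat=d)
            if sum(abs(x) for x in c) <= max_nonzero]

def cell_index(size, cell):
    """Row-major linear index of `cell` in a grid with extents `size`.
    Works for any dimension; the memory layout is an assumption."""
    idx = 0
    for extent, x in zip(size, cell):
        idx = idx * extent + x
    return idx

d2q9 = velocity_set(2, 2)   # all 9 vectors of {-1, 0, 1}^2
d3q19 = velocity_set(3, 2)  # {-1, 0, 1}^3 without the 8 cube corners

print(len(d2q9), len(d3q19))
print(cell_index((4, 5), (1, 2)), cell_index((4, 5, 6), (1, 2, 3)))
```

Because neither function hard-codes the dimension, the offset of a neighbor along velocity `c` can be computed the same way in 2D and 3D.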
Increasing the neighborhood communication from 9 to 19 cells leads to a
significant performance "regression": the 3D kernel yields ~360 MLUPS
(million lattice updates per second) compared to the 2D version's ~820 MLUPS.
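
As a rough plausibility check, the following sketch assumes the kernel is memory-bandwidth bound and that the traffic per cell update scales linearly with the number of populations q. Neither assumption is stated in the commit, and `scaled_mlups` is a hypothetical helper.

```python
def scaled_mlups(mlups_ref, q_ref, q_new):
    """Expected MLUPS after changing the number of populations per cell,
    assuming throughput is inversely proportional to q (an assumption)."""
    return mlups_ref * q_ref / q_new

# Scale the measured 2D figure (~820 MLUPS, q = 9) to q = 19.
estimate = scaled_mlups(820, 9, 19)
print(f"{estimate:.0f} MLUPS")
```

Under these assumptions the estimate is about 388 MLUPS, in the same range as the measured ~360 MLUPS, so most of the drop would be explained by the larger per-cell data volume alone.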
Diffstat (limited to 'standalone_cpp_codegen.py')
0 files changed, 0 insertions, 0 deletions