aboutsummaryrefslogtreecommitdiff
path: root/lbm_codegen.ipynb
AgeCommit message (Collapse)Author
2019-06-11Count operationsAdrian Kummerlaender
2019-06-11Restructure codegen notebookAdrian Kummerlaender
2019-06-10Reduce thread block sizeAdrian Kummerlaender
=> ~780 MLUPS
2019-06-10Add fixed velocity boundaries to generated LBM kernelAdrian Kummerlaender
Interestingly this increased performance to ~750 MLUPS compared to ~665 MLUPS.
2019-06-09First test of partially generated LBM kernelAdrian Kummerlaender
A kernel extracted from `lbn_codegen.ipynb` yields ~665 MLUPS compared to the ~600 MLUPS produced by a manually optimized kernel. Note that this new kernel currently doesn't handle boundary conditions (but dropping in a density condition doesn't impact performance).
2019-06-09Start tracking codegen notebookAdrian Kummerlaender