Age | Commit message (Collapse) | Author | |
---|---|---|---|
2019-06-11 | Count operations | Adrian Kummerlaender | |
2019-06-11 | Restructure codegen notebook | Adrian Kummerlaender | |
2019-06-10 | Reduce thread block size | Adrian Kummerlaender | |
=> ~780 MLUPS | |||
2019-06-10 | Add fixed velocity boundaries to generated LBM kernel | Adrian Kummerlaender | |
Interestingly this increased performance to ~750 MLUPS compared to ~665 MLUPS. | |||
2019-06-09 | First test of partially generated LBM kernel | Adrian Kummerlaender | |
A kernel extracted from `lbn_codegen.ipynb` yields ~665 MLUPS compared to the ~600 MLUPS produced by a manually optimized kernel. Note that this new kernel currently doesn't handle boundary conditions (but dropping in a density condition doesn't impact performance). | |||
2019-06-09 | Start tracking codegen notebook | Adrian Kummerlaender | |