Age | Commit message (Collapse) | Author | |
---|---|---|---|
2019-06-15 | Start to record some benchmarks | Adrian Kummerlaender | |
2019-06-13 | Add JupyterLab to environment | Adrian Kummerlaender | |
2019-06-11 | Start to use codegen for actual kernel generation | Adrian Kummerlaender | |
2019-06-11 | Remove initial vector field example | Adrian Kummerlaender | |
2019-06-08 | Performance optimizations | Adrian Kummerlaender | |
Starting point: ~200 MLUPS on a NVidia K2200 Changes that did not noticeably impact performance: * Memory layout AOS vs. SOA (weird, probably highly platform dependent) * Propagate on read * Tagging pointers as read / write only * Manual code inlining Changes that made things worse: * Bad thread block sizes The actual issue: * Hidden double precision computations => Code now yields ~600 MLUPS | |||
2019-06-04 | Enable verbose OpenCL output | Adrian Kummerlaender | |
2019-05-29 | Add Jupyter to nix-shell | Adrian Kummerlaender | |
2019-05-26 | Add basic D2Q9 LBM | Adrian Kummerlaender | |
Ported the basic compustream structure | |||
2019-05-20 | Throw together basic UI for updating field function | Adrian Kummerlaender | |
2019-05-19 | Initial commit | Adrian Kummerlaender | |