Age | Commit message | Author | |
---|---|---|---|
2019-11-10 | Implement basic CUDA target | Adrian Kummerlaender | |
Currently only for the SSS streaming pattern. `CudaCodePrinter` in `utility/printer.py` is required to add an `f` suffix to all single-precision floating-point literals. If this is not done when targeting single precision, most calculations happen in double precision, which destroys performance. (In OpenCL this is not necessary, as we can simply set the `-cl-single-precision-constant` flag. Sadly, such a flag doesn't seem to exist for nvcc.) | |||
2019-11-02 | Restructure LBM model / lattice distinction | Adrian Kummerlaender | |
2019-10-28 | Add basic setup.py | Adrian Kummerlaender | |
No guarantee for correctness - I mostly fiddled this together in order to use the common nixpkgs Python package functions for including boltzgen in other shell environments. |
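
The literal-suffix issue described in the CUDA target commit can be illustrated with a small, self-contained sketch. This is not the `CudaCodePrinter` from `utility/printer.py`: it swaps in a plain regex post-processing pass over already generated C-like source, and the function name `add_single_precision_suffix` is hypothetical. It only shows the intended effect, namely that every unsuffixed floating-point literal gains an `f` so that the compiler does not promote the surrounding arithmetic to double precision.

```python
import re

# Hypothetical helper, not part of boltzgen: matches decimal floating-point
# literals (1.0, .5, 3., 1e-3, 2.5E+4) that carry no suffix yet and are not
# part of an identifier or a longer number.
_FLOAT_LITERAL = re.compile(
    r"""
    (?<![\w.])                                  # no identifier char or dot before
    (
        (?:\d+\.\d*|\.\d+)(?:[eE][+-]?\d+)?     # has a decimal point, optional exponent
        |
        \d+[eE][+-]?\d+                         # integer mantissa with exponent
    )
    (?![\w.])                                   # no suffix, identifier char or dot after
    """,
    re.VERBOSE,
)


def add_single_precision_suffix(source: str) -> str:
    """Append 'f' to every unsuffixed floating-point literal in `source`."""
    return _FLOAT_LITERAL.sub(r"\1f", source)


if __name__ == "__main__":
    line = "f_next[0] = 0.5*rho + 1.0/3.0;"
    print(add_single_precision_suffix(line))
```

Integer literals such as the array index are left alone, and literals that already end in `f` are not touched a second time. The sketch does not skip string literals or comments and ignores hexadecimal floats, which is one reason a real generator would rather emit the suffix while printing the expression tree than patch the text afterwards.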