Age | Commit message (Collapse) | Author |
|
Interestingly this seems to lose up to 10 MLUPS at first glance.
On the other hand such a small difference could also be a temporary load issue.
|
|
|
|
|
|
|
|
* Only update moment field when it is actually needed
* => ~825 MLUPS
* Defer plot generation until the actual simulation is done
|
|
=> ~780 MLUPS
|
|
|
|
Interestingly this increased performance to ~750 MLUPS compared to ~665 MLUPS.
|
|
A kernel extracted from `lbn_codegen.ipynb` yields ~665 MLUPS compared
to the ~600 MLUPS produced by a manually optimized kernel.
Note that this new kernel currently doesn't handle boundary conditions (but
dropping in a density condition doesn't impact performance).
|