Cycles: reduce CUDA stack memory access for Maxwell and up, increasing max registers.
authorBrecht Van Lommel <brechtvanlommel@gmail.com>
Sun, 19 Jun 2016 16:06:22 +0000 (18:06 +0200)
committerBrecht Van Lommel <brechtvanlommel@gmail.com>
Sun, 19 Jun 2016 18:17:26 +0000 (20:17 +0200)
commite26eb9c93bdeae0b52153a7fcf37bceebffd6304
tree1d84a8bf7fb020e0bc7b1b0243fe38c32299c77f
parent9b618c1a5a523e74a5c8f9b0c55a56c54a2388d7
Cycles: reduce CUDA stack memory access for Maxwell and up, increasing max registers.

For non-branched path tracing with a GTX 960 and CUDA 7.5, this gives a small reduction
in stack usage but mainly: 8% faster render on BMW, 5% on pabellon, 13% on classroom.
intern/cycles/kernel/kernels/cuda/kernel.cu