this shows that we spend about 2/3 of the GPU roundtrip time on stepping. the other 1/3 is i guess scheduling/latency/memory transfers.
this shows that we spend about 2/3 of the GPU roundtrip time on stepping. the other 1/3 is i guess scheduling/latency/memory transfers.