Currently:

- Wrong-mode pushforward or pullback runs the `map` on the CPU and copies back to the GPU: https://github.com/JuliaDiff/DifferentiationInterface.jl/blob/7e80fef20e8aa40aa47e33a50040b2f60907521a/DifferentiationInterface/src/first_order/pullback.jl#L354-L370
- If I try to parallelize it, I need to duplicate the `prep` object to avoid race conditions, which is not possible

See also:

- #620
- #970