@blueberry: can you point me at the right way to do convolutions 2d in jcuda? (I expect there to be a builtin but I am not finding it)
@blueberry: also, do you know of any efforts to bind the *.cu files from Tensorflow, PyTorch, or CuPY with JCuda? At the end of the day, there's no reason to rewrite hte .cu files, I just want a way to use them from Clojure, and the .cu/jcuda layer seems like a great place to intercept on.
I do not understand what you are asking.
@blueberry: libraries like tensorflow / pytorch / cupy provide, at some level, an accelerated nd-array + various ops on cuda
then there's some python wrapper over it
this "accelerated cuda nd-array + ops" seems like something that can be extracted, and then bound to via jcuda, and thenuseable via clojure
Those cu kernels are a small percentage of the overall code, and they do not work in isolation from the host code. You'd have to take the cu files (respecting the license, of course) and write matching host code in Java and/or Clojure.