Different interfaces, different performances

Thanks for the time and consideration @josh. My guess is the weights and inputs structures are by default not suitable for Cuda which maybe can be modified?

The error has been reported almost at the same time here as well.