Keras hybrid model lightning.gpu with adjoint differentiation

Hi @ToMago

Thanks for posting the above info. I have attempted to run your earlier script using adjoint with lightning.gpu v0.25.0 and was able to run it to completion.

The GPU used for this was a 40GB A100, which is the general target GPU for our cuQuantum backed workloads, along with the other Tesla-grade V100.

The error you reported is likely due to there being insufficient GPU memory available to initialize the CUDA contexts necessary for cuQuantum, and so fails to complete. Can you indicate which GPU you are running this on, as it may help us to track down the problem.

Also, while running the script, it may be worth examining the GPU memory use through nvidia-smi to help idnetify if this is the root cause. I prefer to poll this every second using watch -n 1 nvidia-smi to keep track over the script execution time.

1 Like