Hi!
I'm comparing the performance of the lightning.qubit device with diff_method="adjoint"
on a CPU (Intel Xeon W-2245 @ 3.90 GHz) and on a GPU (NVIDIA GeForce RTX 2080 Ti). I'm training a hybrid QNN for 100 epochs, in two environments: one using tensorflow and the other using tensorflow-gpu.
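The QNode in both environments is set up roughly like this (a simplified sketch, not my exact script; the wire count is just the value consistent with the model summary further below):

```python
import pennylane as qml

n_wires = 10  # assumed from the (None, 10) output of keras_layer in the summary below

dev = qml.device("lightning.qubit", wires=n_wires)

@qml.qnode(dev, diff_method="adjoint")
def circuit(inputs, weights):
    # encode the classical activations, then apply the trainable entangling layers
    qml.IQPEmbedding(inputs, wires=range(n_wires))
    qml.StronglyEntanglingLayers(weights, wires=range(n_wires))
    return [qml.expval(qml.PauliZ(w)) for w in range(n_wires)]
```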
On CPU:
batch_size = 4 takes ~300s per epoch.
batch_size = 64 takes ~247s per epoch.
On GPU:
batch_size = 4 takes ~268s per epoch.
batch_size = 64 takes ~254s per epoch.
batch_size = 2000 takes ~272s per epoch.
I'm wondering why the CPU is faster than the GPU here.
Also, GPU utilization only reaches about 3-5% (with memory usage at 10417MiB / 11019MiB) even after increasing the batch size to 1024. Is it not possible to speed up training by pushing GPU utilization to 60% or more?
A summary of my untrained model looks like this…
Model: "sequential_1"
_________________________________________________________________
Layer (type)                 Output Shape              Param #
=================================================================
sequential (Sequential)      (None, 24)                14232
dense_1 (Dense)              (None, 10)                250
keras_layer (KerasLayer)     (None, 10)                0 (unused)
dense_2 (Dense)              (None, 24)                264
=================================================================
Total params: 14,746
Trainable params: 14,746
Non-trainable params: 0
The QNN KerasLayer contains an IQPEmbedding followed by StronglyEntanglingLayers,
and it should show 90 trainable parameters once the model has been built and trained.
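To be concrete about where the 90 comes from (assuming 3 StronglyEntanglingLayers on 10 wires, which matches the (None, 10) output shape above):

```python
import pennylane as qml

# Weight tensor shape for StronglyEntanglingLayers is (n_layers, n_wires, 3),
# so 3 layers on 10 wires -> 3 * 10 * 3 = 90 trainable parameters.
weight_shapes = {"weights": qml.StronglyEntanglingLayers.shape(n_layers=3, n_wires=10)}
print(weight_shapes)  # {'weights': (3, 10, 3)}

# This is what gets passed when wrapping the QNode for Keras, e.g.:
# qlayer = qml.qnn.KerasLayer(circuit, weight_shapes, output_dim=10)
```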
Please shed some light on this. Thank you.