Two devices found in transfer learning example

Hey @DarthMalloc!

There have been a couple other posts about this same issue:

It looks like this issue might be due to the fact that putting torch tensors on a gpu is problematic when lightning-gpu is being used as well. Lightning-gpu and torch’s gpu pipeline are entirely differerent, and lightning-gpu expects the data to be on the host right now, so that should fix it!