Doubly Stochastic Gradient Descent

In the Doubly Stochastic Gradient Descent demo, the Hamiltonian was given as

H = 4 + 2 I⊗X + 4 I⊗Z − X⊗X + 5 Y⊗Y + 2 Z⊗X

And so the idea of “doubly” stochastic is that at each SGD iteration we only sample a subset of the Pauli terms from H and evaluate the expectations of those terms?

For instance, we could choose to sample 3 out of the 5 Pauli terms at each iteration. At the first iteration we might sample IX, XX, ZX; at the next step we might sample IX, XX, YY; and so on. Am I thinking of this right? If so, how many terms were sampled to get the results presented below?

Thanks!

@KAJ226, exactly right! The stochasticity comes from two sources:

  1. The finite number of shots
  2. The sampling of a random subset of the Hamiltonian terms

Hence ‘doubly stochastic’.
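
To make the two sources concrete, here is a rough sketch in PennyLane (not the demo’s exact code; the ansatz, shot count, and function names are placeholders):

import numpy as np
import pennylane as qml

# The 5 non-constant Pauli terms of H and their coefficients
# (the constant 4 is added back classically).
coeffs = [2, 4, -1, 5, 2]
obs = [
    qml.Identity(0) @ qml.PauliX(1),
    qml.Identity(0) @ qml.PauliZ(1),
    qml.PauliX(0) @ qml.PauliX(1),
    qml.PauliY(0) @ qml.PauliY(1),
    qml.PauliZ(0) @ qml.PauliX(1),
]

# Source 1 of stochasticity: a finite number of shots.
dev = qml.device("default.qubit", wires=2, shots=100)

def term_expval(params, observable):
    # Estimate <observable> with finite shots (placeholder ansatz).
    @qml.qnode(dev)
    def circuit():
        qml.RY(params[0], wires=0)
        qml.RY(params[1], wires=1)
        qml.CNOT(wires=[0, 1])
        return qml.expval(observable)
    return circuit()

def estimate_H(params, n=1):
    # Source 2 of stochasticity: sample n of the 5 terms uniformly.
    idx = np.random.choice(len(obs), size=n, replace=False)
    sampled = sum(coeffs[i] * term_expval(params, obs[i]) for i in idx)
    # Rescale by 5/n so the estimate is unbiased, then add the constant.
    return 4 + (len(obs) / n) * sampled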

In this particular example, the number of sampled Hamiltonian terms was set to 1, via n=1 in the snippet below:

def loss(params):
    # Constant term of H, plus the single sampled term's expectation
    # rescaled by 5/1 to keep the estimate of <H> unbiased.
    return 4 + (5 / 1) * circuit(params, n=1)
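
For context, this is roughly how such a loss is fed to an optimizer (a sketch; init_params stands in for the demo’s initial weights):

opt = qml.GradientDescentOptimizer(stepsize=0.1)
params = init_params  # placeholder for the demo's initial parameters
for it in range(100):
    params = opt.step(loss, params)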

I just have a question: what is this loss exactly? From what I understand, we sample terms from the Hamiltonian and look at the moving average to see if it converges. So what is this def loss(), and why is it returning 4 + (5 / 1) * circuit()? Shouldn’t it just be circuit()?

Hi @Kutubkhan_Bhatiya, this comes from the Hamiltonian, and the fact that we are only sampling 1 out of 5 terms:

H = 4 + 2 I⊗X + 4 I⊗Z − X⊗X + 5 Y⊗Y + 2 Z⊗X

Here, the 4 + comes from the constant 4 term in the Hamiltonian. The 5/1 comes from the fact that we are only sampling 1 term out of the remaining 5: the sampled term’s expectation has to be rescaled so that, on average, the loss still matches the full ⟨H⟩.
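
To spell out the reasoning: if the single sampled term (coefficient c_i, Pauli word P_i) is drawn uniformly from the 5 non-constant terms, the rescaled loss is an unbiased estimator of the full expectation value,

\mathbb{E}\left[4 + \tfrac{5}{1}\, c_i \langle P_i \rangle\right] = 4 + 5 \cdot \tfrac{1}{5} \sum_{i=1}^{5} c_i \langle P_i \rangle = \langle H \rangle.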

Oh okay, that makes a lot of sense. So if we sampled, let’s say, 2 out of 5 terms, would it be 5/2? Thanks for replying, I didn’t think I would get a reply on such an old post.

That’s right @Kutubkhan_Bhatiya, it would be 5/2.
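
As a minimal sketch, assuming (as in the snippet above) that circuit(params, n) returns the coefficient-weighted sum of the n sampled term expectations:

def loss(params, n=2):
    # Rescale by 5/n when sampling n of the 5 non-constant terms.
    return 4 + (5 / n) * circuit(params, n=n)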

There’s no problem that it’s an old post, we’re here to help!