In the Doubly Stochastic Gradient Descent method, the Hamiltonian was given as

H = 4 + 2 I ⊗ X + 4 I ⊗ Z − X ⊗ X + 5 Y ⊗ Y + 2 Z ⊗ X

So does “doubly” stochastic mean that, at each SGD iteration, we sample only a subset of the Pauli terms of H and evaluate the expectation values of just those terms?

For instance, we could choose to sample 3 of the 5 Pauli terms at each iteration. At the first iteration we might sample IX, XX, ZX; at the next we might sample IX, XX, YY; and so on. Am I thinking about this correctly? If so, how many terms were sampled per iteration to produce the results presented below?
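To make the question concrete, here is a minimal sketch of the sampling scheme I have in mind. It rests on my own assumptions and is not necessarily what the demo does: terms are drawn uniformly without replacement, the sampled sum is rescaled by 5/k so that it matches the full sum on average, and expectation values are computed exactly from a state vector rather than from shots. The names `TERMS`, `sampled_energy`, and `sample_term_names` are hypothetical.

```python
import numpy as np

# Single-qubit Pauli matrices.
I = np.eye(2, dtype=complex)
X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)

# H = 4 + 2 IX + 4 IZ - XX + 5 YY + 2 ZX, stored as (coefficient, matrix).
CONSTANT = 4.0
TERMS = {
    "IX": (2.0, np.kron(I, X)),
    "IZ": (4.0, np.kron(I, Z)),
    "XX": (-1.0, np.kron(X, X)),
    "YY": (5.0, np.kron(Y, Y)),
    "ZX": (2.0, np.kron(Z, X)),
}


def expval(state, op):
    """Exact expectation value <state|op|state> (no shot noise)."""
    return float(np.real(np.vdot(state, op @ state)))


def exact_energy(state):
    """Energy using all five Pauli terms."""
    return CONSTANT + sum(c * expval(state, p) for c, p in TERMS.values())


def sampled_energy(state, names):
    """Energy estimate from a subset of terms.

    Assumption: the subset was drawn uniformly without replacement, so
    rescaling by (number of terms / subset size) keeps the estimate unbiased.
    """
    scale = len(TERMS) / len(names)
    partial = sum(TERMS[n][0] * expval(state, TERMS[n][1]) for n in names)
    return CONSTANT + scale * partial


def sample_term_names(rng, k):
    """Pick k distinct term labels uniformly at random."""
    return rng.choice(list(TERMS), size=k, replace=False).tolist()


rng = np.random.default_rng(0)
state = np.array([1.0, 2.0, 3.0, 4.0]) / np.sqrt(30.0)  # arbitrary normalized state

for step in range(3):
    names = sample_term_names(rng, k=3)
    print(step, names, sampled_energy(state, names))
print("exact:", exact_energy(state))
```

Each iteration here sees a different 3-term estimate of the energy, and in an actual optimization the gradient would be taken of that estimate instead of the full sum.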

Thanks!