Optimisation routine yields the same loss values

I've been using PennyLane for an optimization task and keep getting the following warning: UserWarning: Output seems independent of input. I have checked how the loss is being calculated and whether all examples are being passed in correctly, but I can't seem to find the issue. I have pasted my code below since I can't attach a file as a new user. Thanks a lot for taking a look at this.

import pennylane as qml                                                 
from pennylane import numpy as np
from sklearn.metrics import log_loss
from pennylane.optimize import AdamOptimizer

#set parameters of the model 

n_qubits = 8
n_features = 16
n_parameters = 15
feature_range = (0, 2*np.pi)

train_size = 150
test_size = 150

qubits = list(range(n_qubits))


def initialize_parameters(min_range, max_range, n_parameters):
    params = np.random.uniform(low = min_range, high = max_range, size = n_parameters)
    return params


#Import the data, which I have omitted; random placeholder data is used below

x_train = np.random.uniform(size = (150,16))
x_test = np.random.uniform(size = (150,16))
y_train = np.array([0]*75 + [1]*75)
y_test = np.array([0]*75 + [1]*75)


dev = qml.device("default.qubit", wires=8)

@qml.qnode(dev)
def qnn(data, theta):

    #data encoding 
    for i in range((n_features // 2)):
        qml.RX(data[i], wires = i)

    for i in range((n_features // 2)):
        qml.RY(data[i+n_qubits], wires = i)

    #variational
    theta_counter = 0 

    for i, q in enumerate(qubits, start = theta_counter):
        qml.RX(theta[i], wires = q)
        theta_counter = i

    for q1, q2 in zip(qubits[0::2], qubits[1::2]):
        qml.CZ(wires=[q1,q2])

    for i, q in enumerate(qubits[1::2], start = theta_counter+1):
        qml.RY(theta[i], wires = q)
        theta_counter = i 

    qml.CZ(wires = [1, 3])
    qml.CZ(wires = [5, 7])

    for i, q in enumerate(qubits[3::4], start = theta_counter+1):
        qml.RX(theta[i], wires = q)
        theta_counter = i 

    qml.CZ(wires=[3, 7])

    for i, q in enumerate(qubits[7::8], start = theta_counter+1):
        qml.RY(theta[i], wires=q)
        theta_counter = i 

    return qml.expval(qml.PauliZ(7))


def compute_cost(params, x, y):
    
    y_pred = [qnn(x[i], params) for i in range(x.shape[0])] 

    yhat = [1 if x > 0 else 0 for x in y_pred]

    return np.array(log_loss(y, yhat))

#training the model 
epochs = 10

opt = AdamOptimizer(stepsize= 0.01, beta1=0.9, beta2=0.999)

params = initialize_parameters(feature_range[0], feature_range[1], n_parameters)

loss = compute_cost(params, x_train, y_train)

print("Epoch: {:2d} | Cost: {:3f}".format( 0, loss ))

for it in range(epochs):

    params, loss = opt.step_and_cost(lambda v: compute_cost(v, x_train, y_train), params)

    res = [it + 1, loss]
    print("Epoch: {:2d} | Loss: {:3f}".format(*res))

Hi @Zohim_Chandani1, and welcome to the forum! :slightly_smiling_face:

I think I’ve determined the issue here. It is in your implementation of compute_cost:

def compute_cost(params, x, y):
  y_pred = [qnn(x[i], params) for i in range(x.shape[0])]

  yhat = [1 if x > 0 else 0 for x in y_pred]

  return np.array(log_loss(y, yhat))

The returned value of this function does not depend in a differentiable manner on params, which is why you are seeing the automatic differentiation system complaining that “Output seems independent of input.”

Specifically, you are thresholding y_pred (which should be differentiable with respect to params) inside a regular Python list comprehension, and the result (discrete zeros or ones) no longer has a derivative with respect to params.

(You’re also importing log_loss from scikit-learn, which might furthermore break differentiability, since its internal implementation uses “standard” NumPy rather than the autograd-aware numpy provided by PennyLane.)

Instead of converting predictions to binary 0 or 1, I would recommend working directly with the prediction probabilities y_pred themselves (which have the nice property that they equal 1 when the prediction is 100% confident). You can then put them through a cross-entropy or log-loss type cost function to compare predictions with labels. If you’re using numpy, you might have to hand-code it (I don’t believe it is built in there); the equivalent step in a library like TensorFlow would be (note: you’d have to write your model using the TF interface to use this):

probs = (tf.stack(y_pred) + 1) / 2 # map <Z> expectation values in [-1, 1] to probabilities in [0, 1]
logits = tf.math.log(probs) # this creates "logits", i.e., unnormalized log probabilities
cost = tf.nn.softmax_cross_entropy_with_logits(labels=y, logits=logits) # per-example losses; average them (e.g., with tf.reduce_mean) for a single scalar cost
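If you stay with PennyLane's autograd-interface numpy instead of TF, you would hand-code the same idea. A rough sketch (untested, and assuming qnn returns a single PauliZ expectation value in [-1, 1] and y holds 0/1 labels) might look like:

def compute_cost(params, x, y):
    eps = 1e-15
    cost = 0
    for i in range(len(x)):
        # map <Z> in [-1, 1] to the probability of measuring |1> (assuming class 1 <-> |1>)
        p = (1 - qnn(x[i], params)) / 2
        p = np.clip(p, eps, 1 - eps)  # avoid log(0)
        cost = cost - (y[i] * np.log(p) + (1 - y[i]) * np.log(1 - p))
    return cost / len(x)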

Thanks for your previous response @nathan

How could one determine if the output of the cost function depends on the tunable parameters in a ‘differentiable manner’?

I hand-coded the log-loss function with a for loop as per your advice, as shown below, and the code now works:

def compute_cost(params, x, y): 

    cost = 0 

    for i in range(len(x)):

        probs = qnn(x[i], params)
        p1 = probs[1]
        loss = (-1)*((y[i]*np.log(p1)) + ((1-y[i])*np.log(1-p1)))
        cost += loss 

    cost = cost/len(x)

    return cost

However, a list comprehension version of the same function (shown below) does not work.

def compute_cost(params, x, y):
    
    probs = [qnn(x[i], params) for i in range(x.shape[0])]
    
    p1 = [probs[i][1] for i in range(len(probs))]
    
    cost = ((np.sum((y*np.log(p1)) + ([1-x for x in y]*np.log([1-x for x in p1]))))/len(y))*(-1)

    return cost

Shall I just avoid list comprehensions moving forward, or is there a deeper reason why PennyLane does not support this?

Hey @Zohim_Chandani1! This is less a restriction of PennyLane, and more a restriction of the machine learning framework/autodifferentiation interface that is being used.

In general, there are restrictions that are common across all autodifferentiation libraries:

  • Cost functions must return a single floating-point scalar
  • The output of the cost function must be a piecewise differentiable function of the input.

In your case above, I believe that your original cost function did not satisfy the second restriction, since yhat was not a continuous transformation of the input.
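To answer your question about how to tell whether the cost depends on the tunable parameters in a differentiable manner: a quick practical check (just a sketch using the Autograd interface) is to request the gradient directly and inspect it:

# Probe the gradient of the cost with respect to params (argnum=0).
grad_fn = qml.grad(compute_cost, argnum=0)
gradients = grad_fn(params, x_train, y_train)
print(gradients)
# Gradients that are identically zero, or the "Output seems independent of
# input" warning, are a sign that the cost is not a differentiable function of params.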

Beyond these general restrictions, though, there might also be some restrictions that are specific to the autodifferentiation library.

For example, Autograd (which PennyLane uses by default when you from pennylane import numpy) has a couple of additional restrictions, including no differentiable support for array assignments (A[0, 0] = x). You can see the full list of restrictions in their documentation.
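As a small illustration of that last point (my own sketch, not taken from the Autograd docs), the first function below breaks differentiation through an in-place assignment, while the second builds the array functionally and keeps the gradient intact:

from pennylane import numpy as np

def not_differentiable(x):
    A = np.zeros(2)
    A[0] = x ** 2  # in-place array assignment: Autograd cannot trace this
    return np.sum(A)

def differentiable(x):
    A = np.stack([x ** 2, x ** 3])  # build the array in one functional step instead
    return np.sum(A)

Differentiating the second version with qml.grad works as expected, whereas the first either errors out or loses the dependence on x.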

I’m not 100% sure about TensorFlow and Torch, but JAX also has similar restrictions: https://jax.readthedocs.io/en/latest/notebooks/Common_Gotchas_in_JAX.html#in-place-updates

Hope that helps!

Thank you for your previous response @josh.

I have been studying the behaviour of various loss functions, which I have hand-coded while adhering to the restrictions you mentioned in your last post, and I am still somewhat confused about what breaks differentiability in the second example below compared to the first.

In the first case, where I measure the output probabilities of my qubit in my QNN, I define the following loss function, which is differentiable and is minimised:

def compute_cost(params, x, y): 

    eps = 1e-15

    cost = 0 

    for i in range(len(x)):

        probs = qnn(x[i], params)
        p0 = probs[0]
        p1 = probs[1]

        if y[i] == 0: 
            p = p0

        elif y[i] == 1: 
            p = p1

        p = np.clip(p, eps, 1 - eps)
        loss = (-1)*((y[i]*np.log(p)) + ((1-y[i])*np.log(1-p)))
        cost += loss 

    cost = cost/len(x)

    return cost

If I now alter this setup slightly so that I instead measure the Z expectation value of the qubit, and define the following:

def compute_cost(params, x, y): 

    eps = 1e-15

    cost = 0 

    for i in range(len(x)):

        exp_val = qnn(x[i], params)

        if exp_val >= 0.0: 
            prediction = 1 
        elif exp_val < 0.0: 
            prediction = 0 

        p = np.clip(prediction, eps, 1 - eps)
    
        loss = (-1)*((y[i]*np.log(p)) + ((1-y[i])*np.log(1-p)))
        cost = cost + loss 

    cost = cost/len(x)

    return cost

then I get the following warning: UserWarning: Output seems independent of input.

I am particularly fixated on this function as it seems to give me strong gradients when training on a different example in pyQuil, but I can't seem to spot what the fault may be here. Any help is much appreciated.

Hi @Zohim_Chandani1, I haven’t tested it myself locally, but my gut feeling would be that the if statement,

if exp_val >= 0.0: 
    prediction = 1 
elif exp_val < 0.0: 
    prediction = 0 

is what causes the non-differentiability issue. Assigning a constant 0 or 1 inside the if statement means prediction is no longer connected to the output of the QNode exp_val in a way that the autodifferentiation framework can trace back to params.

Compare this to the first example, where p is either p0 or p1, and is directly connected to the output of the QNode.

In the second example, you could try replacing the if statement with the np.heaviside function,

prediction = np.heaviside(exp_val, 1.)

however, I can't confirm whether heaviside itself is differentiable!
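Alternatively, a way to keep everything differentiable while still measuring the expectation value (just a rough sketch, untested) is to turn exp_val into a probability rather than a hard 0/1 prediction. For example, p = (1 + exp_val) / 2 is at least 0.5 exactly when your if statement would have predicted class 1, but it stays smoothly connected to the QNode output:

def compute_cost(params, x, y):

    eps = 1e-15

    cost = 0

    for i in range(len(x)):

        exp_val = qnn(x[i], params)

        # smooth stand-in for the hard prediction: >= 0.5 whenever exp_val >= 0
        p = (1 + exp_val) / 2
        p = np.clip(p, eps, 1 - eps)

        loss = (-1)*((y[i]*np.log(p)) + ((1-y[i])*np.log(1-p)))
        cost = cost + loss

    cost = cost/len(x)

    return cost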