ML Katas

Manual Gradient Descent Step

Difficulty: medium (<30 mins) · Tags: autograd, optimization, gradient descent

Simulate one step of gradient descent for a simple quadratic loss.

Problem

Given a scalar parameter w initialized at 5.0, minimize the loss L(w) = (w - 3)^2 using PyTorch.

  • Input: None (fixed setup).
  • Output: Updated parameter value after one gradient descent step.
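For reference, the analytic gradient is dL/dw = 2(w - 3). At w = 5.0 this is 4.0, so one step with the learning rate 0.1 used in the example gives w = 5.0 - 0.1 * 4.0 = 4.6.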

Example

import torch

w = torch.tensor(5.0, requires_grad=True)
loss = (w - 3) ** 2
loss.backward()              # w.grad now holds dL/dw = 2 * (5 - 3) = 4.0
with torch.no_grad():
    w -= 0.1 * w.grad        # 5.0 - 0.1 * 4.0 = 4.6
print(w.item())  # 4.6, one step from 5.0 toward the minimum at 3

Solution Sketch

Compute the gradient with loss.backward(), then update w inside a torch.no_grad() block using w -= lr * w.grad (the update itself must not be tracked by autograd). Because PyTorch accumulates gradients, reset them with w.grad.zero_() before taking the next step.
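
A minimal sketch of repeating the step until w approaches the minimum; the learning rate and step count here are illustrative choices, not part of the problem:

import torch

w = torch.tensor(5.0, requires_grad=True)
lr = 0.1  # illustrative learning rate

for step in range(20):
    loss = (w - 3) ** 2      # quadratic loss with minimum at w = 3
    loss.backward()          # populates w.grad with dL/dw = 2 * (w - 3)
    with torch.no_grad():
        w -= lr * w.grad     # in-place update, excluded from autograd tracking
    w.grad.zero_()           # clear the accumulated gradient before the next step

print(w.item())  # approaches 3.0

Since the error w - 3 shrinks by a factor of (1 - 2 * lr) = 0.8 each step, 20 steps bring w to roughly 3.02.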