177. Gradient checking

medium
General
staff

Gradient checking is a simple way to verify your backprop gradients by comparing them to numerical gradients from finite differences. In this task, you’ll implement gradient checking for a tiny neural network layer so you can catch sign/shape mistakes early.

Requirements

Implement the function specified by the Input and Output Signatures below.

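The starter signature is not shown on this page; a minimal stub consistent with the Input and Output Signatures below (the function name gradient_check is an assumption, not confirmed by the page) might look like:

```python
import numpy as np

def gradient_check(W: np.ndarray, X: np.ndarray, b: np.ndarray,
                   y: np.ndarray, eps: float) -> np.ndarray:
    """Return the per-weight relative error between the analytical and
    numerical gradients of the loss w.r.t. W, as a (d, 1) array."""
    ...
```
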
Rules:

  • Compute the analytical gradient \( \frac{\partial L}{\partial W} \) for the model above.
  • Compute numerical gradients using centered differences:
    \( g_{\text{num}}(i) = \frac{L(W_i + \epsilon) - L(W_i - \epsilon)}{2\epsilon} \)
  • For each weight entry, return the relative error:
    \( \text{rel\_err} = \frac{|g_{\text{anal}} - g_{\text{num}}|}{\max(10^{-8},\ |g_{\text{anal}}| + |g_{\text{num}}|)} \)
  • Use a numerically stable sigmoid and avoid taking log(0) by clipping probabilities (e.g., to [1e-12, 1 - 1e-12]); a sketch of both appears after this list.
  • Do not use autograd frameworks (no PyTorch/TensorFlow/JAX); only NumPy + built-ins.
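
For the stable-sigmoid and clipping rules, a minimal sketch (the helper names stable_sigmoid and bce_loss are illustrative, not part of the required interface) could be:

```python
import numpy as np

def stable_sigmoid(z: np.ndarray) -> np.ndarray:
    # Evaluate sigmoid without overflowing exp() for large |z|.
    out = np.empty_like(z, dtype=float)
    pos = z >= 0
    out[pos] = 1.0 / (1.0 + np.exp(-z[pos]))
    ez = np.exp(z[~pos])            # z < 0 here, so exp(z) cannot overflow
    out[~pos] = ez / (1.0 + ez)
    return out

def bce_loss(a: np.ndarray, y: np.ndarray) -> float:
    # Clip probabilities so log(0) is never evaluated.
    a = np.clip(a, 1e-12, 1 - 1e-12)
    return float(-np.mean(y * np.log(a) + (1 - y) * np.log(1 - a)))
```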

Example

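The example values from the original page are not reproduced here; an illustrative call with made-up shapes (and the assumed function name gradient_check) might look like:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 3))                       # n=5 samples, d=3 features
y = rng.integers(0, 2, size=(5, 1)).astype(float) # binary labels, shape (n, 1)
W = rng.normal(size=(3, 1))
b = np.zeros(1)

rel_err = gradient_check(W, X, b, y, eps=1e-5)
print(rel_err.shape)   # (3, 1)
print(rel_err.max())   # should be tiny for a correct backprop, e.g. < 1e-7
```
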
Input Signature

  Argument    Type
  W           np.ndarray
  X           np.ndarray
  b           np.ndarray
  y           np.ndarray
  eps         float

Output Signature

  Return Name    Type
  value          np.ndarray

Constraints

  • Use NumPy only; no autograd frameworks.

  • Return relative error as (d,1) np.ndarray.

  • Use stable sigmoid; clip probabilities for log.

Hint 1

Convert inputs to NumPy arrays with consistent shapes: X:(n,d), y:(n,1), W:(d,1), b:(1,) so z = X @ W + b works without broadcasting surprises.

Hint 2

For sigmoid + binary cross-entropy, the gradient simplifies: dz = (a - y)/n where a = sigmoid(z). Then dW = X.T @ dz gives the analytical gradient.
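
One way to see this simplification, sketched for the mean-reduced loss \( L = -\tfrac{1}{n}\sum_j \big[ y_j \log a_j + (1-y_j)\log(1-a_j) \big] \) with \( a_j = \sigma(z_j) \): since \( \frac{\partial a_j}{\partial z_j} = a_j(1-a_j) \),
\( \frac{\partial L}{\partial z_j} = -\frac{1}{n}\Big(\frac{y_j}{a_j} - \frac{1-y_j}{1-a_j}\Big)\, a_j(1-a_j) = \frac{a_j - y_j}{n}, \)
and stacking over samples gives \( \frac{\partial L}{\partial W} = X^\top \frac{(a - y)}{n} \).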

Hint 3

Write a forward_loss(W_local) helper that returns a scalar loss. For each weight W[i,0], compute centered-difference g_num = (L(W+eps)-L(W-eps))/(2*eps), then return relative error |g_anal-g_num|/max(1e-8,|g_anal|+|g_num|). Clip a to avoid log(0) and use a stable sigmoid.
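
Putting the three hints together, a sketch of the full check (assuming a mean-reduced binary cross-entropy loss and the gradient_check name used above; not the official reference solution) could be:

```python
import numpy as np

def gradient_check(W, X, b, y, eps=1e-5):
    n, d = X.shape

    def sigmoid(z):
        # Numerically stable sigmoid.
        out = np.empty_like(z, dtype=float)
        pos = z >= 0
        out[pos] = 1.0 / (1.0 + np.exp(-z[pos]))
        ez = np.exp(z[~pos])
        out[~pos] = ez / (1.0 + ez)
        return out

    def forward_loss(W_local):
        # Scalar binary cross-entropy loss for a given weight matrix.
        a = sigmoid(X @ W_local + b)
        a = np.clip(a, 1e-12, 1 - 1e-12)
        return float(-np.mean(y * np.log(a) + (1 - y) * np.log(1 - a)))

    # Analytical gradient: dL/dz = (a - y) / n, so dL/dW = X.T @ dz.
    a = sigmoid(X @ W + b)
    g_anal = X.T @ ((a - y) / n)                  # shape (d, 1)

    # Numerical gradient via centered differences, one weight entry at a time.
    g_num = np.zeros_like(W, dtype=float)
    for i in range(d):
        W_plus, W_minus = W.copy(), W.copy()
        W_plus[i, 0] += eps
        W_minus[i, 0] -= eps
        g_num[i, 0] = (forward_loss(W_plus) - forward_loss(W_minus)) / (2 * eps)

    # Per-entry relative error, returned as a (d, 1) array.
    denom = np.maximum(1e-8, np.abs(g_anal) + np.abs(g_num))
    return np.abs(g_anal - g_num) / denom
```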

Roles
ML Engineer
AI Engineer
Companies
General
Levels
staff
senior
entry
Tags
gradient-checking
finite-differences
backpropagation
binary-cross-entropy