Skip to content
/ npgrad Public

Auto differentiation for tensors with just Numpy.

Notifications You must be signed in to change notification settings

AllenHW/npgrad

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

npgrad

A small auto differentiation library built with just Numpy. Inspired by micrograd.

Installation

In the project root directory, run

pip install -e .

Numpy Like API

npgrad supports subset of Numpy APIs that are commonly used in deep learning.

Most notably, npgrad supports basic operations like +, -, *, /, **, @, max, min, sum, mean, reshape, transpose, swapaxes, ravel, squeeze, expand_dims, concat, index, masked_fill, repeat, tile, einsum, matmul, max_pool2d, avg_pool2d, conv2d.

See APIs for a ful list of supported operations.

import npgrad as npg
import numpy as np
import torch

x = npg.randn(7, 3, 4)
y = npg.randn(1, 4, 5)
z = x @ y
print(z)

# Compute gradients with torch
torch_x = torch.tensor(x.data, requires_grad=True)
torch_y = torch.tensor(y.data, requires_grad=True)
torch_z = torch_x @ torch_y

# Compare gradients
assert np.allclose(z.data, torch_z.detach().numpy())

Auto Differentiation

npgrad supports auto differentiation for all operations exposed in the API. The differentations are implemented natively in Numpy. It handles broadcasting the same way as Numpy and PyTorch.

You can check out the implementations in the npgrad/ops folder.

Einsum

x = npg.randn(3, 4, 7, 8, 3)
y = npg.randn(3, 4, 5, 6)

z = npg.sum(
  npg.einsum('ijkli,ijmn->klmn', x, y)
)
z.backward()

# Compute gradients with torch
torch_x = torch.tensor(x.data, requires_grad=True)
torch_y = torch.tensor(y.data, requires_grad=True)
torch_z = torch.sum(
  torch.einsum('ijkli,ijmn->klmn', torch_x, torch_y)
)
torch_z.backward()

# Compare gradients
assert np.allclose(x.grad, torch_x.grad.numpy())

Conv2D

import torch.nn.functional as F

x = npg.randn(2, 5, 28, 28)  # (batch_size, in_channels, height, width)
weight = npg.randn(3, 5, 3, 3)  # ( out_channels, in_channels, kernel_height, kernel_width)
bias = npg.randn(3)  # (out_channels,)

# Forward pass with npg.conv2d
y = npg.conv2d(x, weight, bias, stride=1, padding=1)
z = npg.sum(y)
z.backward()

# Compute gradients with PyTorch
torch_x = torch.tensor(x.data, requires_grad=True)
torch_weight = torch.tensor(weight.data, requires_grad=True)
torch_bias = torch.tensor(bias.data, requires_grad=True)

torch_y = F.conv2d(torch_x, torch_weight, torch_bias, stride=1, padding=1)
torch_z = torch.sum(torch_y)
torch_z.backward()

# Compare gradients
assert np.allclose(x.grad, torch_x.grad.numpy())
assert np.allclose(weight.grad, torch_weight.grad.numpy())
assert np.allclose(bias.grad, torch_bias.grad.numpy())

Advanced Indexing

x = npg.randn(10, 8, 12)
indices1 = np.random.randint(0, 10, (3, 2))
indices2 = np.random.randint(0, 8, (3, 2))

y = x[indices1, indices2, 2:8:2]
y = npg.sum(y)
y.backward()

# Compute gradients with torch
torch_x = torch.tensor(x.data, requires_grad=True)
torch_y = torch_x[indices1, indices2, 2:8:2]
torch_y = torch.sum(torch_y)
torch_y.backward()

# Compare gradients
assert np.allclose(x.grad, torch_x.grad.numpy())

And all other operations are tested thoroughly against PyTorch github.com/AllenHW/npgrad/blob/main/tests/ops.py.

Todo

  • Support comparison operators like ==, !=, <, <=, >, >=
  • Support inter-operation with Python and Numpy objects for +, -, *, /, **, @
  • Add checks so ``backward()` can only be called for scalars

About

Auto differentiation for tensors with just Numpy.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published