What is meant by that Flax Linen uses "shape inference"? #2077
-
This is a question that comes up often, and I think it is useful to clarify it in a discussion.
Replies: 1 comment
-
We only declared the number of features we want in the output of the model, not the size of the input. Flax figures out the correct input size by itself.

Example: we create one dense layer instance (taking a `features` parameter):

```python
import jax
from jax import random
from flax import linen as nn

model = nn.Dense(features=5)
```

Parameters are not stored with the models themselves. You need to initialize them by calling the `init` function, passing a PRNGKey and a dummy input:

```python
key1, key2 = random.split(random.PRNGKey(0))
x = random.normal(key1, (10,))                     # Dummy input
params = model.init(key2, x)                       # Initialization call
jax.tree_util.tree_map(lambda x: x.shape, params)  # Check the parameter shapes
```

Outputs:

```python
{'params': {'bias': (5,), 'kernel': (10, 5)}}
```

The result is what we expect: bias and kernel parameters of the correct size. Under the hood, the dummy input variable `x` is only used for its shape; its values do not affect the initialized parameters.