rewrites for exp/log combinations #1540
-
Describe the issue: We have a component in a model that basically boils down to the expression in the example below. Even considering the case where x > 0, I thought it would already be simplified, but it looks like no rewrite happens.

Reproducible code example:

import pytensor.tensor as pt
from pytensor.graph import rewrite_graph
a = pt.dscalar("a")
b = pt.dscalar("b")
x = pt.dvector("x")
g = pt.exp(a + b * pt.log(x))
g.dprint();
rewrite_graph(g).dprint();

PyTensor version information: pytensor 2.31.6, installed from conda-forge
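For context, the simplification being asked about is the algebraic identity exp(a + b*log(x)) == exp(a) * x**b, which holds for x > 0. A quick NumPy check (my addition, not part of the original report) confirms the two forms agree on well-conditioned inputs:

```python
import numpy as np

# Identity expected to be rewritten: exp(a + b*log(x)) == exp(a) * x**b, for x > 0
rng = np.random.default_rng(0)
a, b = 0.5, -1.5
x = rng.uniform(0.1, 10.0, size=1000)

lhs = np.exp(a + b * np.log(x))  # the form as written in the model
rhs = np.exp(a) * x**b           # the "simplified" power form

print(np.allclose(lhs, rhs))  # True: the forms agree for positive x
```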
-
I'm not sure you would want such a rewrite; if anything, you want to move more things onto the log scale, not fewer (see #177), for stability. In your case, if b is negative for example, it helps keep the exponential small, while making it a power will facilitate underflow. In either case, in the final fused function you only iterate over the vector x once.

The performance will depend on your machine, but on mine the original is roughly 2x faster than the proposed alternative (it also allows smaller outputs before underflowing). Power with an arbitrary float exponent is more expensive than the repeated log and exp. It may change depending on the specific b.

import pytensor
import pytensor.tensor as pt
import numpy as np
a = pt.dscalar("a")
b = pt.dscalar("b")
x = pt.dvector("x")
g = pt.exp(a + b * pt.log(x))
fn = pytensor.function([a, b, x], g, trust_input=True)
fn.dprint()
# Composite{exp((i2 + (i1 * log(i0))))} [id A] 2
# ├─ x [id B]
# ├─ ExpandDims{axis=0} [id C] 1
# │ └─ b [id D]
# └─ ExpandDims{axis=0} [id E] 0
# └─ a [id F]
# Inner graphs:
# Composite{exp((i2 + (i1 * log(i0))))} [id A]
# ← exp [id G] 'o0'
# └─ add [id H]
# ├─ i2 [id I]
# └─ mul [id J]
# ├─ i1 [id K]
# └─ log [id L]
# └─ i0 [id M]
a_test = np.array(100.0)
b_test = np.array(-100.0)
x_test = np.repeat([1000., 2000.], 1000)
res = fn(a_test, b_test, x_test)
print(res.min()) # 2.1205505218333326e-287
%timeit fn(a_test, b_test, x_test) # 16.1 μs ± 419 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)
g_alt = pt.exp(a) * (x ** b)
fn_alt = pytensor.function([a, b, x], g_alt, trust_input=True)
fn_alt.dprint()
# Composite{(i2 * (i0 ** i1))} [id A] 3
# ├─ x [id B]
# ├─ ExpandDims{axis=0} [id C] 2
# │ └─ b [id D]
# └─ Exp [id E] 1
# └─ ExpandDims{axis=0} [id F] 0
# └─ a [id G]
# Inner graphs:
# Composite{(i2 * (i0 ** i1))} [id A]
# ← mul [id H] 'o0'
# ├─ i2 [id I]
# └─ pow [id J]
# ├─ i0 [id K]
# └─ i1 [id L]
res_alt = fn_alt(a_test, b_test, x_test)
print(res_alt.min()) # 0.0
%timeit fn_alt(a_test, b_test, x_test) # 32.6 μs ± 489 ns per loop (mean ± std. dev. of 7 runs, 10,000 loops each)
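The underflow difference above can be reproduced with plain NumPy, without compiling anything (my sketch, using the same test values as the benchmark):

```python
import numpy as np

a, b = 100.0, -100.0
x = np.float64(2000.0)

# Log-scale form: the exponent is a moderate negative number,
# so the result stays representable.
log_form = np.exp(a + b * np.log(x))  # exp(-660.09...) ~ 2.1e-287, matching res.min() above

# Power form: x**b underflows to 0.0 before the multiplication,
# so the finite exp(a) factor cannot rescue it.
with np.errstate(under="ignore"):
    pow_form = np.exp(a) * x**b  # exp(100) * 2000**-100 -> 2.69e43 * 0.0 == 0.0

print(log_form, pow_form)
```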
-
Actually
-
Small note when calling
-
Thanks for checking. Should we have the inverse rewrite then, from power to exp of a product with log? I guess the opposite direction is trickier though, because we still want things like
For this, do you mean by the computational backends themselves? So below what PyTensor controls? In case it helps, I tried:

rewrite_graph(g, include=("canonicalize", "stabilize", "specialize", "fusion")).dprint();
# Composite{exp((i2 + (i1 * log(i0))))} [id A]
# ├─ x [id B]
# ├─ ExpandDims{axis=0} [id C]
# │ └─ b [id D]
# └─ ExpandDims{axis=0} [id E]
# └─ a [id F]
#
# Inner graphs:
#
# Composite{exp((i2 + (i1 * log(i0))))} [id A]
# ← exp [id G] 'o0'
# └─ add [id H]
# ├─ i2 [id I]
# └─ mul [id J]
# ├─ i1 [id K]
# └─ log [id L]
# └─ i0 [id M]
rewrite_graph(g_alt, include=("canonicalize", "stabilize", "specialize", "fusion")).dprint();
# Composite{(i2 * (i0 ** i1))} [id A]
# ├─ x [id B]
# ├─ ExpandDims{axis=0} [id C]
# │ └─ b [id D]
# └─ Exp [id E]
# └─ ExpandDims{axis=0} [id F]
# └─ a [id G]
#
# Inner graphs:
#
# Composite{(i2 * (i0 ** i1))} [id A]
# ← mul [id H] 'o0'
# ├─ i2 [id I]
# └─ pow [id J]
# ├─ i0 [id K]
# └─ i1 [id L]
-
Depends on the goal of the rewrite. If you're just compiling for performance/stability, you probably don't want that. If you are rewriting to find whether an expression is equivalent to something (or just to simplify it under some well-defined definition), then it may be fine to have such rewrites, but not include them in the compilation database. This sort of thing may be helped by having hints at the graph level (like the user specifying that x is non-negative, or us inferring it because it comes from the output of an

What was your motivation for this issue?
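The idea of gating a rewrite on a positivity hint can be sketched with a toy expression tree in pure Python (hypothetical names and representation, not PyTensor's actual rewrite API):

```python
# Toy expressions as nested tuples: ("exp", e), ("log", e), ("mul", e1, e2),
# ("pow", e1, e2). Variables are plain strings; `positive` is the set of
# variables the user has hinted as strictly positive.

def rewrite_exp_log(expr, positive):
    """Rewrite exp(b * log(x)) -> x ** b, but only when x is known positive."""
    if (
        isinstance(expr, tuple)
        and expr[0] == "exp"
        and isinstance(expr[1], tuple)
        and expr[1][0] == "mul"
    ):
        b, inner = expr[1][1], expr[1][2]
        if isinstance(inner, tuple) and inner[0] == "log" and inner[1] in positive:
            return ("pow", inner[1], b)
    return expr  # without the hint, leave the graph alone

print(rewrite_exp_log(("exp", ("mul", "b", ("log", "x"))), positive={"x"}))
# ('pow', 'x', 'b')
print(rewrite_exp_log(("exp", ("mul", "b", ("log", "x"))), positive=set()))
# ('exp', ('mul', 'b', ('log', 'x')))  -- unchanged, the hint is missing
```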
-
Yes, pow is probably a standard library function (in C/Numba, and XLA in JAX) that implements it at its own discretion. The final form may also be CPU-specific.
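As a machine-dependent illustration of that point, one can time NumPy's power against the explicit exp/log chain (a rough sketch; absolute numbers will vary with the libm/SIMD implementation your NumPy build dispatches to):

```python
import timeit
import numpy as np

x = np.random.default_rng(0).uniform(1.0, 10.0, size=100_000)
b = -1.7  # arbitrary non-integer exponent

# Same quantity computed two ways; which is faster depends on the machine.
t_pow = timeit.timeit(lambda: x**b, number=200)
t_explog = timeit.timeit(lambda: np.exp(b * np.log(x)), number=200)

print(f"pow: {t_pow:.3f}s  exp(b*log(x)): {t_explog:.3f}s")
print(np.allclose(x**b, np.exp(b * np.log(x))))  # the results agree either way
```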
-
I added a comment while reviewing a model to check whether that was being rewritten; after seeing it wasn't, @tomicapretto and @jessegrabowski were also a bit surprised, so I opened the issue.

In the general case, right? In our case with x > 0, it looks like if we had actually worked out the model analytically on paper first, simplified to the power form, and then implemented that equivalent, we would have ended up with slower and less stable code.
-
Yes, what looks like a simplification mathematically can be very different in terms of performance/stability of floating-point operations. In either case, I wouldn't expect this to ever be a model bottleneck. If you were worried about performance, the first thing would be to profile the logp_dlogp function of the model and see what is taking time.

If you care about stability you would probably want the original expression as well, but that's a different kind of benchmark altogether.
-
I'll move this to a discussion; let me know if you think it should still be an issue.