
Conversation

@dpfau (Contributor) commented Jun 18, 2020

This resolves Issue #3487
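
For orientation, here is a hedged sketch of the functionality this PR adds, mirroring scipy.special.logsumexp's b (per-element scaling) and return_sign options; the input values are made up for illustration:

import jax.numpy as jnp
from jax.scipy.special import logsumexp

x = jnp.array([0.0, 1.0, 2.0])
b = jnp.array([1.0, -1.0, 1.0])  # scale factors; may be negative

# log(sum(b * exp(x))) computed stably; with return_sign=True the first
# output is log|sum| and the second is the sign of the sum.
val, sign = logsumexp(x, b=b, return_sign=True)
print(val, sign)  # approximately 1.7353, 1.0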

@dpfau changed the title to "Add b and return_sign functionality to scipy.special.logsumexp" Jun 18, 2020
@jakevdp self-assigned this Jun 18, 2020

@jakevdp (Collaborator) left a comment

Thanks for tackling this! Looks great. Just a few comments below:

@dpfau (Contributor, Author) commented Jun 19, 2020

I'm working on the broadcasting today. In the meantime, the tests seem to be failing due to a RuntimeWarning, caused by logsumexp taking the log of a negative value when return_sign is false. Since this is just a warning, I don't know why it's causing a test failure.

@hawkinsp (Collaborator) commented

We treat warnings as errors in tests (because we want a warning-clean build).

You'll need to avoid or suppress the warning. The jtu.ignore_warning decorator may help.
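
A minimal sketch of the underlying pattern using only the standard warnings module (my assumption is that jtu.ignore_warning applies a similar filter as a decorator); the message below is the one NumPy emits when log hits a negative value:

import warnings
import numpy as np

with warnings.catch_warnings():
    warnings.filterwarnings("ignore", category=RuntimeWarning,
                            message="invalid value encountered")
    result = np.log(-1.0)  # would otherwise raise a RuntimeWarning; yields nan
print(result)  # nan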

@dpfau (Contributor, Author) commented Jun 21, 2020

Seems like a whole bunch of unrelated tests are failing now. What's up?

@dpfau (Contributor, Author) commented Jun 21, 2020

Basically every test that uses tostring in TF is telling me to use tobytes instead, even though it worked fine 2 days ago.
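
(For context, this appears to be the NumPy 1.19 deprecation of ndarray.tostring in favor of the identically-behaved tobytes, which landed right around this time; a quick check, where the exact bytes depend on platform endianness:

import numpy as np

a = np.arange(3, dtype=np.int32)
print(a.tobytes())  # same bytes tostring() returned, without the DeprecationWarning
)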

@dpfau requested a review from jakevdp June 21, 2020 18:05
@jakevdp (Collaborator) commented Jun 22, 2020

The test failures should be fixed if you rebase your branch on master.

@dpfau requested a review from jakevdp June 22, 2020 21:04
@jakevdp (Collaborator) commented Jun 22, 2020

Looks great! There's a stray pdb.set_trace() presumably left over from debugging; if you remove that we can get this merged 😁

@dpfau (Contributor, Author) commented Jun 22, 2020

Whoops. Should be fixed now.

@dpfau (Contributor, Author) commented Jun 23, 2020

Can we merge this?

@jakevdp (Collaborator) commented Jun 23, 2020

Thanks! We're not quite ready to merge, unfortunately. I ran this through our internal tests, which exercise the GPU and TPU backends, and we're seeing a number of test failures on GPU/TPU (but not CPU) in cases where return_sign=True and use_b=False.

I'm not entirely certain what may be causing this, but given the random number generators specified in the tests, it's probably related to handling of nan and inf values in the inputs.

@dpfau (Contributor, Author) commented Jun 23, 2020

Ah I see. To make the behavior consistent with NumPy, I had it return NaN in the case where return_sign was false but the sign of the result was negative. I could undo that and just have it return the true value without the sign no matter what, but then the tests would have to be changed when comparing against NumPy.
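
For reference, a small illustration of the scipy behavior being matched here; with a negative scale b the sum can be negative, and return_sign=False then yields nan (along with a RuntimeWarning):

import numpy as np
import scipy.special as osp_special

x = np.array([0.0, 1.0])
b = np.array([1.0, -2.0])  # makes sum(b * exp(x)) negative

print(osp_special.logsumexp(x, b=b))                    # nan, plus a warning
print(osp_special.logsumexp(x, b=b, return_sign=True))  # (log|sum|, -1.0)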

@jakevdp (Collaborator) commented Jun 23, 2020

I've pasted an example failure below; it's in the comparison with numpy's output. The commonality of all the failures is:

  1. GPU or TPU backend
  2. return_sign=True and use_b=False

I'll have some time later today to help find the root of the issue. If you want to look into it before then, I'd focus on inputs containing some inf and nan values, given the random number generator the test is using in this case (as written, none of the CPU tests are covering inputs with nans and infs).

[device=GPU] LaxBackedScipyTests.testLogSumExp_shapes=float32[(2, 1, 4),(2, 1, 4)]_axis=0_keepdims=False_return_sign=True_use_b_False

Traceback (most recent call last):
  File "<embedded stdlib>/unittest/case.py", line 59, in testPartExecutor
    yield
  File "<embedded stdlib>/unittest/case.py", line 605, in run
    testMethod()
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/absl/testing/parameterized.py", line 282, in bound_param_test
    return test_method(self, **testcase_params)
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/jax/test_util.py", line 365, in test_method_wrapper
    return test_method(self, *args, **kwargs)
  File "<embedded stdlib>/contextlib.py", line 52, in inner
    return func(*args, **kwds)
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/jax/tests/lax_scipy_test.py", line 142, in testLogSumExp
    self._CheckAgainstNumpy(scipy_fun, lax_fun, args_maker)
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/jax/test_util.py", line 831, in _CheckAgainstNumpy
    canonicalize_dtypes=canonicalize_dtypes)
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/jax/test_util.py", line 756, in assertAllClose
    rtol=rtol, canonicalize_dtypes=canonicalize_dtypes)
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/jax/test_util.py", line 763, in assertAllClose
    self.assertArraysAllClose(x, y, check_dtypes=False, atol=atol, rtol=rtol)
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/jax/test_util.py", line 730, in assertArraysAllClose
    _assert_numpy_allclose(x, y, atol=atol, rtol=rtol)
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/jax/test_util.py", line 117, in _assert_numpy_allclose
    np.testing.assert_allclose(a, b, **kw)
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/numpy/testing/_private/utils.py", line 1501, in assert_allclose
    verbose=verbose, header=header, equal_nan=equal_nan)
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/numpy/testing/_private/utils.py", line 757, in assert_array_compare
    flagged = func_assert_same_pos(x, y, func=isnan, hasval='nan')
  File "/build/work/474fee93c453555e4cdf942d5ea3c8d371f2/google3/runfiles/google3/third_party/py/numpy/testing/_private/utils.py", line 733, in func_assert_same_pos
    raise AssertionError(msg)
AssertionError: 
Not equal to tolerance rtol=1e-06, atol=1e-06

x and y nan location mismatch:
 x: array([[ 1., nan, nan,  1.]], dtype=float32)
 y: array([[1., 1., 1., 1.]], dtype=float32)

@dpfau (Contributor, Author) commented Jun 23, 2020

So if the input contains -inf values, it actually should still give finite values. I'll take a look.
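
A quick check of that expectation, assuming scipy's conventions: -inf entries contribute exp(-inf) = 0 to the sum, so the result stays finite whenever at least one entry is finite, and an all--inf reduction gives (-inf, 0):

import numpy as np
import scipy.special as osp_special

print(osp_special.logsumexp(np.array([-np.inf, 0.0])))  # 0.0
print(osp_special.logsumexp(np.array([-np.inf, -np.inf]),
                            return_sign=True))          # (-inf, 0.0)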

@jakevdp (Collaborator) commented Jun 23, 2020

Here's a short example of where this implementation differs from scipy:

import numpy as np
import jax.numpy as jnp
import scipy.special as osp_special
from jax.scipy.special import logsumexp  # in this branch

np.random.seed(0)
x = np.random.rand(3, 4)
x[x < 0.4] = np.nan

arr1, sign1 = logsumexp(jnp.array(x), axis=0, return_sign=True)
arr2, sign2 = osp_special.logsumexp(x, axis=0, return_sign=True)

print(x)
print(sign1)
print(sign2)

Output:

[[0.5488135  0.71518937 0.60276338 0.54488318]
 [0.4236548  0.64589411 0.43758721 0.891773  ]
 [0.96366276        nan 0.79172504 0.52889492]]
[1. 1. 1. 1.]
[ 1. nan  1.  1.]

@dpfau (Contributor, Author) commented Jun 23, 2020 via email

@jakevdp (Collaborator) commented Jun 23, 2020

Getting closer... there's still an issue when the data contains large negative numbers: your implementation returns sign=1 where numpy returns sign=0. For example:

import numpy as np
import jax.numpy as jnp
import scipy.special as osp_special
from jax.scipy.special import logsumexp

np.random.seed(0)
x = np.random.rand(3, 4)
x[:, 1] = -np.inf

arr1, sign1 = logsumexp(jnp.array(x), axis=0, return_sign=True)
arr2, sign2 = osp_special.logsumexp(x, axis=0, return_sign=True)

print(sign1)
print(sign2)

Output:

[1. 1. 1. 1.]
[1. 0. 1. 1.]

(Side-note: like the previous failures, these come up in the GPU/TPU tests and not the CPU tests, because the current test uses a different random number generator on CPU than on GPU/TPU. I'm not sure why that's the case, but you might look into changing the test to use rand_some_inf_and_nan for CPU tests as well in order to catch these errors more quickly).

@dpfau (Contributor, Author) commented Jun 23, 2020

Honestly, the tests pass fine when I swap in rand_some_inf_and_nan. I'll keep trying.

@jakevdp (Collaborator) commented Jun 23, 2020

> Honestly, the tests pass fine when I swap in rand_some_inf_and_nan. I'll keep trying.

This is probably because a failing case is not generated with the default num_generated_cases=10. On GitHub CI the value is 25, and on borg it's 100, I believe, so many more inputs are tested there. You can pass, e.g., --num_generated_cases=50 locally to test more cases (see https://jax.readthedocs.io/en/latest/developer.html#running-the-tests for details).

@jakevdp (Collaborator) commented Jun 23, 2020

It looks like this does it! Thanks for the contribution!

@jakevdp merged commit 9d173c6 into jax-ml:master Jun 23, 2020