
Conversation

@mtthss (Contributor) commented Nov 1, 2019

Introducing optix: a composable gradient processing and optimization library.

Its objective is to support the composition of arbitrary sets of gradient transformations,
including sequential transformations (e.g. clip, then rescale by RMS) and parallel transformations (where multiple distinct optimizers share a subset of the variables to optimize).

Many popular optimizers can be implemented as one-liners and, for convenience,
we additionally provide aliases for the most common ones.
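The sequential-composition idea can be sketched in plain Python (hypothetical helper names, flat lists standing in for parameter trees; this illustrates the pattern, not the actual optix API):

```python
# A minimal sketch of composable gradient transformations, assuming each
# transformation is a function from gradients to gradients.

def clip(max_delta):
    """Elementwise clipping of gradients to [-max_delta, max_delta]."""
    def transform(grads):
        return [max(-max_delta, min(max_delta, g)) for g in grads]
    return transform

def scale(factor):
    """Rescale all gradients by a fixed factor (e.g. -learning_rate)."""
    def transform(grads):
        return [factor * g for g in grads]
    return transform

def chain(*transforms):
    """Compose transformations sequentially: each one's output feeds the next."""
    def transform(grads):
        for t in transforms:
            grads = t(grads)
        return grads
    return transform

# A "one-liner" optimizer: clip, then take a gradient-descent step.
sgd_with_clipping = chain(clip(1.0), scale(-0.1))
updates = sgd_with_clipping([10.0, -0.5])  # -> [-0.1, 0.05]
```

Because each transformation shares the same interface, swapping the clipping for any other preprocessing step, or reordering the chain, is a one-line change.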

@googlebot (Collaborator)

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.



@mtthss (Contributor, Author) commented Nov 1, 2019

@googlebot I signed it
(I am a Google employee, so I registered as such.)

@googlebot (Collaborator)

CLAs look good, thanks!


@googlebot added the cla: yes label and removed the cla: no label Nov 1, 2019
### Utilities for building and using custom optimizers. ###


def chainer(*args):
Contributor:

I feel like this is maybe better as `chain`?

Contributor (Author):
Done


def update_fn(updates, state):
  f = lambda g, t: g + decay * t
  update_trace = tree_multimap(f, updates, state.trace)
Contributor:
Almost every optimizer `update_fn` (all of them except `clip_by_global_norm`) performs `tree_multimap` over an inner per-parameter update. Is that a pattern that you can extract out, like `@optimizer` in optimizers.py, or is there a reason why that doesn't work?

Contributor (Author):

I purposefully diverged from optimizers.py in this respect, for two reasons:

  1. Several gradient transformations of interest need to consider the entire gradient and cannot be computed variable by variable. The `clip_by_global_norm` transformation included here is one example, but others could be PopArt, KFac, etc.
  2. Only a handful of characters are saved, at the cost of introducing an additional level of indirection and a reduction in flexibility.
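The distinction in point 1 can be illustrated with a plain-Python sketch (flat lists standing in for gradient pytrees; the helper names are illustrative, and the global-norm clipping mirrors the transformation discussed above rather than reproducing its exact code):

```python
import math

def per_variable_decay(grads, traces, decay=0.9):
    # A per-variable rule: each output element depends only on the
    # corresponding (gradient, trace) pair, so a simple map suffices.
    return [g + decay * t for g, t in zip(grads, traces)]

def clip_by_global_norm(grads, max_norm=1.0):
    # A whole-gradient rule: the scaling factor depends on the norm of
    # *all* variables together, so no independent per-variable map can
    # express it -- the reduction couples every leaf of the tree.
    global_norm = math.sqrt(sum(g * g for g in grads))
    factor = min(1.0, max_norm / global_norm) if global_norm > 0 else 1.0
    return [g * factor for g in grads]

clipped = clip_by_global_norm([3.0, 4.0], max_norm=1.0)  # norm 5, scaled by 1/5
# clipped is approximately [0.6, 0.8]
```

A decorator that fixes the per-variable map as the outer structure would make transformations like the second one impossible to express without escaping the abstraction.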

return tree_multimap(lambda p, u: p + u, params, updates)


### Aliases for popular optimizers. ###
Contributor:
Are there one or two (maybe less popular) optimizers that you can add as examples of the benefits of composability? For instance, perhaps nadam from https://openreview.net/pdf?id=OM0jvwB8jIp57ZJjtNEZ or LARS can be expressed using `chainer` and the existing primitives?

Contributor (Author):
As an example I added noisy_sgd, from the paper https://arxiv.org/abs/1511.06807.

Crucially, though, the composable nature of optix allows the user to build, also in a one-liner, a noisy_adam or noisy_rmsprop, or more generally to combine the idea from the paper with any optimizer of their choice.

By comparison, in TF or jax/experimental/optimizers.py the user could only add noise to the gradient before applying the adam/rmsprop rescaling, because adam/rmsprop would immediately apply the updates, and the user could not insert themselves before the update step without rewriting the entire optimizer.

In optix, instead, the user may experiment with adding the noise before or after the adam/rmsprop rescaling (and I suspect the latter, very complicated without optix, will actually work better).
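The ordering experiment described above can be sketched in plain Python (hypothetical helper names, flat lists in place of pytrees; the rescaling is a stateless stand-in, not the real rmsprop or optix implementation):

```python
import random

def chain(*transforms):
    # Sequential composition: each transformation's output feeds the next.
    def transform(grads):
        for t in transforms:
            grads = t(grads)
        return grads
    return transform

def add_noise(stddev, rng):
    # Gradient noise in the spirit of https://arxiv.org/abs/1511.06807.
    def transform(grads):
        return [g + rng.gauss(0.0, stddev) for g in grads]
    return transform

def scale_by_rms(eps=1e-8):
    # Stateless stand-in for an rmsprop-style rescaling (a real version
    # would track a moving average of squared gradients in its state).
    def transform(grads):
        return [g / (abs(g) + eps) for g in grads]
    return transform

rng = random.Random(0)
# Noise injected *before* the rescaling...
noisy_then_rescale = chain(add_noise(0.01, rng), scale_by_rms())
# ...or *after* it: a one-line reordering, with no optimizer rewrite.
rescale_then_noisy = chain(scale_by_rms(), add_noise(0.01, rng))
```

Swapping the two orderings is just a matter of reordering the arguments to `chain`, which is exactly the experiment that a monolithic optimizer makes hard.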

@jekbradbury jekbradbury requested a review from mattjj November 1, 2019 21:39
@jekbradbury jekbradbury merged commit e4d4e4e into jax-ml:master Nov 7, 2019
@mattjj mattjj mentioned this pull request Mar 10, 2020