
Conversation

@zeenolife
Contributor

Description

As described in #390, this PR addresses initialisation of TensorDicts from nested Python dictionaries.

Namely, it adds three new features:

  1. Nested dict initialisation, with inferred batch size and device:
TensorDict({"a": {"b": torch.randn(3, 1)}}, [3])

returns

TensorDict(
    fields={
        a: TensorDict(
            fields={
                b: Tensor(torch.Size([3, 1]), dtype=torch.float32)},
            batch_size=torch.Size([3]),
            device=cpu,
            is_shared=False)},
    batch_size=torch.Size([3]),
    device=cpu,
    is_shared=False)
  2. Setting a dict value, with inferred batch size and device:
td = TensorDict({"a": torch.randn(3, 1)}, [3])
td["b"] = {"c": torch.randn(3, 4)}

returns

TensorDict(
    fields={
        a: Tensor(torch.Size([3, 1]), dtype=torch.float32),
        b: TensorDict(
            fields={
                c: Tensor(torch.Size([3, 4]), dtype=torch.float32)},
            batch_size=torch.Size([3]),
            device=cpu,
            is_shared=False)},
    batch_size=torch.Size([3]),
    device=cpu,
    is_shared=False)
  3. Recursive conversion of a TensorDict into a dict:
td = TensorDict({"a": torch.randn(3, 1)}, [3])
td["b"] = {"c": torch.randn(3, 4)}
td.to_dict()

returns

{'a': tensor([[-2.0345],
        [ 0.8855],
        [-0.6279]]), 'b': {'c': tensor([[ 0.2649, -1.3553, -0.0903,  1.7265],
        [-0.0252,  1.1936, -0.2416,  0.1220],
        [ 0.2263,  0.6542,  0.4279,  0.2826]])}}

Motivation and Context

The motivation and context for the change are described in #390.

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

  • New feature (non-breaking change which adds core functionality)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

  • [x] I have read the CONTRIBUTION guide (required)
  • [ ] My change requires a change to the documentation.
  • [x] I have updated the tests accordingly (required for a bug fix or a new feature).
  • [ ] I have updated the documentation accordingly.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 8, 2022
@zeenolife
Contributor Author

It seems the tests are currently failing due to the new version of gym; this is being handled in #403.

@zeenolife
Contributor Author

PTAL @vmoens

@vmoens vmoens changed the title [DRAFT] Adding support for initialising TensorDicts from nested dicts. Addressing #390 [DRAFT] Adding support for initialising TensorDicts from nested dicts Sep 8, 2022
@vmoens vmoens linked an issue Sep 8, 2022 that may be closed by this pull request
3 tasks
@vmoens vmoens added the enhancement New feature or request label Sep 8, 2022
@zeenolife zeenolife changed the title [DRAFT] Adding support for initialising TensorDicts from nested dicts [Feature] Adding support for initialising TensorDicts from nested dicts Sep 9, 2022
Collaborator

@vmoens vmoens left a comment


Almost there: It seems everything is coded and the CI is happy!
Can we test this kind of setting:

tensordict[:, :2] = {"a": torch.randn(3, 4, 5), ...}

One way to test that would be

sub_td = tensordict[:, :2].to_tensordict()  # clone the data to a new tensordict
sub_td.zero_()
sub_dict = sub_td.to_dict()
tensordict[:, :2] = sub_dict
# check that all values in tensordict[:, :2] are zero

)
raise err
else:
indexed_bs = _getitem_batch_size(self.batch_size, index)
Contributor Author


I've put the TensorDict casting here, because otherwise the batch size wouldn't be computed correctly for broadcasting.

- future
- cloudpickle
- gym
- gym==0.25.1
Collaborator


we should be able to remove this now

Contributor Author


The main branch currently has this fixed too; should I remove it here?

@zeenolife
Contributor Author

Note about the PR:

  • Ideally, we would want the _process_tensor() function to be the main and only "pre-processor" of the values. This would let us move all the logic out of the __setitem__ dunder method and into the .set*() methods. However, the .set*() methods of the TensorDictBase subclasses are currently inconsistent, which results in complicated inter-calls and errors. I propose creating a follow-up task to make all .set*() methods consistent, so that _process_tensor() becomes the single "pre-processor" of the input.
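To illustrate the proposed refactor, here is a toy sketch (hypothetical code, not the library's actual internals; the class and its helpers are stand-ins) of what routing every setter through a single pre-processor could look like:

```python
import torch

class TinyTensorDict:
    """Toy stand-in, NOT the real TensorDict: illustrates one pre-processor."""

    def __init__(self, batch_size):
        self._data = {}
        self.batch_size = torch.Size(batch_size)

    def _process_tensor(self, value):
        # Single entry point for all values: recurse into dicts and
        # validate that leading dims match the batch size.
        if isinstance(value, dict):
            nested = TinyTensorDict(self.batch_size)
            for k, v in value.items():
                nested.set(k, v)
            return nested
        value = torch.as_tensor(value)
        if value.shape[: len(self.batch_size)] != self.batch_size:
            raise RuntimeError("batch dimension mismatch")
        return value

    def set(self, key, value):
        self._data[key] = self._process_tensor(value)
        return self

    def __setitem__(self, key, value):
        # __setitem__ just delegates to set(); no duplicated preprocessing
        self.set(key, value)

td = TinyTensorDict([3])
td["a"] = {"b": torch.randn(3, 2)}  # dict routed through _process_tensor
td.set("c", torch.randn(3))
```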

@vmoens
Collaborator

vmoens commented Sep 12, 2022

  • Ideally, we would want the _process_tensor() function to be the main and only "pre-processor" of the values. This would let us move all the logic out of the __setitem__ dunder method and into the .set*() methods. However, the .set*() methods of the TensorDictBase subclasses are currently inconsistent, which results in complicated inter-calls and errors. I propose creating a follow-up task to make all .set*() methods consistent, so that _process_tensor() becomes the single "pre-processor" of the input.

We definitely need some hardcore cleanup in the calls from __setitem__ -> set / set_ / set_at_ -> _process_tensor!

@vmoens vmoens merged commit 59007c3 into pytorch:main Sep 12, 2022


Development

Successfully merging this pull request may close these issues.

[Feature Request] Creating tensordicts from nested dictionaries (and returning them)

3 participants