[Feature]: R3M integration #321

vmoens · 2022-07-25T16:32:43Z

R3M integration in torchrl

As soon as I'm granted access to the domain's S3 bucket, I'll upload the ResNet weights.

Here's a code snippet to get started:

import torch
from torchrl.envs.transforms import R3MTransform
from torchrl.data import TensorDict

for shape in ([], [2], [2, 3]):
    td = TensorDict({"next_observation": torch.randint(255, (*shape, 224, 224, 3))}, shape)
    transform = R3MTransform(model_name="resnet50", keys_in=["next_observation"])
    transform(td)
    print(td)
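The loop above exercises three batch shapes (`[]`, `[2]` and `[2, 3]`) in front of the same 224 x 224 x 3 image. The PR does not spell out how the transform deals with those leading dimensions, so the following is only a sketch of one common approach, not the actual R3MTransform code: collapse all batch dimensions into one before calling the network, then restore them on the output. The helper names `flatten_batch` and `restore_batch` are hypothetical, and 2048 is used purely as an illustrative embedding width.

```python
from math import prod


def flatten_batch(shape, n_image_dims=3):
    """Split `shape` into (batch_shape, flat_shape).

    flat_shape has exactly one leading batch dimension, which is what a
    convolutional network usually expects. Hypothetical helper.
    """
    shape = tuple(shape)
    if len(shape) < n_image_dims:
        raise ValueError(f"expected at least {n_image_dims} dims, got {shape}")
    split = len(shape) - n_image_dims
    batch_shape, image_shape = shape[:split], shape[split:]
    # prod(()) == 1, so an unbatched image becomes a batch of one.
    return batch_shape, (prod(batch_shape), *image_shape)


def restore_batch(batch_shape, flat_output_shape):
    """Replace the single flat batch dimension with the original batch dims."""
    return (*batch_shape, *flat_output_shape[1:])


for batch in ([], [2], [2, 3]):
    batch_shape, flat_shape = flatten_batch((*batch, 224, 224, 3))
    # Assumed embedding width, for illustration only.
    out_shape = restore_batch(batch_shape, (flat_shape[0], 2048))
    print(batch_shape, flat_shape, out_shape)
```

With real tensors, the same bookkeeping would typically be done with a reshape before and after the forward pass.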

Downloading weights

Weights can be downloaded by passing download=True:
transform = R3MTransform(model_name="resnet50", keys_in=["next_observation"], download=True)

Benefits:

Faster execution
Possibility of executing R3M on a large number of workers (the current implementation collides above roughly 8 workers)
Possibility of downloading specific versions of R3M
Makes it easy to execute R3M on images stored in the replay buffer (if needed)
Fully customisable (e.g. can return both the R3M embedding and the original image, and can work with any environment that returns images, not only gym)

Efficiency

Compared with the old implementation, the following code runs at 1.4 sec per batch of 4 rollouts on CUDA for the torchrl version, versus 5.8 sec for the mj_envs implementation:

from torchrl.trainers.helpers.envs import LIBS
from utils import MJEnv
from torchrl.envs import R3MTransform, TransformedEnv, ParallelEnv
import time
import torch

LIBS["mjenv"] = MJEnv

# Use the first GPU when CUDA is available, otherwise fall back to CPU.
device = torch.device("cuda:0") if torch.cuda.is_available() else torch.device("cpu")
if __name__ == "__main__":
    transform = R3MTransform(model_name="resnet18", keys_in=["next_pixels"])
    env1 = TransformedEnv(
        ParallelEnv(16, lambda: MJEnv("kitchen_micro_open-v3", from_pixels=True,
                                     device=device)
                    ),
        transform=transform.eval(),
    )
    assert env1.device == device
    # assert not env1.transform.training
    # assert not env1.transform[-1].convnet.training
    del transform

    env1.reset()
    t0 = time.time()
    print(env1.rollout(max_steps=20))
    print(time.time() - t0)
    t0 = time.time()
    print(env1.rollout(max_steps=20))
    print(time.time() - t0)
    env1.close()
    del env1

    env2 = ParallelEnv(
        16,
        lambda: MJEnv("kitchen_micro_open-v4", device=device)
    )
    assert env2.device == device
    env2.reset()
    t0 = time.time()
    print(env2.rollout(max_steps=20))
    print(time.time() - t0)
    t0 = time.time()
    print(env2.rollout(max_steps=20))
    print(time.time() - t0)
    env2.close()
    del env2

(MJEnv is the TorchRL wrapper for mj_env environments)
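The benchmark above repeats the same `t0 = time.time()` / `print(time.time() - t0)` pattern four times and times each rollout twice, so the first call absorbs any start-up cost. A minimal sketch of a reusable helper for that pattern follows. `time_calls` is a hypothetical name, not part of torchrl, and the workload shown is a stand-in for `env.rollout(max_steps=20)`.

```python
import time
from statistics import mean


def time_calls(fn, n_warmup=1, n_repeats=3):
    """Run `fn` n_warmup times untimed, then return the wall-clock
    duration in seconds of each of n_repeats timed calls."""
    for _ in range(n_warmup):
        fn()
    durations = []
    for _ in range(n_repeats):
        t0 = time.perf_counter()  # monotonic clock, better suited than time.time()
        fn()
        durations.append(time.perf_counter() - t0)
    return durations


# Stand-in workload; in the benchmark this would be a rollout call.
durations = time_calls(lambda: sum(i * i for i in range(100_000)))
print(f"mean {mean(durations):.4f} s over {len(durations)} runs")
```

When the timed function launches CUDA work, kernels may still be running when it returns, so calling `torch.cuda.synchronize()` at the end of `fn` gives a more faithful measurement.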

TODO:

store weights on AWS
write tests

cc @vikashplus @suraj-nair-1

vmoens added 3 commits July 12, 2022 17:20

init

aa810af

amend

72660f3

amend

7c25135

facebook-github-bot added the CLA Signed label Jul 25, 2022

vmoens added the enhancement label Jul 25, 2022

vmoens added 4 commits July 25, 2022 22:15

amend

7f1625e

Merge branch 'main' into r3m_integration

cb3e714

no grad

7b47eaf

Merge branch 'main' into r3m_integration

a49849c

vmoens requested a review from vikashplus August 8, 2022 17:03

vmoens added 2 commits August 30, 2022 21:53

download weights from s3

acb5eac

Merge branch 'main' into r3m_integration

e6cbc9e

vmoens changed the title from "[WIP]: R3M integration" to "[Feature]: R3M integration" Aug 31, 2022

vmoens added 2 commits August 31, 2022 10:51

amend

6442ba0

amend

577e17d

vmoens marked this pull request as ready for review August 31, 2022 18:22

vmoens merged commit a61c8a5 into main Aug 31, 2022

vmoens deleted the r3m_integration branch August 31, 2022 18:22
