Skip to content

GPU execution time statistics for backward computation. #1132

@jiaoff-hub

Description

@jiaoff-hub

I would like to use record_function to mark the GPU-side execution time of both the forward and backward passes during training. However, in the collected profiling results, I found that only the forward pass was correctly annotated on the stream, while the backward pass only showed up in the CPU-side profiler. Could you tell me how to apply record_function so that it is correctly associated with the GPU stream during the backward pass?

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions