Skip to content

Conversation

cypof
Copy link
Member

@cypof cypof commented Sep 9, 2016

For timing operations in a net, it should be possible to add events to a GPU stream without synchronizing, to have a lower impact on performance and more accuracy. Synchronizing is only required when reading the timer, e.g. once at the end of an iteration instead of at each layer.

@shelhamer
Copy link
Member

Looks good to me. Thanks Cyprien!

@shelhamer shelhamer merged commit 048530a into BVLC:master Sep 9, 2016
@shelhamer
Copy link
Member

shelhamer commented Sep 9, 2016

@flx42 could you comment on whether this resolves #4453? I see that you took another approach there with two passes, but is that necessary/is the overhead that significant?

@ujsyehao
Copy link

@shelhamer It is a big overhead.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants