Why should the image sizes be divisible by 8 ? #272

tcourat · 2023-05-24T15:38:25Z

tcourat
May 24, 2023

The model seems to run fine even when image sizes are not divisible by 8. Then why is it stated that it should be divisible by 8 ?

williamhoole · 2024-07-23T06:44:27Z

williamhoole
Jul 23, 2024

The reason for this is because of the Resolution in the coarse matching. This means that a coarse match is done at every 8 pixels. Another reason is due to the input into the Vision transformer as the token size is reduced. This is why it is recomended to use a Image size that is divisible by 8.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Why should the image sizes be divisible by 8 ? #272

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Why should the image sizes be divisible by 8 ? #272

Uh oh!

tcourat May 24, 2023

Replies: 1 comment

Uh oh!

williamhoole Jul 23, 2024

tcourat
May 24, 2023

williamhoole
Jul 23, 2024