Quantcast
Viewing latest article 18
Browse Latest Browse All 62441

[D] Dynamic patch weighting in ViTs

Has anyone explored weighting non-overlapping patches in images using ViTs? The weights would be part of learnable parameters. For instance, the background patches are sometimes useless for an image classification task. I am hypothesising that including this as a part of image embedding might be adding noise.

It would be great if someone could point me to some relevant works.

submitted by /u/arjun_r_kaushik
[link] [comments]

Viewing latest article 18
Browse Latest Browse All 62441

Trending Articles