_d0s_

_d0s_ t1_ivxk6y9 wrote

It's also used for spatial embedding of patches in an image

Besides the positional embedding, transformers also use the attention mechanism, which can be beneficial for some problems on its own.
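To make the patch idea concrete, here is a rough sketch of one way to give each image patch a position-dependent vector. It assumes non-overlapping square patches and a fixed 2D sinusoidal encoding; many vision transformers learn these embeddings instead. The function names and the `dim` parameter are made up for illustration and do not come from any particular library.

```python
import math

def patch_positions(height, width, patch):
    """(row, col) grid coordinates of each non-overlapping patch, row-major."""
    rows, cols = height // patch, width // patch
    return [(r, c) for r in range(rows) for c in range(cols)]

def sincos_1d(pos, dim):
    """Sinusoidal encoding of one scalar position into `dim` values (dim even)."""
    out = []
    for i in range(dim // 2):
        freq = 1.0 / (10000 ** (2 * i / dim))
        out.append(math.sin(pos * freq))
        out.append(math.cos(pos * freq))
    return out

def patch_pos_embedding(height, width, patch, dim):
    """One vector per patch: row encoding followed by column encoding.

    Assumes `dim` is divisible by 4 so each half has an even length.
    """
    half = dim // 2
    return [
        sincos_1d(r, half) + sincos_1d(c, half)
        for r, c in patch_positions(height, width, patch)
    ]

# A 32x32 image cut into 16x16 patches gives a 2x2 grid, i.e. 4 patches.
emb = patch_pos_embedding(32, 32, 16, 8)
print(len(emb), len(emb[0]))
```

In a real model these vectors would be added to the patch features before the attention layers, so that otherwise order-agnostic attention can tell patches apart by location.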

1

_d0s_ t1_ivp036a wrote

besides being extremely computationally expensive, how would one define the size of the volume? it's similar to the problem of choosing a step size, which overshoots when too large or takes forever when too small. likewise, defining a very small volume might get us caught in a local minimum.

i guess this thought is similar to smoothing, as another poster mentioned.
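a toy illustration of the step-size half of that analogy, assuming plain gradient descent on f(x) = x^2 (gradient 2x). the function name and the specific learning rates are just picked for the example; it says nothing about how the volume itself should be chosen.

```python
def gradient_descent(lr, x0=1.0, steps=50):
    """Run `steps` gradient descent updates on f(x) = x**2 and return the final x."""
    x = x0
    for _ in range(steps):
        x -= lr * 2 * x  # gradient of x**2 is 2*x
    return x

# Each update multiplies x by (1 - 2*lr), so:
#   lr = 1.1   -> factor -1.2,  |x| grows every step (overshoots and diverges)
#   lr = 0.001 -> factor 0.998, x barely moves in 50 steps
#   lr = 0.25  -> factor 0.5,   x halves every step
for lr in (1.1, 0.001, 0.25):
    print(lr, gradient_descent(lr))
```

the same tension would apply to a neighbourhood size: too large and the local structure is washed out, too small and it behaves like the plain pointwise objective.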

5