Submitted by Animated-AI t3_118c8pp in MachineLearning

Hey everyone,

I've created what I believe is the first animation of a depthwise-separable convolution, and I thought you might appreciate it. I think this fills a legitimate gap in the instructional material available out there.

https://i.redd.it/o1bns0jjskja1.gif

I've actually been dissatisfied with the existing convolution animations in general (and ranted about it on youtube). So I made my own set of animations and published them on animatedai.github.io.

If you find any of them useful, please feel free to copy them, post them on your website, throw them in a powerpoint, or just link to them.

338

Comments

You must log in or register to comment.

marcus_hk t1_j9gij1a wrote

Looks great. Might not be intelligible to those who don't know what they're looking at, though. Maybe include labels of, say, filters, what each slice of input represents, etc.?

Would like to see the same for normalization layers. And RNNs. And transformers. Keep it up!

61

dahitokiri t1_j9gnuhp wrote

Can you share how you go about creating these animations? A tutorial on that would help others in the field produce helpful animations as well.

7

Animated-AI OP t1_j9gojs5 wrote

I'm using Blender and making heavy use of the Geometry Nodes feature. Unfortunately, these animations have taken a lot of effort and blender-specific knowledge, and building on top of my work for a new application would require more of both. But if others aren't deterred by that, I could publish the blender files.

23

Own_Quality_5321 t1_j9gp7ne wrote

I teach Deep Learning and I send you a big thank you. I will refer students to your website and channel ☺️

22

gdpoc t1_j9gqaue wrote

I'll be using this content to illustrate, thanks!

2

Animated-AI OP t1_j9gvww8 wrote

Thanks for the feedback! I agree; the animations are only meant to be visual aids in the context of some larger explanation (lecture, blog post, etc). In my case, I'm making YouTube videos to serve as complete explanations.

Transformers have been the most requested topic on my YouTube channel. So I'm going to attempt to make videos/animations about that when I finish my current series on convolution.

24

jaeja_helvitid_thitt t1_j9hfvi0 wrote

I don't think the existing animations are strictly wrong, they just don't show the last dimension.

2

zds-nlp t1_j9hulkz wrote

This is brilliant, thanks for sharing

1

SankarshanaV t1_j9if980 wrote

For the input, each kernel is acting upon ONE channel only, right?

But in general, shouldn't the number of channels of the kernel be equal to that of the previous layer?

1

nirehtylsotstniop t1_j9j135a wrote

Very Sick, these are the best ive seen. Thanks for making these. May i ask how you made them.

1

Asd4Ever t1_j9j2kiv wrote

Thank you for the wonderful tool OP

1

thejuror8 t1_j9nyedd wrote

I recognized the ResNet bottleneck!

Very cool, you did a great job there.

1