Submitted by shahaff32 t3_y22rk0 in MachineLearning

Our NeurIPS 2022 paper "Wavelet Feature Maps Compression for Image-to-Image CNNs" is now available.

In this paper, we propose a novel approach to compressing the activation maps of CNNs using a modified wavelet compression technique.

Abstract:

>Convolutional Neural Networks (CNNs) are known for requiring extensive computational resources, and quantization is among the best and most common methods for compressing them. While aggressive quantization (i.e., less than 4 bits) performs well for classification, it may cause severe performance degradation in image-to-image tasks such as semantic segmentation and depth estimation. In this paper, we propose Wavelet Compressed Convolution (WCC) -- a novel approach for high-resolution activation map compression integrated with point-wise convolutions, which are the main computational cost of modern architectures. To this end, we use an efficient and hardware-friendly Haar-wavelet transform, known for its effectiveness in image compression, and define the convolution on the compressed activation map. We experiment with various tasks that benefit from high-resolution input. By combining WCC with light quantization, we achieve compression rates equivalent to 1-4 bit activation quantization with relatively small and much more graceful degradation in performance.
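
To give a feel for the core idea: a point-wise (1x1) convolution only mixes channels, so it commutes with a spatial-only linear map such as the Haar transform, which is what allows the convolution to be evaluated directly on the wavelet coefficients. A toy sketch of that commutation (simplified, not our actual implementation):

```python
import torch
import torch.nn.functional as F

def haar_2d(x):
    # One level of the 2D Haar transform via strided sums/differences.
    a = (x[..., 0::2, :] + x[..., 1::2, :]) / 2    # rows: low-pass
    d = (x[..., 0::2, :] - x[..., 1::2, :]) / 2    # rows: high-pass
    ll = (a[..., :, 0::2] + a[..., :, 1::2]) / 2   # columns on the low-pass part
    lh = (a[..., :, 0::2] - a[..., :, 1::2]) / 2
    hl = (d[..., :, 0::2] + d[..., :, 1::2]) / 2
    hh = (d[..., :, 0::2] - d[..., :, 1::2]) / 2
    return torch.cat([ll, lh, hl, hh], dim=-1)     # subbands stacked side by side

x = torch.randn(1, 8, 32, 32)                      # N, C, H, W
w = torch.randn(16, 8, 1, 1)                       # point-wise (1x1) kernel

out_a = haar_2d(F.conv2d(x, w))                    # convolve, then transform
out_b = F.conv2d(haar_2d(x), w)                    # transform, then convolve
print(torch.allclose(out_a, out_b, atol=1e-5))     # True: the two orders agree
```

In WCC the wavelet coefficients are additionally compressed before the point-wise convolution is applied to them.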


Cityscapes semantic segmentation with different compressions.


KITTI depth prediction with different compressions.

152

Comments


londons_explorer t1_is0rn7l wrote

This is the kind of research that makes companies with hardware accelerators (Google, Nvidia, Tesla, etc.) suddenly have to redesign and re-buy their very expensive hardware accelerators...

16

shahaff32 OP t1_is0ths5 wrote

This is aimed mostly at edge devices, where an accelerator is not available (e.g. mobile phones), or where you want to design a cheaper chip for a product that requires running such networks (e.g. autonomous vehicles).

This work was, in fact, partially supported by the AVATAR consortium, which is aimed at smart vehicles: https://avatar.org.il/

23

londons_explorer t1_is11x8p wrote

Sure this work was aimed at that, but these same techniques can be used to make a datacenter-scale inference machine into an even more powerful one.

And presumably, if a way can be found to do backpropagation in the 'wavelet domain', then training could be done like this too.

9

shahaff32 OP t1_is13c2c wrote

We are in fact doing the backpropagation in the wavelet domain :)

The gradient simply goes through the inverse wavelet transform


See WCC/util/wavelet.py in our GitHub repo; lines 52-83 define the forward/backward passes of the WT and IWT.
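
Roughly, the mechanism looks like this (a simplified sketch, not the actual code from the repo): because the orthonormal Haar transform satisfies W^-1 = W^T, the backward pass of the forward transform is just the inverse transform, and vice versa.

```python
import torch

def haar_fwd(x):
    # One level of the orthonormal 1D Haar transform along the last dimension.
    s = (x[..., 0::2] + x[..., 1::2]) / 2 ** 0.5   # low-pass (sums)
    d = (x[..., 0::2] - x[..., 1::2]) / 2 ** 0.5   # high-pass (differences)
    return torch.cat([s, d], dim=-1)

def haar_inv(y):
    # Exact inverse of haar_fwd.
    n = y.shape[-1] // 2
    s, d = y[..., :n], y[..., n:]
    x = torch.empty_like(y)
    x[..., 0::2] = (s + d) / 2 ** 0.5
    x[..., 1::2] = (s - d) / 2 ** 0.5
    return x

class HaarTransform(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        return haar_fwd(x)

    @staticmethod
    def backward(ctx, grad_out):
        # For an orthonormal transform, grad_x = W^T grad_y = W^{-1} grad_y,
        # so the gradient simply flows back through the inverse transform.
        return haar_inv(grad_out)

# Numerical check that this backward is consistent with the forward.
x = torch.randn(2, 8, dtype=torch.double, requires_grad=True)
print(torch.autograd.gradcheck(HaarTransform.apply, x))  # True
```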

18

NeverCast t1_is2f1jt wrote

The immediate use case for me is autonomous flight vehicles, where weight and battery usage matter.

2

hughperman t1_is1qyob wrote

So, since the wavelets here are just filter banks, equivalent to fixed (non-varying) convolution + downsampling blocks: could you learn an improved set of wavelet filters to improve this result?
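
Roughly what I mean, as a sketch with made-up shapes: one Haar level is just a stride-2 depthwise convolution with four fixed 2x2 kernels per channel, so "learning the wavelet" would amount to making those kernels trainable.

```python
import torch
import torch.nn.functional as F

# Fixed LL, LH, HL, HH kernels (averaging normalization).
k = torch.tensor([[[ .25,  .25], [ .25,  .25]],
                  [[ .25, -.25], [ .25, -.25]],
                  [[ .25,  .25], [-.25, -.25]],
                  [[ .25, -.25], [-.25,  .25]]])

x = torch.randn(1, 3, 8, 8)                          # N, C, H, W
C = x.shape[1]
weight = k.unsqueeze(1).repeat(C, 1, 1, 1)           # (4*C, 1, 2, 2), depthwise
subbands = F.conv2d(x, weight, stride=2, groups=C)   # one Haar level per channel

# Sanity check: the first output channel is the LL band of input channel 0,
# i.e. the 2x2 block average.
print(torch.allclose(subbands[:, :1], F.avg_pool2d(x[:, :1], 2), atol=1e-5))

# A learned variant would wrap `weight` in torch.nn.Parameter instead of
# keeping it fixed, at the cost of losing Haar's cheap add/subtract structure.
```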

14

shahaff32 OP t1_is1tuuf wrote

That is indeed possible, though at a computational cost. The Haar wavelet can be implemented very efficiently because of its simplicity.

Please see Appendix F, where we briefly discuss other wavelets and their added computational costs.

13

Ecclestoned t1_is2mf31 wrote

Nice work, will definitely check it out. You're lucky that you didn't get dinged by reviewers for not citing recent works. Some examples:

GACT: Activation Compressed Training for Generic Network Architectures

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

[AC-GC: Lossy Activation Compression with Guaranteed Convergence](https://proceedings.neurips.cc/paper/2021/hash/e655c7716a4b3ea67f48c6322fc42ed6-Abstract.html)

7

shahaff32 OP t1_is2o2yz wrote

Thank you for your interest in our work :)

We were not aware of these recent works. Thanks for sharing :) We will definitely check them out.

4

pm_me_your_ensembles t1_is1busc wrote

Could this work with 1d convolutions?

4

SearchAtlantis t1_is3pnyq wrote

Hey, my favorite wavelet! It's what I use to explain wavelets before getting into more complex families like Daubechies.

The compression (and, depending on the task, dimensionality reduction) you can get with wavelets is pretty impressive.

3

shahaff32 OP t1_is4bbd7 wrote

The Haar wavelet is also very efficient, as it can be implemented using only additions and subtractions (and maybe a few bit manipulations) :)
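
For example, the integer (lifting) form of one Haar level needs nothing beyond additions, subtractions, and a shift, and it is exactly invertible. A toy sketch, not our actual kernel:

```python
import numpy as np

x = np.random.randint(-128, 128, size=16, dtype=np.int32)

# Forward: only additions, subtractions, and a shift.
low  = (x[0::2] + x[1::2]) >> 1      # rounded-down average of each pair
high =  x[0::2] - x[1::2]            # difference of each pair

# Inverse: the same cheap operations recover the input exactly.
b = low - (high >> 1)
a = b + high
rec = np.empty_like(x)
rec[0::2], rec[1::2] = a, b
print(np.array_equal(rec, x))        # True: a lossless integer transform
```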

You can also see Appendix F where we tested several others :)

1

danny_fel t1_is4bv5g wrote

This sounds great! I'd like to try your method on a small NVIDIA Jetson setup. Do I still need to convert the "minimized" model to TFLite, or should it be good as it is?

2

shahaff32 OP t1_is4jcv4 wrote

Thanks :)

In its current state, the implementation uses only standard PyTorch operations, so it is not as efficient as it could be, and the overhead of the wavelet transforms can outweigh the speedup of the convolution.

We are currently working on a CUDA implementation to overcome that :) See Appendix H for more details.

1

danny_fel t1_is8eaky wrote

Oh thanks! Probably will play around with it! This sounds exciting from a maker/hobbyist perspective wanting to do edge applications.

2