Submitted by bjergerk1ng t3_11542tv in MachineLearning
currentscurrents t1_j8zq4tn wrote
Reply to comment by [deleted] in [D] Formalising information flow in NN by bjergerk1ng
>I wouldn’t say it’s common to design networks with information flow in mind
I disagree. The entire point of the attention mechanism in transformers is to have a second, learned network control the flow of information: the attention weights decide how much of each token's representation gets routed to every other position.
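Something like this, as a minimal sketch in PyTorch (the shapes are illustrative; the scaling and softmax follow the standard scaled dot-product formulation):

```python
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v):
    # scores: how relevant each key is to each query
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d_k ** 0.5
    # softmax turns the scores into a routing table: each row sums to 1
    weights = F.softmax(scores, dim=-1)
    # output is a weighted mix of the values -- information flows where weights are high
    return weights @ v

x = torch.randn(1, 4, 8)                      # batch 1, 4 tokens, dim 8
out = scaled_dot_product_attention(x, x, x)   # self-attention: (1, 4, 8)
```

The softmax weights are the "control" part: they're computed from the data itself, so the network learns where to send information rather than having it hard-wired.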
Similarly, the autoencoder structure is ubiquitous these days, and it's built around forcing information to flow through a bottleneck. The bottleneck is too small to pass everything through, so some information must be thrown away; the network learns which parts of the data are most important to keep, and the compressed representation ends up capturing the structure of the data.
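A minimal sketch of that bottleneck idea, assuming a plain fully-connected autoencoder in PyTorch (the 784/32 dimensions are hypothetical, e.g. a flattened 28x28 image):

```python
import torch.nn as nn

class Autoencoder(nn.Module):
    def __init__(self, in_dim=784, bottleneck=32):
        super().__init__()
        # all information must squeeze through the 32-dim bottleneck
        self.encoder = nn.Sequential(
            nn.Linear(in_dim, 128), nn.ReLU(),
            nn.Linear(128, bottleneck),
        )
        self.decoder = nn.Sequential(
            nn.Linear(bottleneck, 128), nn.ReLU(),
            nn.Linear(128, in_dim),
        )

    def forward(self, x):
        z = self.encoder(x)      # lossy compression: most of the input is discarded
        return self.decoder(z)   # reconstruction loss forces z to keep what matters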
I'd say many of the recent great ideas in the field have come from manipulating information flow in interesting ways.
Phoneaccount25732 t1_j90eyfv wrote
This is my preferred interpretation of ResNets too.
currentscurrents t1_j90hs2i wrote
Yeah, the skip connections give higher layers direct access to information from lower layers. The same goes for U-Nets; a U-Net is basically an autoencoder with skip connections between the encoder and decoder.
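In code the idea is just an identity path around the block; a minimal sketch in PyTorch (the fully-connected body is illustrative, real ResNets use convolutional layers):

```python
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(dim, dim), nn.ReLU(),
            nn.Linear(dim, dim),
        )

    def forward(self, x):
        # x + body(x): the skip is an identity path, so information from
        # lower layers flows to higher layers even if the body learns nothing
        return x + self.body(x)
```

The body only has to learn a correction on top of what already flowed through, which is why very deep stacks of these blocks stay trainable.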