[deleted] t1_j89jf91 wrote on February 12, 2023 at 5:49 PM

#1,789,552

[removed]

The-Last-Lion-Turtle t1_j89jm9s wrote on February 12, 2023 at 5:51 PM

#1,789,563

The purpose of a deep network is to approximate complex non linear functions. With relu the network is piecewise linear. Imagine slicing a space with many planes, locally it's flat, but zooming out it has a very complex shape, similar to getting a 3D model out of triangles. Each layer adds an additional linear deformation and a slice to the space.

Read the resnent paper. It's a great explanation for both why depth matters for performance and how it causes issues for training. The solution of residual connections is central to every deep learning architecture after this paper.

big_ol_tender t1_j89ruh6 wrote on February 12, 2023 at 6:46 PM

#1,789,956

If you haven’t already, I’d suggest the 3blue1brown series on neural networks on YouTube. It is the easiest introduction I’ve come across.

_Redone OP t1_j89s65u wrote on February 12, 2023 at 6:48 PM

#1,789,967

Replying to big_ol_tender (#1,789,956)

I have already but i think my question is bit deeper i didn't find the answer on that vidéo

Dylan_TMB t1_j8a0hrj wrote on February 12, 2023 at 7:44 PM

#1,790,348

If you want to be someone that understands it very deeply get REALLY good at linear algebra and REALLY good understanding of multi-variate calculus.

The not so deep answer to your questions is your understanding right now is right. You have a bunch of functions that take multiple inputs and spit out 1 output and that output is combined with other outputs to be put into other functions. Each function has parameters that can vary which changes the output. When you train you give a bunch of examples that in real life you know (hope) are related. The model learns parameters such that it maps input to output.

That's all that's happening.

Dylan_TMB t1_j8a0kuw wrote on February 12, 2023 at 7:45 PM

#1,790,355

Replying to _Redone (#1,789,967)

You might be looking for something deeper when there is nothing there.

The real concept behind deep learning [Discussion]

Comments