Comments

ML4Bratwurst t1_iygsjef wrote

No, because stacking layers is basically what gives neural networks their ability to extract high-level features.

5

bacon_boat t1_iygvgdf wrote

f(f(f(f(x)))) =/= f(x)+f(x)+f(x)+f(x)
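
To make that concrete, here is a minimal sketch. The toy layer `f` (a fixed affine map followed by ReLU) and the input values are assumptions picked purely for illustration; they are not from any real network.

```python
import numpy as np

def f(x):
    # Hypothetical toy "layer": fixed affine map followed by ReLU.
    return np.maximum(0.0, 2.0 * x - 1.0)

x = np.array([0.25, 1.0, 2.0])

# Stacking: each application of f sees the output of the previous one.
stacked = f(f(f(f(x))))

# Summing: every term sees the same raw input x.
summed = f(x) + f(x) + f(x) + f(x)

print("stacked:", stacked)
print("summed: ", summed)
print("summed equals 4 * f(x):", np.array_equal(summed, 4 * f(x)))
```

With the same f in every term, the sum is just 4 * f(x), a rescaled single layer, while the stacked version is a different function of x.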

9

eternal-abyss-77 OP t1_iygx65a wrote

Bro, but let me ask you one more question, please bear with me.

Suppose result[ f(x)+f(x)+f(x)+f(x) ] >= result[ f(f(f(f(x)))) ]

(Here "result" means the feature map, i.e. the features retained or extracted.)

Can I then conclude that both are the same?

−2

Crafty_Primary_2776 t1_iyhy4dp wrote

That’s a good point. Actually, slightly changing your question leads to the problem of neural network width vs. depth. Check out these materials:

Do Wide and Deep Networks Learn the Same Things?

Universal approximation theorem.
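
As a rough illustration of the "width" side: if the summed terms are different one-hidden-layer branches (rather than the same f repeated), their sum can be rewritten as a single, wider one-hidden-layer network. The sketch below checks that numerically. The branch shapes, the random weights, and the names `Ws`, `Vs`, `W_wide`, `V_wide` are all assumptions made up for this example.

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_hidden, d_out, branches = 3, 5, 2, 4

def relu(z):
    return np.maximum(0.0, z)

# Four hypothetical parallel branches: x -> V_i @ relu(W_i @ x)
Ws = [rng.normal(size=(d_hidden, d_in)) for _ in range(branches)]
Vs = [rng.normal(size=(d_out, d_hidden)) for _ in range(branches)]

x = rng.normal(size=d_in)

# Sum of the four branch outputs.
parallel_sum = sum(V @ relu(W @ x) for W, V in zip(Ws, Vs))

# One wide layer: stack the hidden weights, concatenate the output weights.
W_wide = np.vstack(Ws)   # shape (branches * d_hidden, d_in)
V_wide = np.hstack(Vs)   # shape (d_out, branches * d_hidden)
wide = V_wide @ relu(W_wide @ x)

same = np.allclose(parallel_sum, wide)
print("parallel sum equals one wide layer:", same)
```

So adding branches in parallel buys width, not depth. Whether a wide network and a deep one end up learning the same features is the question the first link looks at.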

2