[D] Are there any results on convergence guarantees when optimizing NNs? Submitted by Dartagnjan t3_10ee9kp on January 17, 2023 at 2:57 PM in MachineLearning 10 comments 11
rikkajounin t1_j4umb8q wrote on January 18, 2023 at 11:04 AM The following work shows that with sufficiently large width (overparameterized regime) you can have polynomial convergence to the global minimum which gets worse (but polynomially) with the depth of the network. A Convergence Theory for Deep Learning via Over-Parameterization Permalink 2
Viewing a single comment thread. View all comments