Submitted by AutoModerator t3_11pgj86 in MachineLearning
dwarfarchist9001 t1_jdd33ha wrote
Reply to comment by andrew21w in [D] Simple Questions Thread by AutoModerator
Short answer: Polynomials can have very large derivatives compared to sigmoid or rectified linear functions which leads to exploding gradients.
https://en.wikipedia.org/wiki/Vanishing_gradient_problem#Recurrent_network_model
Viewing a single comment thread. View all comments