Submitted by mikef0x t3_ylrngf in MachineLearning
vedrano- t1_iv06kdh wrote
Blaze have right, CNNs re way to go. You might even get away with fully connected model, but you should use billions of images and quite a large model to reach similar results that much smaler CNN model with much smaller dataset can.
Btw, during last epochs val loss is oscillating, meaning learning rate is too large at that particular point.
Think_Olive_1000 t1_iv0c2eb wrote
Can it also mean learning rate is too small because it could be trapped in a local minima?
vedrano- t1_iv125dq wrote
If it would be trapped in local minima (gradient vanishing), it would not change loss for a quite margin.
Viewing a single comment thread. View all comments