
ThePhantomPhoton t1_iyk2wnq wrote

I think you have a good argument for images, but language is more challenging because we rely on positional encodings (a kind of "time") to provide contextual clues that beat out the following form of statistical language model: Pr{x_{t+1} | x_0, x_1, ..., x_t} (Edit: that is, predicting the next word in a sequence given all preceding words in the sequence.)
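For concreteness, here is a minimal sketch (plain Python, function name my own) of the sinusoidal positional encodings from "Attention Is All You Need" that the comment alludes to: each position in the sequence gets a distinct vector built from sines and cosines at geometrically spaced frequencies, which is added to the token embedding so the model can condition on position.

```python
import math

def sinusoidal_positional_encoding(seq_len, d_model):
    """Return a seq_len x d_model table of sinusoidal positional encodings:
    even dimensions use sine, odd dimensions use cosine, with frequencies
    decreasing geometrically across the embedding dimension."""
    pe = [[0.0] * d_model for _ in range(seq_len)]
    for pos in range(seq_len):
        for i in range(0, d_model, 2):
            angle = pos / (10000 ** (i / d_model))
            pe[pos][i] = math.sin(angle)
            if i + 1 < d_model:
                pe[pos][i + 1] = math.cos(angle)
    return pe

# Each row is a unique position signature, so a model estimating
# Pr{x_{t+1} | x_0, ..., x_t} can tell "where" each token sits.
enc = sinusoidal_positional_encoding(seq_len=8, d_model=4)
```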
