yaru22 t1_jdn17j5 wrote
Reply to [D] Simple Questions Thread by AutoModerator
Hello,
GPT-4 has a context length of 32K tokens, while some other models have only 2-4K. What determines the limit on these context lengths? Is it simply that the bigger the model, the larger the context length? Or is it possible to have a large context length even on a smaller model like LLaMA 7/13/30B?
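To make the question concrete, here's a minimal sketch of my current understanding (illustrative numbers only; `n_ctx`, `d_model`, and `n_heads` are hypothetical values, not any specific model's config). If I understand correctly, in a GPT-2-style decoder with learned absolute positional embeddings, the only parameters tied to context length are the positional embedding table, while attention memory grows quadratically with context length:

```python
# Illustrative sketch, not any real model's config: in a GPT-2-style
# decoder, context length touches parameters only via the positional
# embedding table, but attention cost grows quadratically with it.

def pos_embedding_params(n_ctx: int, d_model: int) -> int:
    # Learned absolute positional embeddings: one vector per position.
    return n_ctx * d_model

def attn_scores_floats(n_ctx: int, n_heads: int) -> int:
    # Per layer, self-attention materializes an (n_heads, n_ctx, n_ctx)
    # score matrix, so memory/compute scale as O(n_ctx^2).
    return n_heads * n_ctx * n_ctx

for n_ctx in (2_048, 32_768):
    print(
        f"n_ctx={n_ctx:>6}: "
        f"pos-emb params={pos_embedding_params(n_ctx, d_model=4096):,}, "
        f"attn scores/layer={attn_scores_floats(n_ctx, n_heads=32):,} floats"
    )
```

If that sketch is right, the parameter cost of a longer context is small, and the quadratic attention cost is what hurts, which is why I'm unsure what actually sets the limit.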
Thank you!
yaru22 t1_jdron1b wrote
Reply to comment by LowPressureUsername in [D] Simple Questions Thread by AutoModerator
So it's not an inherent limitation imposed by the number of parameters the model has? Or is that what you meant by more processing power? Do you, or does anyone, have pointers to papers that discuss this?
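For what it's worth, here's a quick sketch of what I'm getting at (the config values are hypothetical; `GPT2Config` and its fields are from the Hugging Face transformers library). The context window appears to be a configuration hyperparameter you can set independently of width and depth, which is why I'm asking whether the limit is really about parameter count:

```python
# Hypothetical config: a deliberately tiny GPT-2-style model given a long
# context window, showing the two are set independently. Whether such a
# model would actually *use* the long context well is a separate question.
from transformers import GPT2Config, GPT2LMHeadModel

small_long_ctx = GPT2Config(
    n_embd=256, n_layer=4, n_head=4,  # tiny model (few parameters)
    n_positions=8192,                 # but a long context window
)
model = GPT2LMHeadModel(small_long_ctx)
print(sum(p.numel() for p in model.parameters()))  # small count, long context
```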