42gauge t1_j7e9twt wrote on February 6, 2023 at 3:46 AM

> I was teaching my kid how to solve arithmetic reasoning problems (not from the MultiArith dataset...

lol ..

zisyfos t1_j9j7zsk wrote on February 22, 2023 at 10:40 AM

Really interesting! What are the minimum requirements to run this?

astonzhang t1_j9scuwn wrote on February 24, 2023 at 5:15 AM

We ran experiments on 4 NVIDIA Tesla V100 32GB GPUs.

IluvBsissa t1_j9j9ml9 wrote on February 22, 2023 at 11:01 AM

Dr. Zhang, thank you so much. Could you please tell us more about your model's performance? How would it do on standard MMLU? Can it be improved by increasing the parameter count? Also, the paper didn't mention whether the human testers were average humans or experts.

astonzhang t1_j9sd3mw wrote on February 24, 2023 at 5:17 AM

The human performance was taken from the paper by Lu et al.

chinguetti t1_j9joqfu wrote on February 22, 2023 at 1:34 PM

Will make a good story when you accept your Nobel prize. Well done.

ihopeshelovedme t1_j9nhhgs wrote on February 23, 2023 at 5:53 AM

You think r/singularity will be kind enough to grant him a Nobel prize?

lwl t1_j8hoxpg wrote on February 14, 2023 at 11:42 AM

Super interesting work, thank you for sharing! If you are still active on Reddit: we noticed that the PDF is no longer available on arXiv. Are you able to say why that is?

astonzhang t1_j8kcydh wrote on February 14, 2023 at 11:02 PM

Can you check it again?

lwl t1_j8m2h7b wrote on February 15, 2023 at 8:33 AM

Ah great, thanks!!

JClub t1_jabyh73 wrote on February 28, 2023 at 9:30 AM

GPT was never trained with image data, so why is this a fair comparison? The UnifiedQA model is from 2022, so it doesn't seem fair either. Why don't we have some comparisons with other SOTA multimodal models, such as OFA or UniT?

[R] Multimodal Chain-of-Thought Reasoning in Language Models - Amazon Web Services Zhuosheng Zhang et al - Outperforms GPT-3.5 by 16% (75%->91%) and surpasses human performance on ScienceQA while having less than 1B params!

astonzhang t1_j79i4jj wrote on February 5, 2023 at 2:44 AM