
singularpanda OP t1_j3eohh7 wrote

Yes, it is quite costly. However, it doesn't seem easy to modify it for our research, as it is not open-source.

1

KBM_KBM t1_j3g7swj wrote

https://github.com/lucidrains/PaLM-rlhf-pytorch

This is similar to the ChatGPT architecture; you can play with it.

2

singularpanda OP t1_j3gdv9p wrote

Thanks! Yes, there are many similar things. But ChatGPT seems to have the most amazing performance.

1

KBM_KBM t1_j3gere2 wrote

True, but practically, training a GPT model is not computationally cheap. I think instead of making such generalized language models, we should focus more on subject-specific language models.

1

f_max t1_j3frhxs wrote

Megawatts sounds right for training, but kilowatts for inference. Take a look at Tim Dettmers' work (he's at UW) on int8 to see some of this kind of efficiency work. There's definitely significant work happening in the open.

1
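To illustrate the int8 idea mentioned above: one common approach is absmax quantization, where each float tensor is rescaled so its largest magnitude maps to 127, stored as int8, and dequantized on the fly. This is a minimal NumPy sketch of that idea, not Tim Dettmers' actual LLM.int8() method (which adds outlier handling and per-column scaling); the function names here are made up for illustration.

```python
import numpy as np

def absmax_quantize(x):
    # Rescale so the largest magnitude maps to the int8 limit (127),
    # then round to the nearest integer and store as int8.
    scale = 127.0 / np.max(np.abs(x))
    q = np.round(x * scale).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Recover approximate float values from the int8 codes.
    return q.astype(np.float32) / scale

# Toy example: 4 bytes per value (float32) shrinks to 1 byte (int8),
# at the cost of a small rounding error.
x = np.array([0.5, -1.2, 3.4, -0.01], dtype=np.float32)
q, s = absmax_quantize(x)
x_hat = dequantize(q, s)
```

The memory saving (4x over float32) is what brings large-model inference down toward the kilowatt scale discussed above; the trade-off is the rounding error visible in `x_hat`.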