
FirstOrderCat t1_iyw6ute wrote

Reply to comment by Ambiwlans in bit of a call back ;) by GeneralZain

I follow NLP/LLM papers; people will certainly release an arXiv paper, and likely submit it to a conference, with only a few % improvement.

2

Ambiwlans t1_iyw78uu wrote

What metric? A 5% reduction in errors or a 5% improvement in score? I mean, one might be a lot bigger than the other.

LLMs are basically DOA now, waiting on GPT-4 in a few months anyway, unless they offer something really novel.

4

FirstOrderCat t1_iyw8tuu wrote

Here is a recent paper where they improved the previous SOTA on GSM8K by 2 points (78 -> 80): https://arxiv.org/pdf/2211.12588v3.pdf
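As a side note on the metric question above, that same 78 -> 80 jump reads very differently depending on how you slice it. A quick back-of-the-envelope (illustrative arithmetic only, not numbers from the paper beyond the 78 and 80):

```typescript
// Illustrative arithmetic only: three ways to read the same 78 -> 80 jump.
const before = 78; // % of GSM8K problems solved before
const after = 80;  // % solved after

const absolutePoints = after - before;                                 // 2 percentage points
const relativeScoreGain = 100 * (after - before) / before;             // ~2.6% higher score
const relativeErrorReduction = 100 * ((100 - before) - (100 - after))
                                   / (100 - before);                   // ~9.1% fewer errors

console.log({ absolutePoints, relativeScoreGain, relativeErrorReduction });
```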


>LLMs are basically DOA now, waiting on GPT-4 in a few months anyway, unless they offer something really novel.

Why are you so confident? Current GPT is very far from doing any useful work; it can't replace a programmer, lawyer, or accountant. There is a huge space for improvement before these models reach some form of AGI and replace knowledge workers.

2

Ambiwlans t1_iywjrxk wrote

>Why are you so confident?

I never made any claim of strong AGI any time soon, dude. And GPT-4 certainly will not be strong AGI.

Although automation is taking jobs today.

6

FirstOrderCat t1_iywkjp6 wrote

Yes, hand-coded automation empowered by LLMs can take many jobs.

0

Madrawn t1_iyxwdi2 wrote

The current codex-davinci model from OpenAI still blows me away.

I basically asked it nicely to write me a VS Code plugin that takes the selected text, prompts the user for instructions, sends it off to the edit-API endpoint, and replaces the text with the response. That includes the changes to package.json needed to expose the setting where you put the API key, plus a prompt to fill in that setting if it's empty.

All that in around seven prompts, and in only two of them did I have to make changes: in one it fucked up a bracket, and in another it forgot to read the API key setting before checking it.

It's not perfect, you still need to be able to code to check for errors, but it's already more helpful than some of my colleagues.
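For anyone curious what that kind of plugin boils down to, here is a minimal sketch (not the code Codex actually generated): it assumes a recent VS Code extension host with global `fetch`, the since-deprecated OpenAI `/v1/edits` endpoint with the `code-davinci-edit-001` model, and a hypothetical `codexEdit.apiKey` setting; all names are illustrative.

```typescript
// extension.ts -- minimal sketch of the plugin described above.
import * as vscode from 'vscode';

export function activate(context: vscode.ExtensionContext) {
  const command = vscode.commands.registerCommand('codexEdit.editSelection', async () => {
    const editor = vscode.window.activeTextEditor;
    if (!editor || editor.selection.isEmpty) {
      vscode.window.showWarningMessage('Select some text first.');
      return;
    }

    // Read the API key from settings first (the step the model initially forgot).
    const apiKey = vscode.workspace.getConfiguration('codexEdit').get<string>('apiKey');
    if (!apiKey) {
      vscode.window.showErrorMessage('Set codexEdit.apiKey in your settings.');
      return;
    }

    // Prompt the user for the edit instruction.
    const instruction = await vscode.window.showInputBox({
      prompt: 'What should I do with the selection?',
    });
    if (!instruction) {
      return;
    }

    const input = editor.document.getText(editor.selection);

    // Send selection + instruction to the (deprecated) edits endpoint.
    const response = await fetch('https://api.openai.com/v1/edits', {
      method: 'POST',
      headers: { 'Content-Type': 'application/json', Authorization: `Bearer ${apiKey}` },
      body: JSON.stringify({ model: 'code-davinci-edit-001', input, instruction }),
    });
    const data = (await response.json()) as { choices?: { text: string }[] };
    const edited = data.choices?.[0]?.text;
    if (!edited) {
      vscode.window.showErrorMessage('No edit returned from the API.');
      return;
    }

    // Replace the selection with the API's response.
    await editor.edit(builder => builder.replace(editor.selection, edited));
  });

  context.subscriptions.push(command);
}

export function deactivate() {}
```

The matching package.json changes would just declare the command under `contributes.commands` and the `codexEdit.apiKey` setting under `contributes.configuration`.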

6