Cheap_Meeting t1_j7chivx wrote

In terms of consumer apps, the Poe app from Quora has access to two models from OpenAI and one from Anthropic.

Perplexity.ai, YouChat, and Neeva are search engines that have integrated LLMs.

Google also has an AI + Search event on Wednesday, where it will likely announce something as well.

In terms of APIs and getting a feel for these models, I would use OpenAI's APIs. Their models are the best publicly available ones; open-source models are still far behind.
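
If you just want to poke at them, a minimal call with the `openai` Python package looks roughly like this (pre-1.0 interface; the prompt and parameters are placeholders):

```python
# Minimal sketch of a completion request with the openai package (pre-1.0).
import openai

openai.api_key = "sk-..."  # your API key

response = openai.Completion.create(
    model="text-davinci-003",
    prompt="Explain instruction tuning in one sentence.",
    max_tokens=64,
    temperature=0.7,
)
print(response["choices"][0]["text"])
```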

MysteryInc152 t1_j7g83pw wrote

GLM-130B is really, really good; see the HELM core-scenarios leaderboard: https://crfm.stanford.edu/helm/latest/?group=core_scenarios

I think some instruction tuning is all it needs to match the text-davinci models.
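
To make "instruction tuning" concrete: you fine-tune the base model on (instruction, response) pairs with the usual causal-LM loss. A minimal sketch with Hugging Face transformers, using GPT-2 as a stand-in (actually tuning GLM-130B needs far heavier machinery, and the examples here are made up):

```python
# Minimal instruction-tuning sketch; GPT-2 stands in for GLM-130B,
# and the two training examples are invented for illustration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

examples = [
    {"instruction": "Translate to French: Hello.", "response": "Bonjour."},
    {"instruction": "Summarize: The cat sat on the mat.", "response": "A cat sat on a mat."},
]

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)
model.train()
for ex in examples:
    text = f"Instruction: {ex['instruction']}\nResponse: {ex['response']}"
    batch = tokenizer(text, return_tensors="pt")
    # Standard causal-LM objective: the model learns to produce the
    # response given the instruction (labels are inputs, shifted internally).
    loss = model(**batch, labels=batch["input_ids"]).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```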

Cheap_Meeting t1_j7j70tj wrote

That's not my takeaway. GLM-130B is even behind OPT according to the mean win rate, and the instruction-tuned version of OPT is in turn worse than FLAN-T5, a 10x smaller model (Table 14 of https://arxiv.org/pdf/2212.12017.pdf).
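
For context, HELM's mean win rate works roughly like this, as I understand it (the scores below are invented; only the metric logic is the point):

```python
# Rough sketch of HELM's mean win rate: per scenario, a model's win rate
# is the fraction of other models it outscores (ties counted as losses
# here for simplicity); then average across scenarios.
# Scores are made up purely to show the computation.
scores = {
    "GLM-130B":         {"scenario_a": 0.61, "scenario_b": 0.55},
    "OPT-175B":         {"scenario_a": 0.63, "scenario_b": 0.58},
    "text-davinci-002": {"scenario_a": 0.71, "scenario_b": 0.66},
}

def mean_win_rate(model: str) -> float:
    rivals = [m for m in scores if m != model]
    win_rates = [
        sum(scores[model][s] > scores[r][s] for r in rivals) / len(rivals)
        for s in scores[model]
    ]
    return sum(win_rates) / len(win_rates)

for m in scores:
    print(f"{m}: {mean_win_rate(m):.2f}")
```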

MysteryInc152 t1_j7ja39c wrote

I believe the fine-tuning dataset matters as well as the model itself, but I guess we'll see. I think they plan on fine-tuning it.

The dataset used to instruction-tune OPT doesn't contain any chain-of-thought examples.
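
To illustrate the difference (made-up data): a plain instruction example supervises only the final answer, while a chain-of-thought example also supervises the intermediate reasoning:

```python
# Made-up illustration of the two data formats.
plain = {
    "instruction": "What is 17 + 25?",
    "response": "42",
}
# A chain-of-thought example spells out the intermediate steps too.
chain_of_thought = {
    "instruction": "What is 17 + 25?",
    "response": "17 + 25 = (17 + 20) + 5 = 37 + 5 = 42. The answer is 42.",
}
```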
