Viewing a single comment thread. View all comments

Pro_RazE OP t1_j9u3sra wrote

Man announced it through Instagram channels lmao. There's no paper or anything else posted yet.

Edit: They posted. Here's the link: https://ai.facebook.com/blog/large-language-model-llama-meta-ai/?utm_source=twitter&utm_medium=organic_social&utm_campaign=llama&utm_content=blog

"Today we're publicly releasing LLAMA, a state-of-the-art foundational LLM, as part of our ongoing commitment to open science, transparency and democratized access to new research.

We trained LLaMA 65B and LLaMA 33B on 1.4 trillion tokens. Our smallest model, LLaMA 7B, is trained on one trillion tokens"

There are 4 foundation models ranging from 7B to 65B parameters. LLaMA-13B outperforms OPT and GPT-3 175B on most benchmarks. LLaMA-65B is competitive with Chinchilla 70B and PaLM 540B

From this tweet (if you want more info) : https://twitter.com/GuillaumeLample/status/1629151231800115202?t=4cLD6Ko2Ld9Y3EIU72-M2g&s=19

35

YobaiYamete t1_j9uga58 wrote

> LLaMA-65B is competitive with Chinchilla 70B and PaLM 540B

As per always with these claims lately, "I'll believe it when I can talk to it"

There's so many trying to make these big claims but then, the only we can actually talk to is ChatGPT and Bing.

26

MysteryInc152 t1_j9uhssy wrote

I think peer-reviewed research papers are a bit more than just "claims".

As much as i'd like all the SOTA research models to be usable by the public, research is research and not every research project is done with the interest of making a viable commercial product. Inference with these models are expensive. That's valid too.

Also seems like this will be released under a non commercial license like the OPT models.

37

9985172177 t1_j9ziz50 wrote

That's not true at all. You.com's chat is very strong, more than comparable to chatgpt, and it is even more open, as in you don't need to provide a phone number to use it. Plus there are other models like Bloom and so on that are far more open, as in you can download them and run them yourself and integrate them into other software.

1

YobaiYamete t1_j9zwgdi wrote

You.com is okay, but it definitely not on par with ChatGPT lol. It's running on a weaker version of GPT and you can't just talk to it the same way

CharacterAI was smarter than ChatGPT until they nerfed it into the ground, but that's issue. Everywhere that has a decent AI suddenly nerfs it until it's too useless to use

2

WarAndGeese t1_ja7z1kn wrote

Are you sure that YouChat is running on a version of GPT? (Presumably you mean openai's software.) I was speaking to a founder of a company that had some partnership with You.com and he was saying they roll their own machine learning stuff, that they (You.com) were already machine learning experts.

1

Hemanth536 t1_j9u74e3 wrote

Looks like Channels might become new type of blogs for companies and influencers to announce something

2

Pro_RazE OP t1_j9ua28q wrote

Maybe. It is their latest addition to Instagram, so it makes sense him using it to announce new stuff. As this will inspire some to do the same.

1