timy2shoes t1_iunym8r wrote
We've been testing out their embeddings for transfer learning tasks and they've been performing quite well, better than the previous embeddings we've tested. The 15B-parameter model, though, is a pain in the ass: getting the embeddings out of it requires a workaround that is difficult to implement. Probably not worth it, in my opinion.
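For anyone who wants to try the embeddings themselves, here's a minimal sketch of extracting per-sequence embeddings with the fair-esm package's documented API; the checkpoint, layer, and mean pooling are illustrative choices, not necessarily what we use:

```python
import torch
import esm  # pip install fair-esm

# Load a smaller ESM-2 checkpoint (the 15B one is esm2_t48_15B_UR50D)
model, alphabet = esm.pretrained.esm2_t33_650M_UR50D()
batch_converter = alphabet.get_batch_converter()
model.eval()

data = [("seq1", "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ")]
_, _, batch_tokens = batch_converter(data)

with torch.no_grad():
    out = model(batch_tokens, repr_layers=[33])  # final layer of the t33 model
token_reprs = out["representations"][33]

# Mean-pool over residue positions (dropping BOS/EOS) for one vector per sequence
seq_len = len(data[0][1])
embedding = token_reprs[0, 1 : seq_len + 1].mean(dim=0)
```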
nivrams_brain t1_iuo3l24 wrote
What kind of downstream tasks are you looking at?
timy2shoes t1_iuo4xa4 wrote
ML-guided protein engineering.
nivrams_brain t1_iuo8d1h wrote
Sounds cool, are you in academia or industry?
timy2shoes t1_iuoafb9 wrote
Industry
MangoGuyyy t1_iuq81od wrote
What company? I’m curious
ROFLLOLSTER t1_iupsskm wrote
> requires a workaround that is difficult to implement
What workaround? I've also been working with ESM and tried the 15B parameter variant. It seemed worse than the 3B in my tests, but maybe I just missed the problem?
timy2shoes t1_iuptv7y wrote
We had to use a workaround to fit the 15B-parameter model on a p3.8xlarge instance.
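The core problem is memory: the fp32 weights alone are around 60 GB, so on a p3.8xlarge (4x 16 GB V100s) you need at least half precision plus sharding the layers across GPUs. A sketch of one such setup, using the HuggingFace port of ESM-2 with accelerate's device_map (illustrative, not our exact code):

```python
import torch
from transformers import AutoTokenizer, EsmModel

# facebook/esm2_t48_15B_UR50D is the HF port of the 15B checkpoint.
# device_map="auto" (requires the `accelerate` package) spreads the layers
# across all visible GPUs; fp16 halves the ~60 GB fp32 footprint.
tokenizer = AutoTokenizer.from_pretrained("facebook/esm2_t48_15B_UR50D")
model = EsmModel.from_pretrained(
    "facebook/esm2_t48_15B_UR50D",
    torch_dtype=torch.float16,
    device_map="auto",
)
model.eval()

inputs = tokenizer(
    "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ", return_tensors="pt"
).to("cuda:0")
with torch.no_grad():
    out = model(**inputs)

# Mean-pool the residue representations (dropping the BOS/EOS tokens)
embedding = out.last_hidden_state[0, 1:-1].mean(dim=0)
```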
> I've also been working with ESM and tried the 15B parameter variant.
Huh. We’ve noticed the same thing. Interesting that others are seeing it too.
Mister_Abc t1_iur4gme wrote
First author here. We've had some indication that the 15B model may be overfit. It seemed to slightly improve on a few important metrics (CASP14), which is why we included it.