Submitted by adventurousprogram4 t3_zyiib1 in MachineLearning

Has the research community embraced any of the frameworks or findings published by Anthropic at all? Google Scholar seems to indicate no, but I'm curious. I work on the applied side and not on the research side, so I don't have a good sense for how influential their work on interpretability is.

The motivation for my question is that they have a huge amount of funding (although how long that will last after SBF's downfall remains to be seen), plenty of press attention, and plenty of fans in the rationalist/EA communities, but my feeling is that their work is largely not being adopted or cited in AI research. If I'm right about that, is it because the work is seen as unoriginal, incorrect, or misguided? Or is something else going on?

41

Comments

Ready-Farmer7451 t1_j2620xk wrote

The research is fine.

  1. LLM research is really new.
  2. LLM research is in general a bit shallow and not that interesting.
  3. The ability to do large-scale LLM research is limited to a few labs, so there aren't many that could cite them. The few labs that can tend to focus on their own work rather than the work of others.
24

AGI_aint_happening t1_j26847d wrote

As a former interpretability researcher who has skimmed their work but not read it closely, I just don't find it terribly interesting or novel. Also, frankly, I find the writing style for the papers pretty hard to parse (as they don't follow standard paper formats) and a tad grandiose, as they tend to avoid standard things like comparing against other methods or citing other work. Relatedly, I think their choice to avoid peer review has impacted how people perceive their work, and limited its distribution.

48

veejarAmrev t1_j2724im wrote

As you said, it has something of a cult following in the EA community. Outside of that, no one bothers. They haven't done anything significant enough to be of value to the community.

20

thejaminator t1_j286yqs wrote

I think it's more that they're still pretty new and comparatively unknown.

They have done good work, like releasing their paper and dataset for training an assistant model with RLHF: https://github.com/anthropics/hh-rlhf

You won't get a dataset like that from OpenAI. It's useful for anyone who wants to experiment with RLHF on LLMs, which matters given how much success OpenAI is having with RLHF in InstructGPT and ChatGPT.
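For anyone who wants to poke at it, here's a minimal sketch of loading the preference pairs, assuming the Hugging Face mirror Anthropic/hh-rlhf is available (the GitHub repo ships the same records as gzipped JSONL if you'd rather read those directly):

```python
# Minimal sketch: load the HH-RLHF preference data.
# Assumes the Hugging Face mirror "Anthropic/hh-rlhf" exists; otherwise the
# GitHub repo linked above provides the same data as gzipped JSONL files.
from datasets import load_dataset

hh = load_dataset("Anthropic/hh-rlhf")  # splits: "train" and "test"

example = hh["train"][0]
# Each record pairs two full dialogues sharing the same human prompt:
# "chosen" is the response annotators preferred, "rejected" is the alternative.
print(example["chosen"][:300])
print(example["rejected"][:300])
```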

14

KvanteKat t1_j291xw0 wrote

I'm not sure reading LessWrong will necessarily dissuade someone who is already a bit sceptical of the rationalist/EA community from believing that there is something culty going on. One of the things that really rubbed me the wrong way about that blog back in the day (I'll be up front and say I haven't kept up with it for the past 10 years) was exactly how insular a lot of the writing was, and how little it seriously engaged with existing literature and research, in favor of reinventing the wheel and relying on a private vocabulary that nobody else working in similar fields used. As an example, Yudkowsky is far from the first person to promote naive Bayesianism (basically the idea that if you get good enough at applying Bayes' rule, you will have solved the problem of induction), but if you only read his blog back then, you could easily come away believing he was doing groundbreaking work on the topic, which was far from the case.

10

nic001a t1_j2941zb wrote

Not an expert, but wishing you the best of luck!

−4

papajan18 t1_j29cxmw wrote

Chris Olah's work is very solid, actually some of the best interpretability work I've seen. I haven't heard of anyone else there in particular.

6

Hyper1on t1_j2dzz01 wrote

A bit early to say, but I'd be willing to bet that most of their major papers this year will be widely cited. Their work on RLHF, including Constitutional AI and HH, seems particularly likely to be picked up by other industry labs, since it provides a way to improve LLMs deployed in the wild while reducing the cost of collecting human feedback data.

2

Flag_Red t1_j2ek25d wrote

I, personally, don't consider LessWrong a cult (I lurk the blog, and have even been to an ACX meetup). There's definitely a very insular core community, though, which regularly gets caught up in cults of personality. Yudkowsky is the most obvious person to point to here, but Leverage Research is the best example of cult behaviour coming out of LessWrong and the EA community, IMO.

With regard to machine learning in particular, there are some very extreme views about the mid- and long-term prospects of AI. Yudkowsky himself explicitly believes humanity is doomed and that AI will take over the world within our lifetimes.

3