Posts

Robustness of Model-Graded Evaluations and Automated Interpretability · 2023-07-15T19:12:48.686Z

Comments

Comment by viluon on GPTs are Predictors, not Imitators · 2023-04-16T11:04:06.549Z · LW · GW

I'd really like to see Eliezer engage with this comment, because it looks to me like the well-foundedness of the following sentence is rightly being questioned:

it's naked mathematical truth that the task GPTs are being trained on is harder than being an actual human.

While I generally agree that powerful optimizers are dangerous, the fact that the GPT task and the "being an actual human" task are somewhat different has nothing to do with it.

Comment by viluon on Lies Told To Children · 2022-04-17T10:30:09.796Z · LW · GW

I'm delighted that I read the post in the curated newsletter without noticing who the author was, and only then decided to head here to upvote it. I wonder if moving the authorship information to the end of the newsletter would influence readers' willingness to upvote posts – perhaps an opportunity for an A/B test?

Comment by viluon on Outside the Laboratory · 2020-08-23T22:12:09.764Z · LW · GW

Indeed.