What would a post that argues against the Orthogonality Thesis that LessWrong users approve of look like?

post by Thoth Hermes (thoth-hermes) · 2023-06-03T21:21:48.602Z · LW · GW · 2 comments

This is a question post.

This question is motivated by the following reasons:

Not many pieces exist that argue against the Orthogonality Thesis (on LessWrong, or anywhere, to my knowledge). Of those that do, none have received positive feedback.

Commenters on those pieces have stated that it is not, in principle, impossible that they would up-vote such a piece, only that none thus far have met or exceeded their standards for what they would consider to be a successful attempt (even if they were not ultimately persuaded by the arguments).

What attributes would a "successful attempt" (even one that does not persuade you to disbelieve the Orthogonality Thesis) have?

Answers

answer by Charlie Steiner · 2023-06-03T21:38:23.555Z · LW(p) · GW(p)

There are plenty of good posts that contradict a "strict" orthogonality thesis by showing correlation between capabilities and various values-related properties (scaling laws / inverse scaling laws).

What really gets you downvoted is the claim that super-intelligent AI cannot want things that are bad for humanity, or even agitating that we should give that idea serious weight.

What also gets you downvoted is the in-between claim that all the scaling laws tend towards superhuman morality and everything will work out fine, no need to be worried or spend lots of hours working.

How to make a successful piece in the latter categories? Simple - just be right, for communicable reasons. Simple, but maybe not possible.

2 comments

Comments sorted by top scores.

comment by Shmi (shminux) · 2023-06-03T23:58:14.632Z · LW(p) · GW(p)

Thoughtfully engaging with the existing body of literature might help. Show that you understand the claims, the counter-claims, the arguments for and against. Show that your argument is novel and interesting, not something that has been already put forward and critiqued numerous times. Basically, whatever makes a good scientific paper.

comment by green_leaf · 2023-06-04T01:10:21.592Z · LW(p) · GW(p)

It would bring on an enormous amount of new evidence, since the position of the orthogonality thesis is so strong (rather than arguing from some vague and visibly false philosophical assumptions).
