Outcome Terminology?post by Dach · 2020-09-14T18:04:05.048Z · score: 6 (3 votes) · LW · GW · None comments
This is a question post.
I'm writing a post about S-risks, and I need access to some clean, established terminology/background material for discussing AI-based long-term outcomes for humanity.
My current (very limited) vocabulary can be summarized with the following categories:
- Outcomes which are roughly maximally bad: Hyperexistential risk/S-risk/Unfriendly AI/Existential risk
- Outcomes which are nontrivially worse than paperclipping-equivalents but better than approximate minimization of human utility: Hyperexistential risk/S-risk/Unfriendly AI/Existential risk
- Outcomes which are produced by agents essentially orthogonal to human values: Paperclipping/Unfriendly AI/Existential risk
- Outcomes which are nontrivially better than paperclipping but worse than Friendly AI: ???
- Outcomes which are roughly maximally good: Friendly AI
The problems are manifold:
- I haven't read any discussion which specifically addresses parts 1 or 2. I have read general discussion of parts 1 and 2 combined under the names of "Outcomes worse than death", "Hyperexistential risk", "S-risk", etc.
- My current terminology overlaps too strongly to use to uniquely identify outcomes 1 and 2.
- I have no terminology or background information for outcome 4.
I've done a small amount of investigation and determined less brainpower would be wasted by just asking for links.
Comments sorted by top scores.