Interviews with 97 AI Researchers: Quantitative Analysis

post by Maheen Shermohammed (msherm), Vael Gates · 2023-02-02T01:01:32.087Z · LW · GW · 0 comments

Contents

  Overview
  Findings Summary
  Implications for Field Building
  More Information, Full Report, and Further Posts
  Contributors

TLDR: Last year, Vael Gates interviewed 97 AI researchers about their perceptions of the future of AI, focusing on risks from advanced AI. Among other questions, researchers were asked about the alignment problem, the problem of instrumental incentives, and their interest in AI alignment research. When followed up with 5-6 months later, 51% of interviewees reported that the interview had a lasting effect on their beliefs. Our new report analyzes these interviews in depth. We describe our primary results and some implications for field-building below. Check out the full report (interactive graph version), a complementary writeup [EA · GW] describing whether we can predict a researcher’s interest in alignment, and our results below!

[Link to post on the EA Forum [EA · GW]]

Overview

This report (interactive graph version) is a quantitative analysis of 97 interviews conducted in Feb-March 2022 with machine learning researchers, who were asked about their perceptions of artificial intelligence (AI) now and in the future, with particular focus on risks from advanced AI systems. Of the interviewees, 92 were selected from NeurIPS or ICML 2021 submissions, and 5 were informally recommended experts. For each interview, a transcript was generated, and common responses were identified and tagged to support quantitative analysis. The transcripts, as well as a qualitative walkthrough of common perspectives, are available at Interviews.

Several core questions were asked in these interviews, covering topics such as researchers' timelines to advanced AI, the alignment problem, the problem of instrumental incentives, and their interest in AI alignment research.

Findings Summary

Some key findings from our primary questions of interest:

(The timeline responses sum to more than 100% because several people endorsed multiple timelines over the course of the conversation.) (Source)

Common categories of response to the alignment problem argument included:

  1. A set of responses that included the idea that AI alignment problems would be solved over the normal course of AI development (caveat: this was a very heterogeneous tag).
  2. Pointing out that humans have alignment problems too (so the potential risk of the AI alignment problem is capped in some sense by how bad alignment problems are for humans).
  3. AI systems will be tested (and humans will catch issues and implement safeguards before systems are rolled out in the real world).
  4. The objective function will not be designed in a way that causes the alignment problem / dangerous consequences of the alignment problem to arise.
  5. Perfect alignment is not needed.

Implications for Field Building

While the current report summarizes responses to the primary questions asked in the interview, there is also an accompanying writeup of how responses interact with each other and with other participant features: i.e., whether we can use demographics, or other information about the researchers, to predict their interest (or lack thereof) in AI alignment. Take a look here: Predicting researcher interest in AI alignment [EA · GW].

More Information, Full Report, and Further Posts

The above is a relatively concise description of the interviews and results. To read more, see the full report (interactive graph version), the transcripts and qualitative walkthrough at Interviews, and the accompanying writeup, Predicting researcher interest in AI alignment [EA · GW].

We welcome feedback!

Contributors

Analysis and writing by Maheen Shermohammed, with help from Vael Gates. 

Interviews were conducted by Vael Gates, with guidance from Mary Collier Wilks.

Tagging was completed by Zi Cheng (Sam) Huang and Vael Gates.

Copyediting of the report by David Spearman, and copyediting of this writeup by Lukas Trötzmüller.

This project was supported by the AI Safety Field-Building Hub [EA · GW]. 

  1. ^

    The eagle-eyed may note that this is out of 30, despite an earlier statement that only 22% (i.e., 21 people) thought humanity would never develop AGI. That's because the 9 respondents who also had timeline tags (meaning they also expressed some belief that AGI would be developed) were removed for the 22% estimate. (Source)
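    (To make the arithmetic explicit: 30 − 9 = 21 respondents, and 21 / 97 ≈ 22% of the interviewees.)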
