TTS audio of "Ngo and Yudkowsky on alignment difficulty"

post by Quintin Pope (quintin-pope) · 2021-11-28T18:11:38.498Z · LW · GW · 3 comments

My impression is that some people were put off by the length of the articles in Late 2021 MIRI Conversations [? · GW]. Personally, I've used my iPhone's text-to-speech functionality to listen to these and similarly long LessWrong posts as I do other things. After someone else commented on how convenient that seemed, I thought I should try posting a text-to-speech audio version of "Ngo and Yudkowsky on alignment difficulty [? · GW]" and see if that made the content more accessible. 

If you find TTS audio versions of longer posts helpful or have other feedback, please let me know. I'm planning to generate TTS versions of the other MIRI conversations after getting feedback here. In the future, we may even want some sort of integrated TTS service for long LessWrong posts. Edit: thanks to Steven Byrnes [LW · GW] for pointing out that we already have such a service from the Nonlinear Library [LW · GW]. Here's their version of "Ngo and Yudkowsky on alignment difficulty".

Here is a SoundCloud link for my version.

The mp3 files are available at this Google Drive folder.

I generated the audio files with Amazon Polly using the neural version of the English/US voice Joanna. 

Following TTS audio of technical discussions is difficult at first. I've used my iPhone's TTS for years, and it still took me a few minutes to adapt to the Amazon voice. I suggest listening for at least 10 minutes, and not getting too invested in following all the details, especially at first.

I've striped out the timestamps on the posts, since they're difficult to follow and distracting in an audio-only format. If any of the participants would like me to add them back, make other minor changes, or remove this post entirely, I'd be happy to oblige.

3 comments

Comments sorted by top scores.

comment by Steven Byrnes (steve2152) · 2021-11-28T18:37:08.838Z · LW(p) · GW(p)

Cool! Just curious: Is there something wrong with the Nonlinear Library [LW · GW] version, or had you not heard of Nonlinear Library, or did Nonlinear Library not do those posts?

Replies from: ea247, quintin-pope
comment by KatWoods (ea247) · 2021-11-29T18:37:24.609Z · LW(p) · GW(p)

To be fair, there was indeed something wrong with our version! It was so long it messed up our system and we've only now fixed it and it's released in three parts.  Along with the other Eliezer, Richard, and Paul conversations 

comment by Quintin Pope (quintin-pope) · 2021-11-28T18:54:28.769Z · LW(p) · GW(p)

I'd not known of the Nonlinear Library. Thank you for letting me know about it!

The Nonlinear Library mostly fills the gap I was aiming to address with this post and is more or less what I was suggesting with "In the future, we may even want some sort of integrated TTS service for long LessWrong posts."

I do think the Nonlinear Library versions could benefit from some more pre-processing. E.g., removing the exact timestamps of the discussion posts. Also maybe adding descriptions for figures/images in the original texts.