Posts
Comments
I'm curious about if a good "hero-GPT" or "alignment-research-support-GPT" could be useful today or with slightly improved tech. Of course having something like this run autonomously is not without risk, but might be quite valuable/important in the sub-critical AI era.
Hey Valentine, I really like this post. I think it hits on some key things that traditional LW culture was missing for a while. Was wondering if you've ever encountered The Conscious Leadership Group (https://conscious.is/)- they explicitly train some techniques similar to what you're describing here (as well as some quite different ones).
Cool, thanks for sharing! Hadn't heard of Metaphor before.
I might be able to code up an 'editing' pass to catch things like that!
:)
Have spent some time playing with reversible CAs, and can confirm that they are very interesting. They are a great example of how provable high-level properties (things like conservation of gliders) can come out of low level properties (reversibility).
This is absolutely hilarious, thank you for the post.
Great answer, thanks!
Thanks for the post! I think asking AI Capabilities researchers to stop is pretty reasonable, but I think we should be especially careful not to alienate the people closest to our side. E.g. consider how the Protestants and Catholics fought even though they agree on so much.
I like focusing on our common ground and using that to win people over.
Please comment! Excited to hear everyone’s thoughts and feedback on these ideas.
Guidelines: please try to keep it positive and constructive, even when providing critical feedback. But my door is open for anything!
Eliezar- I love the content, but similar to some other commenters, I think you are missing the value (and rationality) of positivity. Specifically, when faced with an extremely difficult challenge, assume that you (and the other smart people who care about it) have a real shot at solving it! This is the rational strategy for a simple reason: if you don’t have a real shot at solving it then you haven’t lost anything anyway. But if you do have a real shot at solving it, then let’s all give it our 110%!
I’m not proposing being unrealistic about the challenges we face - I’m as concerned as you are. But I believe thinking this way and inviting the community and our broader society to work together on this challenge is part of Good Strategy
This looks awesome! Would love to chat about being involved in some way.
Valid concern. I would say (1) keep our research results very secret (2) hire people that are fairly aligned? But I agree that’s not a sure fire solution at all.