Can AI agents learn to be good?

post by Ram Rachum (ram@rachum.com) · 2024-08-29T14:20:04.336Z · LW · GW · 0 comments

This is a link post for https://futureoflife.org/ai-research/can-ai-agents-learn-to-be-good/

Contents

No comments

Hi everyone!

My name is Ram Rachum and I'm working on AI Safety research. I want to elicit social behavior in RL agents and use it to achieve AI Safety goals such as alignment, interpretability and corrigibility.

I made a guest post on the Future of Life Institute's blog: https://futureoflife.org/ai-research/can-ai-agents-learn-to-be-good/

This isn't specifically about my research, as it's mostly geared towards the public so it's pretty basic. I do have a plug for my latest paper at the bottom. This is my first public writing on AI Safety, so I'd appreciate any comments or corrections.

I'm currently raising funding for my research. If you know of relevant funders, I'd appreciate a connection.

0 comments

Comments sorted by top scores.