Posts

LifeKeeper Diaries: Exploring Misaligned AI Through Interactive Fiction 2024-11-09T20:58:09.182Z

Comments

Comment by Tristan Tran (tristan-tran) on LifeKeeper Diaries: Exploring Misaligned AI Through Interactive Fiction · 2024-11-10T12:11:15.690Z · LW · GW

That should be the error message. It should take between 4 and 10 seconds to process and give unique output each time. Maybe try a different browser? I will make sure to debug and test for Firefox once I recover from the hackathon high.

Comment by Tristan Tran (tristan-tran) on Optimizing for Agency? · 2024-05-30T23:50:05.153Z · LW · GW

With optimization, I'm always concerned about the interactions of multiple agents. Are there any ways in this system that two or more agents could form cartels and increase each other's agency? I see this happen with some reinforcement learning models: if some edge cases aren't covered, they will just mine each other for easy points thanks to how we set up the reward function.