"AIXIjs: A Software Demo for General Reinforcement Learning", Aslanides 2017

post by gwern · 2017-05-29T21:09:53.566Z · LW · GW · Legacy · 1 comments

This is a link post for https://arxiv.org/abs/1705.07615

Contents

1 comment

1 comments

Comments sorted by top scores.

comment by gwern · 2017-05-29T21:10:04.247Z · LW(p) · GW(p)

Source repo: https://github.com/aslanides/aixijs

Live demos: http://aslanides.io/aixijs/demo.html

This is relevant as they can be used to implement AI risk demos in the browser and visualize them. It already includes two demos of wireheading and noise-seeking agents going wrong.