"AIXIjs: A Software Demo for General Reinforcement Learning", Aslanides 2017
post by gwern · 2017-05-29T21:09:53.566Z · LW · GW · Legacy · 1 commentsThis is a link post for https://arxiv.org/abs/1705.07615
Contents
1 comment
1 comments
Comments sorted by top scores.
comment by gwern · 2017-05-29T21:10:04.247Z · LW(p) · GW(p)
Source repo: https://github.com/aslanides/aixijs
Live demos: http://aslanides.io/aixijs/demo.html
This is relevant as they can be used to implement AI risk demos in the browser and visualize them. It already includes two demos of wireheading and noise-seeking agents going wrong.