Posts
2025 Q1 Pivotal Research Fellowship (Technical & Policy)
2024-11-12T10:56:24.858Z
Apply to the Pivotal Research Fellowship (AI Safety & Biosecurity)
2024-04-10T12:08:30.849Z
Understanding mesa-optimization using toy models
2023-05-07T17:00:52.620Z
How harmful are improvements in AI? + Poll
2022-02-15T18:16:07.854Z
Comments
Comment by
tilmanr (tilman-ra) on
How harmful are improvements in AI? + Poll ·
2022-02-21T11:58:05.216Z ·
LW ·
GW
Thank you for giving more context to EleutherAI's stance on acceleration and linking to your newest paper.
I support the claim that your open model contributes to AI safety research, and I generally agree with the improvements for the alignment landscape. I can also understand why you are not detailing possible failure modes of realising LLM, as this would basically be stating a bunch of infohazards.
But at least for me, this opens the space for discussing until which point to open up previously closed models for the sake of alignment research. If an aligned researcher can benefit from access, so could a non-aligned researcher, hence the " accidental acceleration."