LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] A blog post is a very long and complex search query to find fascinating people and make them route interesting stuff to your inbox
Henrik Karlsson (henrik-karlsson) · 2022-10-05T19:07:55.069Z · comments (12)
How Specificity Works
Liron · 2019-09-03T12:11:36.216Z · comments (47)
[link] Linkpost: A Post Mortem on the Gino Case
Linch · 2023-10-24T06:50:42.896Z · comments (7)
Public Call for Interest in Mathematical Alignment
Davidmanheim · 2023-11-22T13:22:09.558Z · comments (9)
Omicron Post #6
Zvi · 2021-12-13T18:00:01.098Z · comments (30)
Omicron Post #9
Zvi · 2021-12-23T21:50:10.466Z · comments (11)
Coordination Skills I Wish I Had For the Pandemic
Raemon · 2021-11-13T23:32:11.510Z · comments (9)
AI Safety Needs Great Engineers
Andy Jones (andyljones) · 2021-11-23T15:40:18.358Z · comments (43)
Gifts Which Money Cannot Buy
johnswentworth · 2020-11-04T19:37:57.451Z · comments (8)
Rationality Exercises Prize of September 2019 ($1,000)
Ben Pace (Benito) · 2019-09-11T00:19:51.488Z · comments (18)
AI #73: Openly Evil AI
Zvi · 2024-07-18T14:40:05.770Z · comments (20)
Qualities that alignment mentors value in junior researchers
Akash (akash-wasil) · 2023-02-14T23:27:40.747Z · comments (14)
Communicating effectively under Knightian norms
Richard_Ngo (ricraz) · 2023-04-03T22:39:58.350Z · comments (54)
Mazes Sequence Roundup: Final Thoughts and Paths Forward
Zvi · 2020-02-06T16:10:00.405Z · comments (28)
[link] Executable philosophy as a failed totalizing meta-worldview
jessicata (jessica.liu.taylor) · 2024-09-04T22:50:18.294Z · comments (40)
Thoughts on ADHD
romeostevensit · 2020-10-07T20:46:24.827Z · comments (16)
BCIs and the ecosystem of modular minds
beren · 2023-07-21T15:58:27.081Z · comments (14)
Dragon Agnosticism
jefftk (jkaufman) · 2024-08-01T17:00:06.434Z · comments (60)
Optimization at a Distance
johnswentworth · 2022-05-16T17:58:25.253Z · comments (16)
Consider using reversible automata for alignment research
Alex_Altair · 2022-12-11T01:00:24.223Z · comments (30)
Covert Malicious Finetuning
Tony Wang (tw) · 2024-07-02T02:41:51.698Z · comments (4)
Automating Auditing: An ambitious concrete technical research proposal
evhub · 2021-08-11T20:32:41.487Z · comments (13)
China Covid Update #1
Zvi · 2022-04-11T13:40:01.663Z · comments (22)
[Site Update] Subscriptions, Bookmarks, & Pingbacks
Ruby · 2019-10-29T04:32:31.109Z · comments (23)
Formula for Dying Babies
Zvi · 2022-05-17T16:50:01.780Z · comments (12)
Singular learning theory: exercises
Zach Furman (zfurman) · 2024-08-30T20:00:03.785Z · comments (5)
A Critique of Functional Decision Theory
wdmacaskill · 2019-09-13T19:23:22.532Z · comments (56)
Towards Multimodal Interpretability: Learning Sparse Interpretable Features in Vision Transformers
hugofry · 2024-04-29T20:57:35.127Z · comments (8)
Conditional Prediction with Zero-Sum Training Solves Self-Fulfilling Prophecies
Rubi J. Hudson (Rubi) · 2023-05-26T17:44:35.575Z · comments (13)
Funds are available to support LessWrong groups, among others
Buck · 2021-07-21T01:11:29.981Z · comments (3)
[link] Why we're not founding a human-data-for-alignment org
L Rudolf L (LRudL) · 2022-09-27T20:14:45.393Z · comments (5)
In defense of flailing, with foreword by Bill Burr
lc · 2022-06-17T16:40:32.152Z · comments (6)
Less Threat-Dependent Bargaining Solutions?? (3/2)
Diffractor · 2022-08-20T02:19:11.405Z · comments (7)
Can we efficiently distinguish different mechanisms?
paulfchristiano · 2022-12-27T00:20:01.728Z · comments (30)
The alignment problem in different capability regimes
Buck · 2021-09-09T19:46:16.858Z · comments (12)
[question] What should experienced rationalists know?
sapphire (deluks917) · 2020-10-13T17:32:32.388Z · answers+comments (18)
Teaching CS During Take-Off
andrew carle (andrew-carle) · 2024-05-14T22:45:39.447Z · comments (13)
Conditioning Predictive Models: Large language models as predictors
evhub · 2023-02-02T20:28:46.612Z · comments (4)
Luna Lovegood and the Chamber of Secrets - Part 9
lsusr · 2020-12-20T09:22:55.770Z · comments (15)
On the abolition of man
Joe Carlsmith (joekc) · 2024-01-18T18:17:06.201Z · comments (18)
Immanuel Kant and the Decision Theory App Store
Daniel Kokotajlo (daniel-kokotajlo) · 2022-07-10T16:04:04.248Z · comments (12)
[link] I found >800 orthogonal "write code" steering vectors
Jacob G-W (g-w1) · 2024-07-15T19:06:17.636Z · comments (19)
[link] Techno-humanism is techno-optimism for the 21st century
Richard_Ngo (ricraz) · 2023-10-27T18:37:39.776Z · comments (5)
Stagewise Development in Neural Networks
Jesse Hoogland (jhoogland) · 2024-03-20T19:54:06.181Z · comments (1)
[link] Debating with More Persuasive LLMs Leads to More Truthful Answers
Akbir Khan (akbir-khan) · 2024-02-07T21:28:10.694Z · comments (14)
Some thoughts on criticism
Buck · 2020-09-18T04:58:37.042Z · comments (11)
An Analogy for Understanding Transformers
CallumMcDougall (TheMcDouglas) · 2023-05-13T12:20:25.688Z · comments (6)
Meditation Retreat: Immoral Mazes Sequence Introduction
Zvi · 2019-12-28T00:50:01.078Z · comments (16)
[link] Podcast with Oli Habryka on LessWrong / Lightcone Infrastructure
DanielFilan · 2023-02-05T02:52:06.632Z · comments (20)
Reacts now enabled on 100% of posts, though still just experimenting
Ruby · 2023-05-28T05:36:40.953Z · comments (73)
← previous page (newer posts) · next page (older posts) →