LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Synthetic Neuroscience
hpcfung · 2025-03-25T17:45:05.916Z · comments (3)
Evaluating Collaborative AI Performance Subject to Sabotage
Matthew Khoriaty (matthew-khoriaty) · 2025-04-18T19:33:41.547Z · comments (0)
[link] SecureDrop review
samuelshadrach (xpostah) · 2025-04-19T04:29:32.270Z · comments (0)
[link] The System Didn’t, and Doesn’t Need to be This Way ~ Thomas Paine on Economic Justice
James Stephen Brown (james-brown) · 2025-04-19T05:16:05.682Z · comments (3)
[link] Developing AI Safety: Bridging the Power-Ethics Gap (Introducing New Concepts)
Ronen Bar (ronen-bar) · 2025-04-20T04:40:42.983Z · comments (0)
Recommender Alignment for Lock-In Risk
alamerton · 2025-03-24T12:56:46.389Z · comments (0)
Eulogy to the Obits
Niko_McCarty (niko-2) · 2025-04-21T14:10:27.211Z · comments (1)
Feature-Based Analysis of Safety-Relevant Multi-Agent Behavior
Maria Kapros (maria-kapros) · 2025-04-21T18:12:13.548Z · comments (0)
[link] A Letter to His Highness Louis XV, the King of France
testingthewaters · 2025-04-22T00:51:12.090Z · comments (0)
[question] Share AI Safety Ideas: Both Crazy and Not. №2
ank · 2025-03-28T17:22:22.814Z · answers+comments (10)
Proposal for a Post-Labor Societal Structure to Mitigate ASI Risks: The 'Game Culture Civilization' (GCC) Model
Beyond Singularity (beyond-singularity) · 2025-03-29T11:31:04.894Z · comments (0)
[question] Is AGI actually that likely to take off given the world energy consumption?
StanislavKrym · 2025-03-27T23:13:14.959Z · answers+comments (2)
Baltimore – ACX Meetups Everywhere Spring 2025
Rivka (rivka) · 2025-03-25T23:49:10.759Z · comments (0)
[link] Bias Mitigation in Language Models by Steering Features
akankshanc · 2025-04-12T00:10:16.878Z · comments (0)
Brooklyn – ACX Meetups Everywhere Spring 2025
sleno · 2025-03-25T23:49:47.414Z · comments (0)
Buffalo – ACX Meetups Everywhere Spring 2025
Sarah W. (sarah w. ) · 2025-03-25T23:49:50.257Z · comments (0)
Cairo, Egypt – ACX Meetups Everywhere Spring 2025
Mostafa Shahat (mostafa shahat) · 2025-03-25T23:49:50.423Z · comments (0)
Wollongong – ACX Meetups Everywhere Spring 2025
Andy Bachler (andy bachler) · 2025-03-25T23:50:51.156Z · comments (0)
Williamsburg – ACX Meetups Everywhere Spring 2025
Jough (jough) · 2025-03-25T23:50:51.287Z · comments (0)
Prague – ACX Meetups Everywhere Spring 2025
JK (jk-1) · 2025-03-25T23:50:52.543Z · comments (0)
Mexico City – ACX Meetups Everywhere Spring 2025
EddieMorra · 2025-03-26T00:11:38.225Z · comments (0)
Santiago – ACX Meetups Everywhere Spring 2025
xenophile · 2025-03-26T00:11:40.469Z · comments (0)
I’m headed to DC this week. any tips?
Wes R · 2025-04-19T02:33:18.584Z · comments (0)
Sam Altman's sister claims Sam sexually abused her -- Part 10: responses from Sam and his family members; my perspective
pythagoras5015 (pl5015) · 2025-03-31T12:26:25.256Z · comments (1)
Singapore – ACX Meetups Everywhere Spring 2025
Ewan (YM) · 2025-03-26T00:11:41.563Z · comments (0)
José Ignacio – ACX Meetups Everywhere Spring 2025
deadpine · 2025-03-26T00:11:43.616Z · comments (0)
Jos – ACX Meetups Everywhere Spring 2025
jibrinx · 2025-03-26T00:11:45.139Z · comments (0)
Astral Codex Ten meetup in Kaliningrad (Russia)
YanLyutnev (YanLutnev) · 2025-04-01T14:07:53.296Z · comments (1)
Cape Town – ACX Meetups Everywhere Spring 2025
Leo_9 · 2025-03-26T00:11:46.238Z · comments (0)
Hong Kong ACX Spring Meetup 2025
fbreton · 2025-03-24T14:27:11.854Z · comments (0)
Machines of Stolen Grace
Riley Tavassoli (riley-tavassoli) · 2025-03-27T18:15:23.736Z · comments (0)
[question] How familiar is the Lesswrong community as a whole with the concept of Reward-modelling?
Oxidize · 2025-04-09T23:33:18.044Z · answers+comments (8)
Harrisburg – ACX Meetups Everywhere Spring 2025
wordsthatmakequestions · 2025-03-25T23:49:35.053Z · comments (0)
Albany – ACX Meetups Everywhere Spring 2025
Jake S (jake s) · 2025-03-25T23:49:37.414Z · comments (0)
Asheville – ACX Meetups Everywhere Spring 2025
Vicki Williams (vicki-williams) · 2025-03-25T23:49:32.124Z · comments (0)
Ames – ACX Meetups Everywhere Spring 2025
Sarah (sarah-2) · 2025-03-25T23:49:39.223Z · comments (0)
Antalya – ACX Meetups Everywhere Spring 2025
Annalise Tarhan (annalise-tarhan) · 2025-03-25T23:49:39.835Z · comments (0)
Evaluating Oversight Robustness with Incentivized Reward Hacking
Yoav (crazytieguy) · 2025-04-20T16:53:44.897Z · comments (0)
Belgrade – ACX Meetups Everywhere Spring 2025
Tanya Trninic (tanya trninic) · 2025-03-25T23:49:40.468Z · comments (0)
[question] To what ethics is an AGI actually safely alignable?
StanislavKrym · 2025-04-20T17:09:25.279Z · answers+comments (6)
A Platform for Falsifiable Conjectures and Public Refutation — Would This Be Useful?
PetrusNonius · 2025-04-08T21:09:23.819Z · comments (1)
Text First, Evidence Later? Managing Quality and Trust in an Era of AI-Augmented Research
Thehumanproject.ai · 2025-04-10T18:52:58.934Z · comments (1)
Bern – ACX Meetups Everywhere Spring 2025
Daniel (daniel-3) · 2025-03-25T23:49:43.094Z · comments (0)
Oklahoma City – ACX Meetups Everywhere Spring 2025
Bean (Alvaromoret) · 2025-03-26T00:11:48.249Z · comments (0)
How do SAE Circuits Fail? A Case Study Using a Starts-with-'E' Letter Detection Task
adsingh-64 · 2025-03-30T00:47:18.711Z · comments (0)
Shanghai – ACX Meetups Everywhere Spring 2025
David Jiang (david-jiang) · 2025-03-26T00:11:50.908Z · comments (1)
Stuttgart – ACX Meetups Everywhere Spring 2025
Steve · 2025-03-25T23:49:26.494Z · comments (0)
[link] Digital Error Correction and Lock-In
alamerton · 2025-04-08T15:46:31.602Z · comments (0)
Gothenburg – ACX Meetups Everywhere Spring 2025
stefan · 2025-03-25T23:49:26.269Z · comments (0)
Fort Collins – ACX Meetups Everywhere Spring 2025
Spencer B (spencer0) · 2025-03-25T23:49:25.781Z · comments (0)
← previous page (newer posts) · next page (older posts) →