LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] Should AIs be Encouraged to Cooperate?
PeterMcCluskey · 2025-04-15T21:57:06.096Z · comments (2)
Host Keys and SSHing to EC2
jefftk (jkaufman) · 2025-04-17T15:10:29.139Z · comments (6)
The Mirror Problem in AI: Why Language Models Say Whatever You Want
RobT · 2025-04-15T18:40:02.793Z · comments (2)
Risers for Foot Percussion
jefftk (jkaufman) · 2025-04-15T11:10:08.577Z · comments (1)
What empirical research directions has Eliezer commented positively on?
Chris_Leong · 2025-04-15T08:53:41.677Z · comments (1)
[Rockville] Rationalist Shabbat
maia · 2025-04-18T15:38:30.650Z · comments (0)
[link] Conditional Forecasting as Model Parameterization
Molly (hickman-santini) · 2025-04-18T02:35:42.110Z · comments (0)
[link] Human-level is not the limit
Vishakha (vishakha-agrawal) · 2025-04-16T08:33:15.498Z · comments (2)
0 Motivation Mapping through Information Theory
P. João (gabriel-brito) · 2025-04-18T00:53:34.360Z · comments (0)
Mass Exposure Paradox
max-sixty · 2025-04-16T20:18:00.492Z · comments (0)
Some OthelloGPT Circuits
Alfred Wong (alfred-wong) · 2025-04-15T18:41:36.216Z · comments (0)
[link] Nihilism Is Not Enough By Peter Thiel
shawkisukkar · 2025-04-15T00:13:01.375Z · comments (4)
$500 bounty for best short-form fiction about our near future world; $100 for recommending winning piece: new “Art of Near Future World” quarterly art project
Ramon Gonzalez (ramon-gonzalez) · 2025-04-15T00:46:10.637Z · comments (0)
[link] AISN #51: AI Frontiers
Corin Katzke (corin-katzke) · 2025-04-15T16:01:56.701Z · comments (1)
How Logic "Really" Works: An Engineering Perspective
Daniil Strizhov (mila-dolontaeva) · 2025-04-16T05:34:09.443Z · comments (0)
Karma Tests in Logical Counterfactual Simulations motivates strong agents to protect weak agents
Knight Lee (Max Lee) · 2025-04-18T11:11:23.239Z · comments (0)
Gamify life from BayesianMind
P. João (gabriel-brito) · 2025-04-16T16:17:49.284Z · comments (2)
How to Defend the Indefensible
Alex Beyman (alexbeyman) · 2025-04-15T07:45:15.971Z · comments (1)
Луна Лавгуд и Комната Тайн, Часть 5
Kongo Landwalker (kongo-landwalker) · 2025-04-14T00:10:36.028Z · comments (0)
[link] 3M Subscriber YouTube Account 'Channel 5' Reporting On Rationalism
sakraf · 2025-04-15T13:02:33.736Z · comments (0)
Finance and AI Timelines
DAL · 2025-04-16T16:55:06.957Z · comments (0)
[link] AI is advancing fast
Vishakha (vishakha-agrawal) · 2025-04-16T08:17:06.055Z · comments (0)
Creating 'Making God': a Feature Documentary on risks from AGI
Connor Axiotes (connor-axiotes-1) · 2025-04-15T02:56:09.206Z · comments (0)
Sam Altman's sister claims Sam sexually abused her -- Part 8: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-14T17:42:53.705Z · comments (0)
On AI personhood
p.b. · 2025-04-17T12:31:52.288Z · comments (6)
One Night in Delphi
Eggs (donald-sampson) · 2025-04-18T02:17:04.957Z · comments (2)
[link] Doing Prioritization Better
arvomm (arvo-munoz) · 2025-04-16T18:46:41.797Z · comments (1)
8 PRIME SKILLS – A construction from MaxEnt Informational Efficiency in 4 questions
P. João (gabriel-brito) · 2025-04-16T16:53:51.351Z · comments (0)
[link] The road from human-level to superintelligent AI may be short
Vishakha (vishakha-agrawal) · 2025-04-16T08:35:54.376Z · comments (0)
[link] AI may attain human level soon
Vishakha (vishakha-agrawal) · 2025-04-16T08:28:55.592Z · comments (0)
8 PRIME SKILLS - A simplified construction from MaxEnt Informational Efficiency in 4 questions
P. João (gabriel-brito) · 2025-04-17T11:04:07.424Z · comments (4)
[link] How worker co-ops can help restore social trust
B Jacobs (Bob Jacobs) · 2025-04-17T14:14:47.165Z · comments (5)
What happens when LLMs learn new things? & Continual learning forever.
sunchipsster · 2025-04-15T18:38:35.166Z · comments (0)
What if there was a nuke in Manhattan and why that could be a good thing
Ratburn · 2025-04-15T00:19:41.844Z · comments (11)
Towards Understanding the Representation of Belief State Geometry in Transformers
Karthik Viswanathan (vkarthik095) · 2025-04-18T12:39:01.251Z · comments (0)
The Case for White Box Control
J Rosser (j-rosser-uk) · 2025-04-18T16:10:57.823Z · comments (0)
Evaluating Collaborative AI Performance Subject to Sabotage
Matthew Khoriaty (matthew-khoriaty) · 2025-04-18T19:33:41.547Z · comments (0)
AI Control Methods Literature Review
Ram Potham (ram-potham) · 2025-04-18T21:15:34.682Z · comments (0)
Sam Altman's sister claims Sam sexually abused her -- Part 7: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-14T17:43:28.897Z · comments (0)
Opportunity to to learn more about AI Innovation & Security Policy
PolicyTakes · 2025-04-16T01:35:27.203Z · comments (0)
Correcting Deceptive Alignment using a Deontological Approach
JeaniceK · 2025-04-14T22:07:57.860Z · comments (0)
Religious Persistence: A Missing Primitive for Robust Alignment
lauriewired · 2025-04-14T22:03:45.868Z · comments (3)
Lightning Talks!
nathandunkerley · 2025-04-14T20:39:17.593Z · comments (0)
Measuring Beliefs of Language Models During Chain-of-Thought Reasoning
Baram Sosis (baram-sosis) · 2025-04-18T22:56:28.727Z · comments (0)
Could LLMs Learn to Detect Bias Autonomously, Like Tesla’s Self-Driving Cars?
Omnipheasant · 2025-04-18T18:45:36.242Z · comments (0)
Applications Open for Impact Accelerator Program for Experienced Professionals
Clark Wisenbaker (accounts-hip) · 2025-04-14T16:27:32.340Z · comments (0)
Hierarchical Cognitive Anchoring: A Sketch Toward Scalable Structural Alignment
sparckix · 2025-04-18T19:03:51.115Z · comments (0)
An artistic illustration of Scalable Oversight - "A world apart, neither gods nor mortals"
Marius Adrian Nicoară · 2025-04-16T12:41:44.874Z · comments (0)
Automating Mechanistic Interpretability via Program Synthesis
Edy Nastase (edy-nastase) · 2025-04-17T10:58:46.748Z · comments (1)
Sam Altman's sister claims Sam sexually abused her -- Part 5: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-14T01:00:07.084Z · comments (0)
← previous page (newer posts) · next page (older posts) →