LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] A response to OpenAI’s “How we think about safety and alignment”
Harlan · 2025-03-31T20:58:31.901Z · comments (0)
[link] Delicious Boy Slop - Boring Diet, Effortless Weightloss
sapphire (deluks917) · 2025-03-24T15:01:58.355Z · comments (8)
A Talmudic Rationalist Cautionary Tale
Noah Birnbaum (daniel-birnbaum) · 2025-04-15T04:11:16.972Z · comments (1)
[link] Seeking feedback on "MAD Chairs: A new tool to evaluate AI"
Chris Santos-Lang (chris-santos-lang) · 2025-04-02T03:04:43.182Z · comments (0)
[link] The Case For Geopolitical Financial Speculation
prue (prue0) · 2025-04-01T21:09:17.515Z · comments (0)
[question] How likely are the USA to decay and how will it influence the AI development?
StanislavKrym · 2025-04-12T04:42:27.604Z · answers+comments (0)
Host Keys and SSHing to EC2
jefftk (jkaufman) · 2025-04-17T15:10:29.139Z · comments (6)
Probability Theory Fundamentals 102: Territory that Probability is in the Map of
Ape in the coat · 2025-03-26T06:40:57.913Z · comments (7)
Takes on Takeoff
atharva · 2025-03-25T00:20:07.915Z · comments (0)
[link] Podcast on “AI tools for existential security” — transcript
Lizka · 2025-04-21T19:26:07.518Z · comments (0)
An Introduction to SAEs and their Variants for Mech Interp
Adam Newgas (BorisTheBrave) · 2025-04-19T14:09:31.198Z · comments (0)
Transhumanism and AI: Toward Prosperity or Extinction?
Shaïman · 2025-03-22T18:16:07.868Z · comments (2)
Cheesecake Frosting
jefftk (jkaufman) · 2025-04-04T02:10:07.755Z · comments (9)
Story Feedback Request: The Policy - Emergent Alignment, Recursive Cognition, and AGI Trajectories
queelius · 2025-03-31T11:08:21.667Z · comments (2)
San Francisco – ACX Meetups Everywhere Spring 2025
Austin Chen (austin-chen) · 2025-03-25T23:48:21.681Z · comments (0)
Will the Need to Retrain AI Models from Scratch Block a Software Intelligence Explosion?
Tom Davidson (tom-davidson-1) · 2025-03-28T14:12:02.163Z · comments (0)
What are good safety standards for open source AIs from China?
ChristianKl · 2025-04-12T13:06:16.663Z · comments (2)
[link] Calculus is about change
dkl9 · 2025-04-01T19:44:43.453Z · comments (1)
[link] What is scaffolding?
Vishakha (vishakha-agrawal) · 2025-03-27T09:06:35.403Z · comments (0)
[question] Would it be effective to learn a language to improve cognition?
Hruss (henry-russell) · 2025-03-26T10:17:56.357Z · answers+comments (7)
Pictures for 2024
jefftk (jkaufman) · 2025-03-24T02:40:07.051Z · comments (0)
Coupling for Decouplers — Intro
Jacob Falkovich (Jacobian) · 2025-04-07T15:12:26.892Z · comments (0)
Misinformation is the default, and information is the government telling you your tap water is safe to drink
danielechlin · 2025-04-07T22:28:18.158Z · comments (2)
[link] Conditional Forecasting as Model Parameterization
Molly (hickman-santini) · 2025-04-18T02:35:42.110Z · comments (0)
Hamburg – ACX Meetups Everywhere Spring 2025
Gunnar_Zarncke · 2025-03-25T23:48:44.505Z · comments (0)
Brisbane – ACX Meetups Everywhere Spring 2025
Laura (laura-2) · 2025-03-25T23:49:45.806Z · comments (0)
[link] The Care and Feeding of Mythological Intelligences
Jack (jack-3) · 2025-04-02T22:05:21.151Z · comments (0)
The Mirror Problem in AI: Why Language Models Say Whatever You Want
RobT · 2025-04-15T18:40:02.793Z · comments (2)
Risers for Foot Percussion
jefftk (jkaufman) · 2025-04-15T11:10:08.577Z · comments (2)
How to enjoy fail attempts without self-deception (technique)
YanLyutnev (YanLutnev) · 2025-03-30T13:49:23.793Z · comments (0)
Karma Tests in Logical Counterfactual Simulations motivates strong agents to protect weak agents
Knight Lee (Max Lee) · 2025-04-18T11:11:23.239Z · comments (6)
[link] Grounded Ghosts in the Machine - Friston Blankets, Mirror Neurons, and the Quest for Cooperative AI
Davidmanheim · 2025-04-10T10:15:54.880Z · comments (0)
[link] Paper Highlights, March '25
gasteigerjo · 2025-04-07T20:17:42.944Z · comments (0)
[Research sprint] Single-model crosscoder feature ablation and steering
Thomas Read (thjread) · 2025-04-06T14:42:30.357Z · comments (0)
MATS is hiring!
Ryan Kidd (ryankidd44) · 2025-04-08T20:45:15.280Z · comments (0)
Advanced AI Systems Will Not Follow Historical Technological Patterns and Will Not Suffer the Misattribution of Productivity Gains
Max Abecassis (max@customplay.com) · 2025-03-24T19:20:31.486Z · comments (0)
Sydney – ACX Meetups Everywhere Spring 2025
Elo · 2025-03-25T23:48:38.414Z · comments (0)
Nuanced Models for the Influence of Information
ozziegooen · 2025-04-10T18:28:34.082Z · comments (0)
Straightforward Steps to Marginally Improve Odds of Whole Brain Emulation
Dom Polsinelli (dom-polsinelli) · 2025-03-24T17:14:38.794Z · comments (20)
Suggesting some revisions to Graham's hierarchy of disagreement
Sniffnoy · 2025-04-02T22:25:17.267Z · comments (2)
[Rockville] Rationalist Shabbat
maia · 2025-04-18T15:38:30.650Z · comments (0)
What empirical research directions has Eliezer commented positively on?
Chris_Leong · 2025-04-15T08:53:41.677Z · comments (1)
Yeshua's Basilisk
Alex Beyman (alexbeyman) · 2025-03-29T18:11:50.535Z · comments (1)
Linkpost to a Summary of "Imagining and building wise machines: The centrality of AI metacognition" by Johnson, Karimi, Bengio, et al.
Chris_Leong · 2025-04-10T11:54:37.484Z · comments (0)
Emergent scaling effects on the functional hierarchies within LLMs
Foop · 2025-03-24T13:03:30.930Z · comments (0)
Austin – ACX Meetups Everywhere Spring 2025
SilasBarta · 2025-03-25T23:49:23.114Z · comments (0)
Comments on Karma systems
Arturo Macias (arturo-macias) · 2025-04-01T12:53:16.303Z · comments (2)
Boston – ACX Meetups Everywhere Spring 2025
Screwtape · 2025-03-25T23:49:16.978Z · comments (0)
Berkeley – ACX Meetups Everywhere Spring 2025
Screwtape · 2025-03-25T23:49:15.038Z · comments (0)
Building Communities Beyond the Bay
Lucie Philippon (lucie-philippon) · 2025-04-01T22:07:16.288Z · comments (2)
← previous page (newer posts) · next page (older posts) →