LessWrong 2.0 Reader

← previous page (newer posts) · next page (older posts) →

Do Deep Neural Networks Have Brain-like Representations?: A Summary of Disagreements
Joseph Emerson (joseph-emerson) · 2024-11-18T00:07:15.155Z · comments (0)
Truth Terminal: A reconstruction of events
crvr.fr (crdevio) · 2024-11-17T23:51:21.279Z · comments (1)
Which AI Safety Benchmark Do We Need Most in 2025?
Loïc Cabannes (loic-cabannes) · 2024-11-17T23:50:56.337Z · comments (2)
"The Solomonoff Prior is Malign" is a special case of a simpler argument
David Matolcsi (matolcsid) · 2024-11-17T21:32:34.711Z · comments (19)
[link] Chess As The Model Game
criticalpoints · 2024-11-17T19:45:26.499Z · comments (0)
The grass is always greener in the environment that shaped your values
Karl Faulks (karl-faulks) · 2024-11-17T18:00:15.852Z · comments (0)
[link] Announcing turntrout.com, my new digital home
TurnTrout · 2024-11-17T17:42:08.164Z · comments (22)
Secular Solstice Songbook Update
jefftk (jkaufman) · 2024-11-17T17:30:07.404Z · comments (1)
Germany-wide ACX Meetup
Fernand0 · 2024-11-17T10:08:54.584Z · comments (0)
Project Adequate: Seeking Cofounders/Funders
Lorec · 2024-11-17T03:12:12.995Z · comments (7)
Trying Bluesky
jefftk (jkaufman) · 2024-11-17T02:50:04.093Z · comments (16)
AXRP Episode 38.1 - Alan Chan on Agent Infrastructure
DanielFilan · 2024-11-16T23:30:09.098Z · comments (0)
Cross-context abduction: LLMs make inferences about procedural training data leveraging declarative facts in earlier training data
Sohaib Imran (sohaib-imran) · 2024-11-16T23:22:21.857Z · comments (5)
Why We Wouldn't Build Aligned AI Even If We Could
Snowyiu · 2024-11-16T20:19:59.324Z · comments (6)
[question] What (if anything) made your p(doom) go down in 2024?
Satron · 2024-11-16T16:46:43.865Z · answers+comments (6)
Gwerns
Tomás B. (Bjartur Tómas) · 2024-11-16T14:31:57.791Z · comments (2)
Which evals resources would be good?
Marius Hobbhahn (marius-hobbhahn) · 2024-11-16T14:24:48.012Z · comments (4)
OpenAI Email Archives (from Musk v. Altman)
habryka (habryka4) · 2024-11-16T06:38:03.937Z · comments (58)
Using Dangerous AI, But Safely?
habryka (habryka4) · 2024-11-16T04:29:20.914Z · comments (2)
Ayn Rand’s model of “living money”; and an upside of burnout
AnnaSalamon · 2024-11-16T02:59:07.368Z · comments (33)
Fundamental Uncertainty: Epilogue
Gordon Seidoh Worley (gworley) · 2024-11-16T00:57:48.823Z · comments (0)
Making a conservative case for alignment
Cameron Berg (cameron-berg) · 2024-11-15T18:55:40.864Z · comments (36)
The Case For Giving To The Shrimp Welfare Project
omnizoid · 2024-11-15T16:03:57.712Z · comments (13)
Win/continue/lose scenarios and execute/replace/audit protocols
Buck · 2024-11-15T15:47:24.868Z · comments (2)
Antonym Heads Predict Semantic Opposites in Language Models
Jake Ward (jake-ward) · 2024-11-15T15:32:14.102Z · comments (0)
[link] Proposing the Conditional AI Safety Treaty (linkpost TIME)
otto.barten (otto-barten) · 2024-11-15T13:59:01.050Z · comments (8)
[link] A Theory of Equilibrium in the Offense-Defense Balance
Maxwell Tabarrok (maxwell-tabarrok) · 2024-11-15T13:51:33.376Z · comments (3)
Boston Secular Solstice 2024: Call for Singers and Musicians
jefftk (jkaufman) · 2024-11-15T13:50:07.827Z · comments (0)
[link] An Uncanny Moat
Adam Newgas (BorisTheBrave) · 2024-11-15T11:39:15.165Z · comments (0)
[question] What are some positive developments in AI safety in 2024?
Satron · 2024-11-15T10:32:39.541Z · answers+comments (1)
If I care about measure, choices have additional burden (+AI generated LW-comments)
avturchin · 2024-11-15T10:27:15.212Z · comments (11)
What are Emotions?
Myles H (zarsou9) · 2024-11-15T04:20:27.388Z · comments (13)
The Third Fundamental Question
Screwtape · 2024-11-15T04:01:33.770Z · comments (7)
Dance Differentiation
jefftk (jkaufman) · 2024-11-15T02:30:07.694Z · comments (0)
Breaking beliefs about saving the world
Oxidize · 2024-11-15T00:46:03.693Z · comments (2)
[link] College technical AI safety hackathon retrospective - Georgia Tech
yix (Yixiong Hao) · 2024-11-15T00:22:53.159Z · comments (2)
[link] Gwern Branwen interview on Dwarkesh Patel’s podcast: “How an Anonymous Researcher Predicted AI's Trajectory”
Said Achmiz (SaidAchmiz) · 2024-11-14T23:53:34.922Z · comments (0)
[link] Internal music player: phenomenology of earworms
dkl9 · 2024-11-14T23:29:48.383Z · comments (2)
The Foraging (Ex-)Bandit [Ruleset & Reflections]
abstractapplic · 2024-11-14T20:16:21.535Z · comments (3)
[link] Seven lessons I didn't learn from election day
Eric Neyman (UnexpectedValues) · 2024-11-14T18:39:07.053Z · comments (33)
Effects of Non-Uniform Sparsity on Superposition in Toy Models
Shreyans Jain (shreyans-jain) · 2024-11-14T16:59:43.234Z · comments (3)
'Values and Data’s For Starters' - A Necessary Proposal?
Gabriel Brito (gabriel-brito) · 2024-11-14T14:37:57.692Z · comments (0)
AI #90: The Wall
Zvi · 2024-11-14T14:10:04.562Z · comments (6)
Evolutionary prompt optimization for SAE feature visualization
neverix · 2024-11-14T13:06:49.728Z · comments (0)
AXRP Episode 38.0 - Zhijing Jin on LLMs, Causality, and Multi-Agent Systems
DanielFilan · 2024-11-14T07:00:06.977Z · comments (0)
[link] FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
Tamay · 2024-11-14T06:13:22.042Z · comments (0)
Concrete Methods for Heuristic Estimation on Neural Networks
Oliver Daniels (oliver-daniels-koch) · 2024-11-14T05:07:55.240Z · comments (0)
Heresies in the Shadow of the Sequences
Cole Wyeth (Amyr) · 2024-11-14T05:01:11.889Z · comments (12)
literally Hitler
David Gross (David_Gross) · 2024-11-14T03:20:47.959Z · comments (0)
Thoughts after the Wolfram and Yudkowsky discussion
Tahp · 2024-11-14T01:43:12.920Z · comments (13)