LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] Should AIs be Encouraged to Cooperate?
PeterMcCluskey · 2025-04-15T21:57:06.096Z · comments (2)

Host Keys and SSHing to EC2
jefftk (jkaufman) · 2025-04-17T15:10:29.139Z · comments (6)

The Mirror Problem in AI: Why Language Models Say Whatever You Want
RobT · 2025-04-15T18:40:02.793Z · comments (2)

Risers for Foot Percussion
jefftk (jkaufman) · 2025-04-15T11:10:08.577Z · comments (1)

What empirical research directions has Eliezer commented positively on?
Chris_Leong · 2025-04-15T08:53:41.677Z · comments (1)

[Rockville] Rationalist Shabbat
maia · 2025-04-18T15:38:30.650Z · comments (0)

[link] Conditional Forecasting as Model Parameterization
Molly (hickman-santini) · 2025-04-18T02:35:42.110Z · comments (0)

[link] Human-level is not the limit
Vishakha (vishakha-agrawal) · 2025-04-16T08:33:15.498Z · comments (2)

0 Motivation Mapping through Information Theory
P. João (gabriel-brito) · 2025-04-18T00:53:34.360Z · comments (0)

Mass Exposure Paradox
max-sixty · 2025-04-16T20:18:00.492Z · comments (0)

Some OthelloGPT Circuits
Alfred Wong (alfred-wong) · 2025-04-15T18:41:36.216Z · comments (0)

[link] Nihilism Is Not Enough By Peter Thiel
shawkisukkar · 2025-04-15T00:13:01.375Z · comments (4)

$500 bounty for best short-form fiction about our near future world; $100 for recommending winning piece: new “Art of Near Future World” quarterly art project
Ramon Gonzalez (ramon-gonzalez) · 2025-04-15T00:46:10.637Z · comments (0)

[link] AISN #51: AI Frontiers
Corin Katzke (corin-katzke) · 2025-04-15T16:01:56.701Z · comments (1)

How Logic "Really" Works: An Engineering Perspective
Daniil Strizhov (mila-dolontaeva) · 2025-04-16T05:34:09.443Z · comments (0)

Karma Tests in Logical Counterfactual Simulations motivates strong agents to protect weak agents
Knight Lee (Max Lee) · 2025-04-18T11:11:23.239Z · comments (0)

Gamify life from BayesianMind
P. João (gabriel-brito) · 2025-04-16T16:17:49.284Z · comments (2)

How to Defend the Indefensible
Alex Beyman (alexbeyman) · 2025-04-15T07:45:15.971Z · comments (1)

Луна Лавгуд и Комната Тайн, Часть 5
Kongo Landwalker (kongo-landwalker) · 2025-04-14T00:10:36.028Z · comments (0)

[link] 3M Subscriber YouTube Account 'Channel 5' Reporting On Rationalism
sakraf · 2025-04-15T13:02:33.736Z · comments (0)

Finance and AI Timelines
DAL · 2025-04-16T16:55:06.957Z · comments (0)

[link] AI is advancing fast
Vishakha (vishakha-agrawal) · 2025-04-16T08:17:06.055Z · comments (0)

Creating 'Making God': a Feature Documentary on risks from AGI
Connor Axiotes (connor-axiotes-1) · 2025-04-15T02:56:09.206Z · comments (0)

Sam Altman's sister claims Sam sexually abused her -- Part 8: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-14T17:42:53.705Z · comments (0)

On AI personhood
p.b. · 2025-04-17T12:31:52.288Z · comments (6)

One Night in Delphi
Eggs (donald-sampson) · 2025-04-18T02:17:04.957Z · comments (2)

[link] Doing Prioritization Better
arvomm (arvo-munoz) · 2025-04-16T18:46:41.797Z · comments (1)

8 PRIME SKILLS – A construction from MaxEnt Informational Efficiency in 4 questions
P. João (gabriel-brito) · 2025-04-16T16:53:51.351Z · comments (0)

[link] The road from human-level to superintelligent AI may be short
Vishakha (vishakha-agrawal) · 2025-04-16T08:35:54.376Z · comments (0)

[link] AI may attain human level soon
Vishakha (vishakha-agrawal) · 2025-04-16T08:28:55.592Z · comments (0)

8 PRIME SKILLS - A simplified construction from MaxEnt Informational Efficiency in 4 questions
P. João (gabriel-brito) · 2025-04-17T11:04:07.424Z · comments (4)

[link] How worker co-ops can help restore social trust
B Jacobs (Bob Jacobs) · 2025-04-17T14:14:47.165Z · comments (5)

What happens when LLMs learn new things? & Continual learning forever.
sunchipsster · 2025-04-15T18:38:35.166Z · comments (0)

What if there was a nuke in Manhattan and why that could be a good thing
Ratburn · 2025-04-15T00:19:41.844Z · comments (11)

Towards Understanding the Representation of Belief State Geometry in Transformers
Karthik Viswanathan (vkarthik095) · 2025-04-18T12:39:01.251Z · comments (0)

The Case for White Box Control
J Rosser (j-rosser-uk) · 2025-04-18T16:10:57.823Z · comments (0)

Evaluating Collaborative AI Performance Subject to Sabotage
Matthew Khoriaty (matthew-khoriaty) · 2025-04-18T19:33:41.547Z · comments (0)

AI Control Methods Literature Review
Ram Potham (ram-potham) · 2025-04-18T21:15:34.682Z · comments (0)

Sam Altman's sister claims Sam sexually abused her -- Part 7: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-14T17:43:28.897Z · comments (0)

Opportunity to to learn more about AI Innovation & Security Policy
PolicyTakes · 2025-04-16T01:35:27.203Z · comments (0)

Correcting Deceptive Alignment using a Deontological Approach
JeaniceK · 2025-04-14T22:07:57.860Z · comments (0)

Religious Persistence: A Missing Primitive for Robust Alignment
lauriewired · 2025-04-14T22:03:45.868Z · comments (3)

Lightning Talks!
nathandunkerley · 2025-04-14T20:39:17.593Z · comments (0)

Measuring Beliefs of Language Models During Chain-of-Thought Reasoning
Baram Sosis (baram-sosis) · 2025-04-18T22:56:28.727Z · comments (0)

Could LLMs Learn to Detect Bias Autonomously, Like Tesla’s Self-Driving Cars?
Omnipheasant · 2025-04-18T18:45:36.242Z · comments (0)

Applications Open for Impact Accelerator Program for Experienced Professionals
Clark Wisenbaker (accounts-hip) · 2025-04-14T16:27:32.340Z · comments (0)

Hierarchical Cognitive Anchoring: A Sketch Toward Scalable Structural Alignment
sparckix · 2025-04-18T19:03:51.115Z · comments (0)

An artistic illustration of Scalable Oversight - "A world apart, neither gods nor mortals"
Marius Adrian Nicoară · 2025-04-16T12:41:44.874Z · comments (0)

Automating Mechanistic Interpretability via Program Synthesis
Edy Nastase (edy-nastase) · 2025-04-17T10:58:46.748Z · comments (1)

Sam Altman's sister claims Sam sexually abused her -- Part 5: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-14T01:00:07.084Z · comments (0)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

hpcfung on hpcfung's Shortform

Is there any attempt at compiling a list of all publicly available university courses materials (lecture notes, videos, reference books, syllabi), across all institutions? I seem to remember cosmolearning.org but the site is no longer running.

I imagine this kind of infrastructure is really helpful, or even necessary to self learners.

The equivalent for researchers would be conferences, summer schools/workshops, powerpoints for talks, etc.

caerulea-lawrence on What If Galaxies Are Alive and Atoms Have Minds? A Thought Experiment on Life Across Scales

To answer the question to pose as a precision in your comment [LW · GW], if there are structures that could be analogous to intelligence, without being literal biological? - The simple answer to that is 'yes'.

What we call 'consciousness' is not a 'neutral' lens - and there is no issue with imagining and understanding that there could be types of 'consciousness' that are shaped by very different processes than our own.

Personally I want to be part of a conscious universe, where there is communication going in all directions, and there is a shared goal and purpose. Though, since the structures might be so different, even reaching the step where they are able to differentiate themselves, and even communicate anywhere close to effectively, won't be easy. Considering how hard it is to understand ourselves, aka the signals from cells, bacteria and viruses, it might not be much easier for, say, the Earth to communicate with us.

Ideas/theories that are similar:
Panpsychism, but an idea/theory that might also fit would be Analytical Idealism.
A theory that explores this in a much more general way, looking at it from the perspective of values and paradigms, would be Spiral Dynamics.

I also don't see anything wrong with going in this direction, as an exploration. Complexity theory and emergence duly point out that there is much more to our reality, even to biology, than meets the eye.

nmca on Recent AI model progress feels mostly like bullshit

Is there an o3 update yet?

knight-lee on Power Lies Trembling: a three-book review

:) thank you so much for your thoughts.

Unfortunately, my model of the world is that if AI kills "more than 10%," it's probably going to be everyone and everything, so the insurance won't work according to my beliefs.

I only defined AI catastrophe as "killing more than 10%" because it's what the survey by Karger et al. asked the participants.

I don't believe in option 2, because if you asked people to bet against AI risk with unfavourable odds, they probably won't feel too confident against AI risk.

daniel-kokotajlo on AI 2027: What Superintelligence Looks Like

That's part of it, but also, over the course of 2027 OpenBrain works hard to optimize for data-efficiency, generalization and transfer learning ability, etc. and undergoes at least two major paradigm shifts in AI architecture.

michaeldickens on What Makes an AI Startup "Net Positive" for Safety?

I think the statement in the parent comment is too general. What I should have said is that every generalist frontier AI company has been net negative. Narrow AI companies that provide useful services and have ~zero chance of accelerating AGI are probably net positive.

lc on Three Months In, Evaluating Three Rationalist Cases for Trump

The indexes above seem to be concerned only with state restrictions on speech. But even if they weren't, I would be surprised if the private situation was any better in the UK than it is here.

gurkenglas on What Makes an AI Startup "Net Positive" for Safety?

They did the opposite, incentivizing themselves to reach the profit cap. I'm talking about making sure that any net worth beyond a billion goes to someone else.

chris_leong on Chris_Leong's Shortform

I believe those are useful frames for understanding the impacts.

jay95 on Consequentialists should have a comprehensive set of deontological beliefs they adhere to

It is, but I'm specifically saying a form of rule consequentialism that serves personal happiness about as well as it could be served is in fact rational (for anyone who is trying to maximize impersonal happiness and probably for anyone who is a consequentialist of any kind).