LessWrong 2.0 Reader


LLMs as a Planning Overhang
Larks · 2024-07-14T02:54:14.295Z · comments (8)

[link] Simple Kelly betting in prediction markets
jessicata (jessica.liu.taylor) · 2024-03-06T18:59:18.243Z · comments (3)

Tort Law Can Play an Important Role in Mitigating AI Risk
Gabriel Weil (gabriel-weil) · 2024-02-12T17:17:59.135Z · comments (9)

[link] AISafety.info: What is the "natural abstractions hypothesis"?
Algon · 2024-10-05T12:31:14.195Z · comments (2)

[link] On what research policymakers actually need
MondSemmel · 2024-04-23T19:50:12.833Z · comments (0)

Are we so good to simulate?
KatjaGrace · 2024-03-04T05:20:03.535Z · comments (24)

[link] Elon files grave charges against OpenAI
mako yass (MakoYass) · 2024-03-01T17:42:13.963Z · comments (10)

Index of rationalist groups in the Bay Area July 2024
Lucie Philippon (lucie-philippon) · 2024-07-26T16:32:25.337Z · comments (14)

[link] The consistent guessing problem is easier than the halting problem
jessicata (jessica.liu.taylor) · 2024-05-20T04:02:03.865Z · comments (5)

Dialogue on What It Means For Something to Have A Function/Purpose
johnswentworth · 2024-07-15T16:28:56.609Z · comments (5)

The "context window" analogy for human minds
Ruby · 2024-02-13T19:29:10.387Z · comments (0)

Free Will and Dodging Anvils: AIXI Off-Policy
Cole Wyeth (Amyr) · 2024-08-29T22:42:24.485Z · comments (12)

[link] [Fiction] A Confession
Arjun Panickssery (arjun-panickssery) · 2024-04-18T16:28:48.194Z · comments (2)

AI #70: A Beautiful Sonnet
Zvi · 2024-06-27T14:40:08.087Z · comments (0)

0.202 Bits of Evidence In Favor of Futarchy
niplav · 2024-09-29T21:57:59.896Z · comments (0)

Glitch Token Catalog - (Almost) a Full Clear
Lao Mein (derpherpize) · 2024-09-21T12:22:16.403Z · comments (3)

Mud and Despair (Part 4 of "The Sense Of Physical Necessity")
LoganStrohl (BrienneYudkowsky) · 2024-03-07T00:14:23.975Z · comments (0)

Rolling Thresholds for AGI Scaling Regulation
Larks · 2025-01-12T01:30:23.797Z · comments (6)

Lecture Series on Tiling Agents
abramdemski · 2025-01-14T21:34:03.907Z · comments (14)

Evolution and the Low Road to Nash
Aydin Mohseni (aydin-mohseni) · 2025-01-22T07:06:32.305Z · comments (2)

[link] Scaling Wargaming for Global Catastrophic Risks with AI
rai (nonveumann) · 2025-01-18T15:10:39.696Z · comments (2)

The Laws of Large Numbers
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-04T11:54:16.967Z · comments (11)

Six Small Cohabitive Games
Screwtape · 2025-01-15T21:59:29.778Z · comments (7)

Why modelling multi-objective homeostasis is essential for AI alignment (and how it helps with AI safety as well)
Roland Pihlakas (roland-pihlakas) · 2025-01-12T03:37:59.692Z · comments (5)

AI Safety Camp 10
Robert Kralisch (nonmali-1) · 2024-10-26T11:08:09.887Z · comments (9)

Mech Interp Lacks Good Paradigms
Daniel Tan (dtch1997) · 2024-07-16T15:47:32.171Z · comments (0)

Winning isn't enough
JesseClifton · 2024-11-05T11:37:39.486Z · comments (18)

Book Review: On the Edge: The Business
Zvi · 2024-09-25T12:20:06.230Z · comments (0)

Drug development costs can range over two orders of magnitude
rossry · 2024-11-03T23:13:17.685Z · comments (0)

Losing Faith In Contrarianism
omnizoid · 2024-04-25T20:53:34.842Z · comments (44)

Litigate-for-Impact: Preparing Legal Action against an AGI Frontier Lab Leader
Sonia Joseph (redhat) · 2024-12-07T21:42:29.038Z · comments (7)

Compelling Villains and Coherent Values
Cole Wyeth (Amyr) · 2024-10-06T19:53:47.891Z · comments (4)

D&D.Sci: Whom Shall You Call?
abstractapplic · 2024-07-05T20:53:37.010Z · comments (6)

[link] An X-Ray is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation
hugofry · 2024-10-07T08:53:14.658Z · comments (0)

[link] Tinker
Richard_Ngo (ricraz) · 2024-04-16T18:26:38.679Z · comments (0)

Resolving von Neumann-Morgenstern Inconsistent Preferences
niplav · 2024-10-22T11:45:20.915Z · comments (5)

OODA your OODA Loop
Raemon · 2024-10-11T00:50:48.119Z · comments (3)

Inducing Unprompted Misalignment in LLMs
Sam Svenningsen (sven) · 2024-04-19T20:00:58.067Z · comments (7)

[link] Generative ML in chemistry is bottlenecked by synthesis
Abhishaike Mahajan (abhishaike-mahajan) · 2024-09-16T16:31:34.801Z · comments (2)

Evaluating Sparse Autoencoders with Board Game Models
Adam Karvonen (karvonenadam) · 2024-08-02T19:50:21.525Z · comments (1)

[question] What progress have we made on automated auditing?
LawrenceC (LawChan) · 2024-07-06T01:49:43.714Z · answers+comments (1)

[link] Locally optimal psychology
Chipmonk · 2024-11-25T18:35:11.985Z · comments (7)

A New Class of Glitch Tokens - BPE Subtoken Artifacts (BSA)
Lao Mein (derpherpize) · 2024-09-20T13:13:26.181Z · comments (7)

UDT1.01: The Story So Far (1/10)
Diffractor · 2024-03-27T23:22:35.170Z · comments (6)

Orca communication project - seeking feedback (and collaborators)
Towards_Keeperhood (Simon Skade) · 2024-12-03T17:29:40.802Z · comments (16)

The murderous shortcut: a toy model of instrumental convergence
Thomas Kwa (thomas-kwa) · 2024-10-02T06:48:06.787Z · comments (0)

Doing Research Part-Time is Great
casualphysicsenjoyer (hatta_afiq) · 2024-11-22T19:01:15.542Z · comments (7)

[link] Twitter thread on AI takeover scenarios
Richard_Ngo (ricraz) · 2024-07-31T00:24:33.866Z · comments (0)

Your LLM Judge may be biased
Henry Papadatos (henry) · 2024-03-29T16:39:22.534Z · comments (9)

Distinguish worst-case analysis from instrumental training-gaming
Olli Järviniemi (jarviniemi) · 2024-09-05T19:13:34.443Z · comments (0)



Recent comments

ape-in-the-coat on Nick Land: Orthogonality

I had an initial impulse to simply downvote the post based on ideological misalignment even without properly reading it, caught myself in the process of thinking about it, and made myself read the post first. As a result I strongly downvoted it based on its quality.

Most of it is a low-effort propaganda pamphlet: vibes-based word salad instead of clear reasoning, and theses mostly without justification. Where there is some justification, it is so comically weak that there is not much to have a productive discussion about, like the idea that the existence of instrumental values somehow disproves the orthogonality thesis, or that the fact that all our values are the product of evolution must make us care about evolution instead of our values.

Most of the blame of course goes to the original author, Nick Land, not to @lumpenspace, who has simply reposted the ideas. But I think low-effort reposting of poor reasoning also shouldn't be rewarded, and I'd like to see less of it on this site.

A better post about Land's ideas on Orthogonality would present his reasoning clearly, along with some possible arguments and counterarguments, steelmans, and ideological Turing tests. At the very least it would put the ideas in proper context instead of starting with proclamations about how "neoreaction and dark enlightenment are totally not fascist, though maybe racist, but who even cares about that in this day and age, am I right?".

And such a better post already exists. It was written more than ten years ago and is now considered a Less Wrong classic. So what does this worse version even contribute to the discourse?

self on Self's Shortform

Yes. The product I bought identifies itself as "Sceletium tortuosum".

I've only tried 1 brand/product, and haven't seen any outstanding sources on it either, so I can't offer much guidance there.

I can anecdotally note that the effects seem quite strong for a legal substance at 0.5g, and that it has short-term effects plus potentially also weaker long-term effects (made me more relaxed? hard to say) (comparable to MDMA used in trauma therapy).

christiankl on Davey Morse's Shortform

Human content isn't easy to distinguish from non-human content.

tsvibt on Nick Land: Orthogonality

A start of one critique is:

It simply means Darwinian processes have no limits that matter to us.

Not true! Roughly speaking, we can in principle just decide to not do that. A body can in principle have an immune system that doesn't lose to infection; there could in principle be a world government that picks the lightcone's destiny. The arguments about novel understanding implying novel values might be partly right, but they don't really cut against Mateusz's point.

viliam on Thread for Sense-Making on Recent Murders and How to Sanely Respond

Teaching rationality the shallow way -- nope; knowing about biases can hurt people

Teaching rationality the deep way -- nope; reason as a memetic immune disorder

:(

Perhaps there should be some "pre-rationality" lessons. Something stabilizing you need to learn first, so that learning about rationality does not make you crazy.

There are some materials that already seem to point in that direction: adding up to normality, ethical injunctions. Perhaps the CFAR workshops should start with focusing on these things, in a serious way (like, spend at least one day only debating this, check that the participants understood the lesson, and maybe kick out those who didn't?).

Because, although some people get damaged by learning about rationality, it seems to me that many people don't (some of them only because they don't change in any significant way, but some of them internalize the lessons in a good way). If we could predict who would end up which way, that could allow us to reduce the damage, while still delivering the value.

Of course this only applies to the workshops; online communication is a different question. But it seems to me that the bad things mostly happen offline.

mateusz-baginski on Thread for Sense-Making on Recent Murders and How to Sanely Respond

Why is it the case that a majority of Zizians that we hear about in the news is trans/nb/queer? (If this is representative of Zizians in general, why is it true of Zizians in general?)

matrice-jacobine on Thread for Sense-Making on Recent Murders and How to Sanely Respond

TBF it is fairly striking, reading about early Soviet history, how many of the Old Bolshevik intelligentsia would have fit right in this community, but the whole "Putin is a secret cosmist" crowd is... unhinged.

mateusz-baginski on Nick Land: Orthogonality

I believe the opposite - that the drives identified by Steve Omohundro as instrumental goals for any sufficiently advanced AI (like self-preservation, efficiency, resource acquisition) are really the only terminal goals that matter.

Even if this is ~technically true, if your [essence of self that you want to preserve] involves something like [effectively ensuring that X happens], this is at least behaviorally equivalent to having a terminal goal that is not instrumental in the sense that instrumental convergence is not going to universally produce it in the limit.

matrice-jacobine on Thread for Sense-Making on Recent Murders and How to Sanely Respond

@PhilGoetz's Reason as memetic immune disorder seems relevant here. It has been noted many times that engineers are disproportionately involved in terrorism, in ways that the mere usefulness of their engineering skills can't explain.

jan_kulveit on Catastrophe through Chaos

For emergency response, new ALERT. Personally think the forecasting/horizon scanning part of Sentinel is good, the emergency response negative in expectation. What does it mean for funders idk, would donate conditionally on the funds being restricted to the horizon scanning part.