LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Claude's Constitutional Consequentialism?
1a3orn · 2024-12-19T19:53:33.254Z · comments (6)

[question] What's the Right Way to think about Information Theoretic quantities in Neural Networks?
Dalcy (Darcy) · 2025-01-19T08:04:30.236Z · answers+comments (12)

Causal inference for the home gardener
braces · 2024-11-27T17:55:52.629Z · comments (1)

MATS mentor selection
DanielFilan · 2025-01-10T03:12:52.141Z · comments (11)

My January alignment theory Nanowrimo
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-02T00:07:24.050Z · comments (2)

Causal Undertow: A Work of Seed Fiction
Daniel Murfet (dmurfet) · 2024-12-08T21:41:48.132Z · comments (0)

[link] We don't want to post again "This might be the last AI Safety Camp"
Remmelt (remmelt-ellen) · 2025-01-21T12:03:33.171Z · comments (17)

[link] A car journey with conservative evangelicals - Understanding some British political-religious beliefs
Nathan Young · 2024-12-06T11:22:45.563Z · comments (8)

Brainrot
Jesse Hoogland (jhoogland) · 2025-01-26T05:35:35.396Z · comments (0)

AXRP Episode 39 - Evan Hubinger on Model Organisms of Misalignment
DanielFilan · 2024-12-01T06:00:06.345Z · comments (0)

Trying to translate when people talk past each other
Kaj_Sotala · 2024-12-17T09:40:02.640Z · comments (12)

What happens next?
Logan Zoellner (logan-zoellner) · 2024-12-29T01:41:33.685Z · comments (19)

Worries about latent reasoning in LLMs
CBiddulph (caleb-biddulph) · 2025-01-20T09:09:02.335Z · comments (3)

Sleep, Diet, Exercise and GLP-1 Drugs
Zvi · 2025-01-21T12:20:06.018Z · comments (4)

Estimating the benefits of a new flu drug (BXM)
DirectedEvolution (AllAmericanBreakfast) · 2025-01-06T04:31:16.837Z · comments (2)

[question] What are the most interesting / challenging evals (for humans) available?
Raemon · 2024-12-27T03:05:26.831Z · answers+comments (13)

[question] Are You More Real If You're Really Forgetful?
Thane Ruthenis · 2024-11-24T19:30:55.233Z · answers+comments (25)

Evolution and the Low Road to Nash
Aydin Mohseni (aydin-mohseni) · 2025-01-22T07:06:32.305Z · comments (2)

Rolling Thresholds for AGI Scaling Regulation
Larks · 2025-01-12T01:30:23.797Z · comments (6)

Litigate-for-Impact: Preparing Legal Action against an AGI Frontier Lab Leader
Sonia Joseph (redhat) · 2024-12-07T21:42:29.038Z · comments (7)

Six Small Cohabitive Games
Screwtape · 2025-01-15T21:59:29.778Z · comments (7)

Lecture Series on Tiling Agents
abramdemski · 2025-01-14T21:34:03.907Z · comments (14)

Why modelling multi-objective homeostasis is essential for AI alignment (and how it helps with AI safety as well)
Roland Pihlakas (roland-pihlakas) · 2025-01-12T03:37:59.692Z · comments (5)

[link] Scaling Wargaming for Global Catastrophic Risks with AI
rai (nonveumann) · 2025-01-18T15:10:39.696Z · comments (2)

The Laws of Large Numbers
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-04T11:54:16.967Z · comments (11)

Childhood and Education #8: Dealing with the Internet
Zvi · 2025-01-06T14:00:09.604Z · comments (7)

[link] You should read Hobbes, Locke, Hume, and Mill via EarlyModernTexts.com
Arjun Panickssery (arjun-panickssery) · 2025-01-30T12:35:03.564Z · comments (1)

Thread for Sense-Making on Recent Murders and How to Sanely Respond
Ben Pace (Benito) · 2025-01-31T03:45:48.201Z · comments (2)

Building Big Science from the Bottom-Up: A Fractal Approach to AI Safety
Lauren Greenspan (LaurenGreenspan) · 2025-01-07T03:08:51.447Z · comments (2)

Why care about AI personhood?
Francis Rhys Ward (francis-rhys-ward) · 2025-01-26T11:24:45.596Z · comments (6)

Orca communication project - seeking feedback (and collaborators)
Towards_Keeperhood (Simon Skade) · 2024-12-03T17:29:40.802Z · comments (16)

Doing Research Part-Time is Great
casualphysicsenjoyer (hatta_afiq) · 2024-11-22T19:01:15.542Z · comments (7)

[link] Locally optimal psychology
Chipmonk · 2024-11-25T18:35:11.985Z · comments (7)

The quantum red pill or: They lied to you, we live in the (density) matrix
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-17T13:58:16.186Z · comments (34)

AI #98: World Ends With Six Word Story
Zvi · 2025-01-09T16:30:07.341Z · comments (2)

Grammars, subgrammars, and combinatorics of generalization in transformers
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-02T09:37:23.191Z · comments (0)

[link] The Way According To Zvi
Sable · 2024-12-07T17:35:48.769Z · comments (0)

Deep Learning is cheap Solomonoff induction?
Lucius Bushnaq (Lblack) · 2024-12-07T11:00:56.455Z · comments (1)

A Matter of Taste
Zvi · 2024-12-18T17:50:07.201Z · comments (4)

Why We Need More Shovel-Ready AI Notkilleveryoneism Megaproject Proposals
Peter Berggren (peter-berggren) · 2025-01-20T22:38:26.593Z · comments (1)

Fireplace and Candle Smoke
jefftk (jkaufman) · 2025-01-01T01:50:01.408Z · comments (4)

Last week of the Discussion Phase
Raemon · 2025-01-09T19:26:59.136Z · comments (0)

[link] Is the AI Doomsday Narrative the Product of a Big Tech Conspiracy?
garrison · 2024-12-04T19:20:59.286Z · comments (1)

Don’t Legalize Drugs
Declan Molony (declan-molony) · 2025-01-14T06:51:14.005Z · comments (9)

Fertility Roundup #4
Zvi · 2024-12-02T14:30:05.968Z · comments (16)

[question] Which Biases are most important to Overcome?
abstractapplic · 2024-12-01T15:40:06.096Z · answers+comments (24)

Kitchen Air Purifier Comparison
jefftk (jkaufman) · 2025-01-22T03:20:03.224Z · comments (2)

Alternative Cancer Care As Biohacking & Book Review: Surviving "Terminal" Cancer
DenizT · 2025-01-06T07:43:52.773Z · comments (6)

Monthly Roundup #26: January 2025
Zvi · 2025-01-20T15:30:08.680Z · comments (15)

Writing experiments and the banana escape valve
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-23T13:11:24.215Z · comments (1)

← previous page (newer posts) · next page (older posts) →

^{^}

Though I'll likely lose interest if it seems like we're talking past each other / won't resolve any cruxy disagreements.

^{^}

(except arguably the qualia research institute's discord server, which might count because it has psychedelics users in it)

^{^}

(Questioning with the goal of causing the questioned one to notice specific assumptions or intuitions to their beliefs, as a result of trying to generate a coherent answer)

^{^}

From an unposted text:

The paradox of recursive explanation applies to metaphysics too
In Explain/Worship/Ignore [LW · GW] (~500 words), Eliezer describes an apparent paradox: if you ask for some physical phenomena to be explained (shown to have a more fundamental cause), then ask the same of the explanation, and so on, the only conceivable outcomes seem paradoxical: infinite recurse, circular self-reference, or a 'special' first cause that does not itself need to be explained.
This is sometimes called the paradox of why. It's usually applied to physics; this text applies it to logic/math-structure and metaphysics/ontology too. In short, you can continually ask "but what is x" for any aspect of logic or ontology.
Here's a hypothetical Socratic dialogue.
Author: "What is reality?"
Interlocutor: "Reality is <complete description of [very large mathematical structure that perfectly reflects the behavior of the universe]>."
Author: "Let's suppose that is true. But this 'math' you just used; what is it?"
Interlocutor: "Mathematics is a program which assigns, based on a few simple rules, 'T' or 'F' to inputs formatted in a certain way."
Author: "I think you just moved the hard part of the question to be one question away: What is a program?"
Interlocutor: "Hmm. I can't define a program in terms of math, because I just did the reverse. Wikipedia says a program is 'a sequence of instructions for a computer to execute'. If I just wrote that, Author would ask what it means for a computer to execute an instruction. What else could I write?
A computer is a part of physics arranged into a localized structure, and for this structure to 'execute an instruction' is for physical law to flow through (operate on, apply to) it, such that the structure's observed behavior matches that of a simpler-than-physics abstract system for transforming inputs to outputs. Unfortunately, I've already defined physics in terms of math, so defining a program in terms of physics would be circular. I think I'm at a dead end."
Author: "Do you want to revise one of your previous definitions?"
Interlocutor: "Maybe I could define math as some more fundamental thing instead of as a certain kind of program. But I just failed to find such a more fundamental thing for 'program'. Let's check Wikipedia again...
WP:Mathematics: 'Mathematics involves the description and manipulation of abstract objects that consist of either abstractions from nature or—in modern mathematics—purely abstract entities that are stipulated to have certain properties, called axioms'
WP:Mathematical_Object 'Typically, a mathematical object can be a value that can be assigned to a symbol, and therefore can be involved in formulas. Commonly encountered mathematical objects include numbers, expressions, shapes, functions, and sets.'
"I can't use the 'abstractions from nature' part, because I've already defined nature to be mathematical, so that would be circular. Saying math is made of numbers, expressions, symbols, etc, isn't helpful either, though; Author will just ask what the symbols themselves are, metaphysically. Okay, I concede for now, though maybe a commenter will propose such a more-fundamental-thing that I'm not aware of, though it would invite the same question."
[End of dialogue]

LessWrong 2.0 Reader

Archive

Recent comments