LessWrong 2.0 Reader

[link] List of Collective Intelligence Projects
Chipmonk · 2024-07-02T14:10:41.789Z · comments (8)

Economics Roundup #2
Zvi · 2024-07-02T12:40:05.908Z · comments (5)

Book Review: On the Edge: The Business
Zvi · 2024-09-25T12:20:06.230Z · comments (0)

[link] OpenAI appoints Retired U.S. Army General Paul M. Nakasone to Board of Directors
Joel Burget (joel-burget) · 2024-06-13T21:28:18.110Z · comments (10)

[link] Hyperreals in a Nutshell
Yudhister Kumar (randomwalks) · 2023-10-15T14:23:58.027Z · comments (27)

Humans aren't fleeb.
Charlie Steiner · 2024-01-24T05:31:46.929Z · comments (5)

Open Thread – Winter 2023/2024
habryka (habryka4) · 2023-12-04T22:59:49.957Z · comments (160)

Secondary Risk Markets
Vaniver · 2023-12-11T21:52:46.836Z · comments (4)

[Valence series] 4. Valence & Social Status (deprecated)
Steven Byrnes (steve2152) · 2023-12-15T14:24:41.040Z · comments (19)

ARENA 2.0 - Impact Report
CallumMcDougall (TheMcDouglas) · 2023-09-26T17:13:19.952Z · comments (5)

Dangers of Closed-Loop AI
Gordon Seidoh Worley (gworley) · 2024-03-22T23:52:22.010Z · comments (7)

[question] What is an "anti-Occamian prior"?
Zane · 2023-10-23T02:26:10.851Z · answers+comments (22)

Predictive model agents are sort of corrigible
Raymond D · 2024-01-05T14:05:03.037Z · comments (6)

Proposal for improving the global online discourse through personalised comment ordering on all websites
Roman Leventov · 2023-12-06T18:51:37.645Z · comments (21)

List of strategies for mitigating deceptive alignment
joshc (joshua-clymer) · 2023-12-02T05:56:50.867Z · comments (2)

A sketch of acausal trade in practice
Richard_Ngo (ricraz) · 2024-02-04T00:32:54.622Z · comments (4)

[link] AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks
aogara (Aidan O'Gara) · 2023-10-31T19:34:54.837Z · comments (1)

Introduce a Speed Maximum
jefftk (jkaufman) · 2024-01-11T02:50:04.284Z · comments (28)

Protocol evaluations: good analogies vs control
Fabien Roger (Fabien) · 2024-02-19T18:00:09.794Z · comments (10)

My Detailed Notes & Commentary from Secular Solstice
Jeffrey Heninger (jeffrey-heninger) · 2024-03-23T18:48:51.894Z · comments (16)

'Theories of Values' and 'Theories of Agents': confusions, musings and desiderata
Mateusz Bagiński (mateusz-baginski) · 2023-11-15T16:00:48.926Z · comments (8)

[link] On Fables and Nuanced Charts
Niko_McCarty (niko-2) · 2024-09-08T17:09:07.503Z · comments (2)

[link] Twitter thread on politics of AI safety
Richard_Ngo (ricraz) · 2024-07-31T00:00:34.298Z · comments (2)

The "context window" analogy for human minds
Ruby · 2024-02-13T19:29:10.387Z · comments (0)

Open consultancy: Letting untrusted AIs choose what answer to argue for
Fabien Roger (Fabien) · 2024-03-12T20:38:03.785Z · comments (5)

How I select alignment research projects
Ethan Perez (ethan-perez) · 2024-04-10T04:33:08.092Z · comments (4)

How predictive processing solved my wrist pain
max_shen (makoshen) · 2024-07-04T01:56:20.162Z · comments (8)

What Helped Me - Kale, Blood, CPAP, X-tiamine, Methylphenidate
Johannes C. Mayer (johannes-c-mayer) · 2024-01-03T13:22:11.700Z · comments (12)

Forecasting AI (Overview)
jsteinhardt · 2023-11-16T19:00:04.218Z · comments (0)

Empirical vs. Mathematical Joints of Nature
Elizabeth (pktechgirl) · 2024-06-26T01:55:22.858Z · comments (1)

[link] Twitter thread on AI takeover scenarios
Richard_Ngo (ricraz) · 2024-07-31T00:24:33.866Z · comments (0)

Open Problems in AIXI Agent Foundations
Cole Wyeth (Amyr) · 2024-09-12T15:38:59.007Z · comments (2)

[link] The last era of human mistakes
owencb · 2024-07-24T09:58:42.116Z · comments (2)

AI Impacts Survey: December 2023 Edition
Zvi · 2024-01-05T14:40:06.156Z · comments (6)

Finding the Wisdom to Build Safe AI
Gordon Seidoh Worley (gworley) · 2024-07-04T19:04:16.089Z · comments (10)

[link] legged robot scaling laws
bhauth · 2024-01-20T05:45:56.632Z · comments (8)

[link] Suffering Is Not Pain
jbkjr · 2024-06-18T18:04:43.407Z · comments (45)

AXRP Episode 33 - RLHF Problems with Scott Emmons
DanielFilan · 2024-06-12T03:30:05.747Z · comments (0)

Copyright Confrontation #1
Zvi · 2024-01-03T15:50:04.850Z · comments (7)

Augmenting Statistical Models with Natural Language Parameters
jsteinhardt · 2024-09-20T18:30:10.816Z · comments (0)

Intransitive Trust
Screwtape · 2024-05-27T16:55:29.294Z · comments (15)

Reflective consistency, randomized decisions, and the dangers of unrealistic thought experiments
Radford Neal · 2023-12-07T03:33:16.149Z · comments (25)

Glitch Token Catalog - (Almost) a Full Clear
Lao Mein (derpherpize) · 2024-09-21T12:22:16.403Z · comments (3)

Doomsday Argument and the False Dilemma of Anthropic Reasoning
Ape in the coat · 2024-07-05T05:38:39.428Z · comments (55)

If You Can Climb Up, You Can Climb Down
jefftk (jkaufman) · 2024-07-30T00:00:06.295Z · comments (9)

[link] hydrogen tube transport
bhauth · 2024-04-18T22:47:08.790Z · comments (12)

How to develop a photographic memory 1/3
PhilosophicalSoul (LiamLaw) · 2023-12-28T13:26:36.669Z · comments (6)

Linear encoding of character-level information in GPT-J token embeddings
mwatkins · 2023-11-10T22:19:14.654Z · comments (4)

AI #56: Blackwell That Ends Well
Zvi · 2024-03-21T12:10:05.412Z · comments (16)

Motivating Alignment of LLM-Powered Agents: Easy for AGI, Hard for ASI?
RogerDearnaley (roger-d-1) · 2024-01-11T12:56:29.672Z · comments (4)


Recent comments

gwern on Is cybercrime really costing trillions per year?

But that's a bit like saying the cost of other crime includes all spending on the criminal and civil justice system, all spending on private security and surveillance by individuals and businesses, the entire salary of every cashier (since they wouldn't be needed if people would just count up their own purchases and leave payment), and every time someone doesn't do something because they don't want to go out wandering by themselves at 3am. Not actually a useful metric for deciding where it's worthwhile to increase or decrease resource allocations or to make regulatory decisions.

That sounds obviously correct and in fact a useful metric which is how you ought to be deciding how much to invest in policing: including the negative externalities and the nice high-trust-society things we could have if there was less crime. Why would you not include those?

jbash on The Existential Dread of Being a Powerful AI System

Why would you think that an AGI/ASI, even if conscious, would have an emotional makeup or motivational structure in any way similar to that of a human? Why should it care about any of this?

viliam on A Path out of Insufficient Views

Thank you for your interesting personal story!

(And more "meta" is better at coordinating more people, so you would expect a trend toward more "meta" or more "general" views over time becoming more dominant. Protestantism was more "meta-coordinated" than Catholicism. Science is pretty meta in this way. Dataism is an even more meta subset of "science".)

Not sure what you mean by more "meta" here. Like, people like to create tribes based on shared beliefs, so having some beliefs is better than having none (because then you cannot create a tribe), but having more general beliefs is better than having more specific ones (because each arbitrary belief can make some people object to it)?

So the best belief system is kinda like the smallest number that is still greater than zero... but there is no such thing; there is only the unending process of approaching the zero from above? (But you can never jump to the zero exactly, because then people would notice that they have literally nothing to coordinate their tribe about?)

In such a situation, I think the one weird trick would be to invent a belief system that actively denies being one. To teach people a dogma that would (among other things) insist that there is no dogma, you just see the reality as it is (unlike all the other people, who merely see their dogmas). To invent rituals that consist (among other things) of telling yourself repeatedly that you have no rituals (unlike all the other people). To have leaders that deny being leaders (and yet they are surrounded by followers who obey them, but hey that's just how reality is).

So, basically... science.

But of course, people will soon notice that your supposed non-belief non-system often behaves suspiciously similarly to other belief systems, despite all the explicit denial. And they will keep hoping for a better system, which would teach them that there is no dogma, activities that would give them the feeling of certainty that they are following no rituals, and high-status people who would tell them to follow no leaders.

And maybe there is no end, only more iterations of the same. Because the more people around you join the currently popular non-belief non-system, the more obvious its nature as a belief system becomes. You notice how they keep saying the same non-dogmatic statements, performing the same non-rituals, and following the same non-leaders. Once you see it, you cannot unsee it, so you need to move further...

steve2152 on [Intuitive self-models] 2. Conscious Awareness

The color phi phenomenon doesn't work for me or anyone I've asked

Were you using this demo? If so, I set the times to 1000,30,60, demagnified as much as possible, and then stood 20 feet away from my computer to “demagnify” even more. I might have also moved the dots a bit closer. I think I got some motion illusion?

I’m skeptical of the hypothesis that the color phi phenomenon is just BS. It doesn’t seem like that kind of psych result. I think it’s more likely that this applet is terribly designed.

wyatt-s on Hammertime Day 9: Time Calibration

This is more reverse planning fallacy than planning fallacy, but I thought it was important to mention. I remember most of my incidents of planning fallacy, like college admissions, and making a posterboard for my student group, but can't tell you exactly how bad they were.

wyatt-s on Hammertime Day 9: Time Calibration

One of my worst errors was with my morning routine, which includes brushing my hair, shaving, brushing my teeth, applying deodorant, and using topical acne treatment. I thought this took somewhere around 20 minutes, but it was only about 6 for each of the individual tasks involved.

sherrinford on Sherrinford's Shortform

Sorry, but where/how would I do that?

steve2152 on [Intuitive self-models] 2. Conscious Awareness

Take any intuitive notion X, where people’s intuitions are generally a bit incoherent or poorly-thought-through—stream of consciousness, free will, divine grace, the voice in my head, etc.

(A) One thing you can say is: “X, when properly understood, is coherent, and here’s how to properly understand it …”
(B) Another you can say is: “X, as commonly understood by the average person, is incoherent, but hey let me tell you about these closely related concepts which are coherent and which rescue some or all of those intuitions about X that you find compelling …”

Fundamentally, neither of these strategies is right or wrong. You say tomato, I say to-mah-to. :)

This is one of many causes of those annoying debates that go around in circles, that I’m trying to declare out-of-scope for this series, cf. §1.6.2. :)

For science terminology like “acceleration”, we take approach (A). People often have incoherent intuitions about acceleration, and when they do, we prompt them to discard their “wrong” intuitions, leaving the “real” acceleration concept.

For more everyday terminology, (A) versus (B) is more of a judgment call—for example, some physicalists say “‘God’ doesn’t exist”, others say “‘God’ is just the term for order and beauty in the universe” or whatever. As another example, I read Elbow Room recently, and Dennett’s revised preface says that he’s taken approach (A) to “free will” for his whole career, but now he’s thinking that maybe all along he should have taken the (B) path and said “free will (as commonly understood) doesn’t exist”.

Anyway, I feel like my post section is pointing out that people’s everyday poorly-thought-through intuitions about “stream of consciousness” are a bit incoherent, but I didn’t go further than that by advocating for either (A) or (B). Whereas your comment is advocating for the (A) path, where we prod people to update their intuitions about what happens when they try to remember what happened one second ago.

gwern on Abs-E (or, speak only in the positive)

This might be an interesting use of LLM rewrites: negative->positive rephrasing feels like something within GPT-4's capabilities, and it would let you quickly translate a large corpus to read & evaluate without putting in a huge amount of work to write a large varied corpus of Abs-E text yourself. (I dislike the current name 'Abs-E' and by analogy to E-Prime, suggest 'E⁺' - short for 'English-positive'.)

Took a first stab at just the positive rewrite: https://github.com/gwern/gwern.net/blob/master/build/text2epositive.py

A major issue is that GPT-4s want to rewrite into ChatGPTese and hypercorrect any 'error', even with instructions to preserve the style and a lot of examples showing style preservation. I greatly want to avoid ChatGPTese in my writing, so it leaking through anyway is a problem. Another LLM API might be better for this (Claude?) but I don't have tokens+scripts set up right this instant for alternatives.

That aside, I'm unimpressed right now with the generated rewrites. Working through various examples of negation is hard and yields ugly-sounding or too-strong assertions, and makes me think that often a negative assertion is the most informative (without being false) way to state something. There are negatives that should be reworded to positive, but fewer than I was hoping beforehand. (For example, the comment about the LW2 AWS hack reads better when rewritten into an affirmative positive form, definitely.)

At best, this seems like something to integrate into a grammar/style checker and only occasionally suggest a rewrite. And of course, if you only occasionally rewrite some text, the value is much lower than if you were rewriting every other sentence.

(A linter-style tool just flagging some text is also harder to integrate into my Emacs writing workflow: if it rewrites an entire block of text, that is relatively easy (simply pipe every paragraph into the script, and blindly replace it with the script output, which is even keyboardable: select region & C-u M-; tex<TAB>... to overwrite the region with the output of passing the script into a shell-command which tab-completes to that executable Python script), but to analyze it and just color or highlight troublesome negations? Hm. That's not something I've done in Emacs before...)
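
A minimal sketch of the linter-style idea, for illustration only: it flags likely negations and leaves the text untouched, so an editor could color the spans. The word list, `flag_negations`, and `highlight` are all hypothetical names chosen here, not taken from text2epositive.py, and a regex pass like this only handles straight apostrophes and will miss many negative constructions.

```python
import re

# Hypothetical word list (an assumption for illustration, not from the linked script).
NEGATIONS = ["not", "no", "never", "none", "nothing", "cannot", "without"]

# Match a listed word, or any contraction ending in "n't" (straight apostrophe only).
PATTERN = re.compile(
    r"\b(?:" + "|".join(NEGATIONS) + r")\b|\b\w+n't\b",
    re.IGNORECASE,
)


def flag_negations(text):
    """Return (start, end, matched_text) for each negation; the text is not modified."""
    return [(m.start(), m.end(), m.group(0)) for m in PATTERN.finditer(text)]


def highlight(text, open_mark="[[", close_mark="]]"):
    """Wrap each flagged negation in markers that an editor could turn into a face/color."""
    return PATTERN.sub(lambda m: f"{open_mark}{m.group(0)}{close_mark}", text)


if __name__ == "__main__":
    print(highlight("I don't have no time"))
```

The character offsets from `flag_negations` are the piece an editor integration would need; `highlight` is just a quick way to eyeball the result in a terminal.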

christian-z-r on D&D.Sci: Whom Shall You Call? [Evaluation and Ruleset]

Thanks for a good one, where I finally could use a bunch of linear regressions. Steamrolling! (I was sure there would be some devious trap, but I guess sometimes the basics actually do work, which is how they became the basics)

One thing that perhaps would make it easier is if the web interactive could tell you directly whether or not your selection was the optimal one, and possibly how much higher your expected price was than the optimal price (I first plugged mine in, then had to double check with your table out here).

Anyway, greetings, and looking forward to seeing the next one. Will train on the older ones until then