LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Categories of leadership on technical teams
benkuhn · 2024-07-22T04:50:04.071Z · comments (0)

My Detailed Notes & Commentary from Secular Solstice
Jeffrey Heninger (jeffrey-heninger) · 2024-03-23T18:48:51.894Z · comments (16)

Open consultancy: Letting untrusted AIs choose what answer to argue for
Fabien Roger (Fabien) · 2024-03-12T20:38:03.785Z · comments (5)

Dangers of Closed-Loop AI
Gordon Seidoh Worley (gworley) · 2024-03-22T23:52:22.010Z · comments (9)

[link] Twitter thread on politics of AI safety
Richard_Ngo (ricraz) · 2024-07-31T00:00:34.298Z · comments (2)

Alternative Cancer Care As Biohacking & Book Review: Surviving "Terminal" Cancer
DenizT · 2025-01-06T07:43:52.773Z · comments (6)

[link] Robin Hanson & Liron Shapira Debate AI X-Risk
Liron · 2024-07-08T21:45:40.609Z · comments (4)

AI #56: Blackwell That Ends Well
Zvi · 2024-03-21T12:10:05.412Z · comments (16)

If You Can Climb Up, You Can Climb Down
jefftk (jkaufman) · 2024-07-30T00:00:06.295Z · comments (9)

[link] GPT2, Five Years On
Joel Burget (joel-burget) · 2024-06-05T17:44:17.552Z · comments (0)

Childhood and Education Roundup #7
Zvi · 2024-12-09T13:10:05.588Z · comments (10)

What I Learned (Conclusion To "The Sense Of Physical Necessity")
LoganStrohl (BrienneYudkowsky) · 2024-03-20T21:24:37.464Z · comments (0)

[link] My Apartment Art Commission Process
jenn (pixx) · 2024-08-26T18:36:44.363Z · comments (4)

[link] Inferring the model dimension of API-protected LLMs
Ege Erdil (ege-erdil) · 2024-03-18T06:19:25.974Z · comments (3)

[link] Romae Industriae
Maxwell Tabarrok (maxwell-tabarrok) · 2024-07-19T13:03:31.536Z · comments (2)

ARENA4.0 Capstone: Hyperparameter tuning for MELBO + replication on Llama-3.2-1b-Instruct
25Hour (aaron-kaufman) · 2024-10-05T11:30:11.953Z · comments (2)

Basics of Handling Disagreements with People
Camille Berger (Camille Berger) · 2024-11-12T17:55:08.143Z · comments (4)

AXRP Episode 38.2 - Jesse Hoogland on Singular Learning Theory
DanielFilan · 2024-11-27T06:30:03.821Z · comments (0)

“Charity” as a conflationary alliance term
Jan_Kulveit · 2024-12-12T21:49:50.057Z · comments (2)

Augmenting Statistical Models with Natural Language Parameters
jsteinhardt · 2024-09-20T18:30:10.816Z · comments (0)

Flipping Out: The Cosmic Coinflip Thought Experiment Is Bad Philosophy
Joe Rogero · 2024-11-12T23:55:46.770Z · comments (17)

(Maybe) A Bag of Heuristics is All There Is & A Bag of Heuristics is All You Need
Sodium · 2024-10-03T19:11:58.032Z · comments (17)

Empathy/Systemizing Quotient is a poor/biased model for the autism/sex link
tailcalled · 2024-11-04T21:11:57.788Z · comments (0)

Algebraic Linguistics
abstractapplic · 2024-12-07T19:18:39.935Z · comments (27)

Intransitive Trust
Screwtape · 2024-05-27T16:55:29.294Z · comments (15)

AXRP Episode 33 - RLHF Problems with Scott Emmons
DanielFilan · 2024-06-12T03:30:05.747Z · comments (0)

D&D.Sci (Easy Mode): On The Construction Of Impossible Structures
abstractapplic · 2024-05-17T00:25:42.950Z · comments (12)

Musings on LLM Scale (Jul 2024)
Vladimir_Nesov · 2024-07-03T18:35:48.373Z · comments (0)

The Schumer Report on AI (RTFB)
Zvi · 2024-05-24T15:10:03.122Z · comments (3)

Writing experiments and the banana escape valve
Dmitry Vaintrob (dmitry-vaintrob) · 2025-01-23T13:11:24.215Z · comments (1)

Computational Mechanics Hackathon (June 1 & 2)
Adam Shai (adam-shai) · 2024-05-24T22:18:44.352Z · comments (5)

Theory of Change for AI Safety Camp
Linda Linsefors · 2025-01-22T22:07:10.664Z · comments (3)

Monthly Roundup #26: January 2025
Zvi · 2025-01-20T15:30:08.680Z · comments (15)

[link] hydrogen tube transport
bhauth · 2024-04-18T22:47:08.790Z · comments (12)

[link] Forecasting Frontier Language Model Agent Capabilities
Govind Pimpale (govind-pimpale) · 2025-02-24T16:51:32.022Z · comments (0)

Geometric Utilitarianism (And Why It Matters)
StrivingForLegibility · 2024-05-12T03:41:21.342Z · comments (2)

Renormalization Redux: QFT Techniques for AI Interpretability
Lauren Greenspan (LaurenGreenspan) · 2025-01-18T03:54:28.652Z · comments (12)

AI Safety Strategies Landscape
Charbel-Raphaël (charbel-raphael-segerie) · 2024-05-09T17:33:45.853Z · comments (1)

[link] Suffering Is Not Pain
jbkjr · 2024-06-18T18:04:43.407Z · comments (45)

Reasons-based choice and cluelessness
JesseClifton · 2025-02-07T22:21:47.232Z · comments (0)

Training AI to do alignment research we don’t already know how to do
joshc (joshua-clymer) · 2025-02-24T19:19:43.067Z · comments (23)

[link] AI Safety Memes Wiki
plex (ete) · 2024-07-24T18:53:04.977Z · comments (1)

[link] The last era of human mistakes
owencb · 2024-07-24T09:58:42.116Z · comments (2)

[link] The Cancer Resolution?
PeterMcCluskey · 2024-07-24T00:25:17.322Z · comments (27)

AI #63: Introducing Alpha Fold 3
Zvi · 2024-05-09T14:20:03.176Z · comments (2)

One way violinists fail
Solenoid_Entity · 2024-05-29T04:08:17.675Z · comments (5)

UDT1.01: Logical Inductors and Implicit Beliefs (5/10)
Diffractor · 2024-04-18T08:39:13.368Z · comments (2)

[link] Meta: Frontier AI Framework
Zach Stein-Perlman · 2025-02-03T22:00:17.103Z · comments (2)

The Monster in Our Heads
testingthewaters · 2025-01-19T23:58:11.251Z · comments (4)

Monthly Roundup #20: July 2024
Zvi · 2024-07-23T12:50:07.991Z · comments (9)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

james-camacho on Economics Roundup #5

Dan Neidle: The 20,000% spike at £100,000 is absolutely not a joke – someone earning £99,999.99 with two children under three in London will lose an immediate £20k if they earn a penny more. The practical effect is clearer if we plot gross vs net income.

Can't it actually be good to encourage people to not work? I'd imagine if everyone in the United Kingdom worked half the number of hours, salaries wouldn't decrease very much. Their society, as a whole, doesn't need to work so many hours to maintain the quality of life, they only individually need to because they drive each others' wages down.

mis-understandings on The Useful Idea of Truth

If the above is true, aren't the postmodernists right? Isn't all this talk of 'truth' just an attempt to assert the privilege of your own beliefs over others, when there's nothing that can actually compare a belief to reality itself, outside of anyone's head?

No, we are talking about personal epistemology. That is, if we cannot compare a belief to reality, you cannot also compare it to some other persons beliefs (there is reality at least in between). We want truth to be a way of privledging some of our beliefs over other beliefs, in a way so that we can functionalize an epistemology.

For this, what we want are beliefs where our expected value for the difference between the believed results and the actual results (that is our loss in ML terms (retroactivly)) and expected informaiton magintutde, bayesian, are small, are more true, which propogates according to our update rule (that is, our "logical system")

cole-wyeth on What does it mean to apply decision theory?

We don't want to talk about partial rationality; we want notions of rationality which bounded agents can fully satisfy.

Why expect this kind of thing to exist? It seems to me that the ideas of computational boundedness and optimality are naturally in tension.

raemon on AI Rapidly Gets Smarter, And Makes Some of Us Dumber"

Yep, thank you!

evan_gaensbauer on AI Rapidly Gets Smarter, And Makes Some of Us Dumber"

I've now summarized those details as they were presented in the video. 'Staying more grounded in how bad it is' with more precision would require you or whoever learning more about these developments from the respective companies on your own, though the summaries I've now provided can hopefully serve as a starting point for doing so.

npostavs on The non-tribal tribes

You say this:

If you’re thinking, “Wait no, I’m pretty sure my group is fundamentally about X, which is fundamentally good,” then you’re probably still in Red or Blue.

But you also say this:

First, the Grey tribe is about something, [...] things that people already think are good in themselves.

Doesn't the first statement completely undermine the second one?

annapurna on Osaka

Seems the perfect post to link one of the best blog posts I have ever read on the internet: How Japanese zoning works.

https://urbankchoze.blogspot.com/2014/04/japanese-zoning.html

lorec on Lorec's Shortform

[ Look at those same authors with some other mention-counting tool, you mean? ]

owain_evans on Lorec's Shortform

I'm still interested in this question! Someone could look at the sources I discuss in my tweet and see if this is real. https://x.com/OwainEvans_UK/status/1869357399108198489

lorec on Lorec's Shortform

Interesting question: why are people quickly becoming less interested in previous standards?