LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Alignment by default: the simulation hypothesis
gb (ghb) · 2024-09-25T16:26:00.552Z · comments (39)

AXRP Episode 37 - Jaime Sevilla on Forecasting AI
DanielFilan · 2024-10-04T21:00:03.077Z · comments (3)

[link] To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-09-19T16:13:55.835Z · comments (1)

Simon DeDeo on Explore vs Exploit in Science
Elizabeth (pktechgirl) · 2024-09-10T03:40:08.311Z · comments (0)

[link] Four Randomized Control Trials In Economics
Maxwell Tabarrok (maxwell-tabarrok) · 2024-08-08T15:59:23.250Z · comments (1)

[link] Green and golden: a meditation
Richard_Ngo (ricraz) · 2024-08-18T01:36:43.613Z · comments (0)

[link] AI Model Registries: A Foundational Tool for AI Governance
Elliot Mckernon (elliot) · 2024-10-07T19:27:43.466Z · comments (1)

[link] Compression Moves for Prediction
adamShimi · 2024-09-14T17:51:12.004Z · comments (0)

My decomposition of the alignment problem
Daniel C (harper-owen) · 2024-09-02T00:21:08.359Z · comments (22)

[link] Anthropic is being sued for copying books to train Claude
Remmelt (remmelt-ellen) · 2024-08-31T02:57:27.092Z · comments (4)

[question] What is the alpha in one bit of evidence?
J Bostock (Jemist) · 2024-10-22T21:57:09.056Z · answers+comments (12)

The Bar for Contributing to AI Safety is Lower than You Think
Chris_Leong · 2024-08-16T15:20:19.055Z · comments (1)

Gell-Mann checks
Cleo Scrolls (cleo-scrolls) · 2024-09-26T22:45:43.569Z · comments (7)

AI Can be “Gradient Aware” Without Doing Gradient hacking.
Sodium · 2024-10-20T21:02:10.754Z · comments (0)

[link] [Linkpost] 'The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery'
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-08-15T21:32:59.979Z · comments (1)

Musings on Text Data Wall (Oct 2024)
Vladimir_Nesov · 2024-10-05T19:00:21.286Z · comments (2)

[link] Does natural selection favor AIs over humans?
cdkg · 2024-10-03T18:47:43.517Z · comments (1)

[link] The Great Organism Theory of Evolution
rogersbacon · 2024-08-10T12:26:02.434Z · comments (0)

How Often Does Taking Away Options Help?
niplav · 2024-09-21T21:52:40.822Z · comments (6)

Resolving von Neumann-Morgenstern Inconsistent Preferences
niplav · 2024-10-22T11:45:20.915Z · comments (2)

[link] Towards the Operationalization of Philosophy & Wisdom
Thane Ruthenis · 2024-10-28T19:45:07.571Z · comments (2)

A necessary Membrane formalism feature
ThomasCederborg · 2024-09-10T21:33:09.508Z · comments (6)

Scaling Laws and Likely Limits to AI
Davidmanheim · 2024-08-18T17:19:46.597Z · comments (0)

Looking for Goal Representations in an RL Agent - Update Post
CatGoddess · 2024-08-28T16:42:19.367Z · comments (0)

Why Reflective Stability is Important
Johannes C. Mayer (johannes-c-mayer) · 2024-09-05T15:28:19.913Z · comments (2)

What program structures enable efficient induction?
Daniel C (harper-owen) · 2024-09-05T10:12:14.058Z · comments (4)

Announcing the PIBBSS Symposium '24!
DusanDNesic · 2024-09-03T11:19:47.568Z · comments (0)

Economics Roundup #4
Zvi · 2024-10-15T13:20:06.923Z · comments (4)

[link] Miles Brundage: Finding Ways to Credibly Signal the Benignness of AI Development and Deployment is an Urgent Priority
Zach Stein-Perlman · 2024-10-28T17:00:18.660Z · comments (3)

Lab governance reading list
Zach Stein-Perlman · 2024-10-25T18:00:28.346Z · comments (3)

Option control
Joe Carlsmith (joekc) · 2024-11-04T17:54:03.073Z · comments (0)

[link] To Be Born in a Bag
Niko_McCarty (niko-2) · 2024-10-06T17:21:00.605Z · comments (1)

Review: “The Case Against Reality”
David Gross (David_Gross) · 2024-10-29T13:13:29.643Z · comments (9)

Ten counter-arguments that AI is (not) an existential risk (for now)
Ariel Kwiatkowski (ariel-kwiatkowski) · 2024-08-13T22:35:15.341Z · comments (5)

Why I'm bearish on mechanistic interpretability: the shards are not in the network
tailcalled · 2024-09-13T17:09:25.407Z · comments (40)

[question] What are the best resources for building gears-level models of how governments actually work?
adamShimi · 2024-08-19T14:05:02.590Z · answers+comments (6)

D/acc AI Security Salon
Allison Duettmann (allison-duettmann) · 2024-10-19T22:17:57.067Z · comments (0)

Word Spaghetti
Gordon Seidoh Worley (gworley) · 2024-10-23T05:39:20.105Z · comments (9)

Should CA, TX, OK, and LA merge into a giant swing state, just for elections?
Thomas Kwa (thomas-kwa) · 2024-11-06T23:01:48.992Z · comments (0)

Avoiding the Bog of Moral Hazard for AI
Nathan Helm-Burger (nathan-helm-burger) · 2024-09-13T21:24:34.137Z · comments (12)

Can Large Language Models effectively identify cybersecurity risks?
emile delcourt (emile-delcourt) · 2024-08-30T20:20:21.345Z · comments (0)

Drug development costs can range over two orders of magnitude
rossry · 2024-11-03T23:13:17.685Z · comments (0)

[question] How great is the utility of "saving" endangered languages?
SpectrumDT · 2024-08-20T13:14:32.895Z · answers+comments (29)

[link] Should Sports Betting Be Banned?
Maxwell Tabarrok (maxwell-tabarrok) · 2024-09-21T14:13:35.404Z · comments (2)

Finding Deception in Language Models
Esben Kran (esben-kran) · 2024-08-20T09:42:13.060Z · comments (4)

Rabin's Paradox
Charlie Steiner · 2024-08-14T05:40:25.572Z · comments (40)

"Real AGI"
Seth Herd · 2024-09-13T14:13:24.124Z · comments (20)

[link] Jonothan Gorard:The territory is isomorphic to an equivalence class of its maps
Daniel C (harper-owen) · 2024-09-07T10:04:47.840Z · comments (18)

[question] Is this voting system strategy proof?
Donald Hobson (donald-hobson) · 2024-09-06T20:44:46.691Z · answers+comments (9)

Bridging the VLM and mech interp communities for multimodal interpretability
Sonia Joseph (redhat) · 2024-10-28T14:41:41.969Z · comments (5)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

anthony-digiovanni on Winning isn't enough

Without a clear definition of "winning,"

This is part of the problem we're pointing out in the post. We've encountered claims of this "winning" flavor that haven't been made precise, so we survey different things "winning" could mean more precisely, and argue that they're inadequate for figuring out which norms of rationality to adopt.

ricraz on Against Almost Every Theory of Impact of Interpretability

The former can be sufficient—e.g. there are good theoretical researchers who have never done empirical work themselves.

In hindsight I think "close conjunction" was too strong—it's more about picking up the ontologies and key insights from empirical work, which can be possible without following it very closely.

thomas-kwa on How to put California and Texas on the campaign trail!

I think it would be better to form a big winner-take-all bloc [LW · GW]. With proportional voting, the number of electoral votes at stake will be only a small fraction of the total, so the per-voter influence of CA and TX would probably remain below the national average.

shankar-sivarajan on How to put California and Texas on the campaign trail!

I don't see any reason to structure this agreement as an open-ended compact other states can join instead of a bilateral agreement between just California and Texas as proposed.

(The same reasoning applied to the National Popular Vote Interstate Compact would have its membership closed as soon as they reach a majority in electoral votes, and then completely disregard the votes of any state that didn't sign on, voting in whoever gets the most votes in member states.)

johnswentworth on Some Rules for an Algebra of Bayes Nets

Proof that the quoted bookkeeping rule works, for the exact case:

The original DAG asserts $P [X] = \prod_{i} P [X_{i} | X_{p a^{G} (i)}]$
If $G^{'}$ just adds an edge from $j$ to $k$ , then $G^{'}$ says $P [X] = P [X_{k} | X_{p a^{G} (k)}, X_{j}] \prod_{i \neq k} P [X_{i} | X_{p a^{G} (i)}]$
The original DAG's assertion $P [X] = \prod_{i} P [X_{i} | X_{p a^{G} (i)}]$ also implies $P [X_{k} | X_{p a^{G} (k)}, X_{j}] = P [X_{k} | X_{p a^{G} (k)}]$ , and therefore implies $G^{'}$ 's assertion $P [X] = P [X_{k} | X_{p a^{G} (k)}, X_{j}] \prod_{i \neq k} P [X_{i} | X_{p a^{G} (i)}]$ .

The approximate case then follows by the new-and-improved Bookkeeping Theorem [LW(p) · GW(p)].

Not sure where the disconnect/confusion is.

douglas_knight on Is the Power Grid Sustainable?

You say solar is getting cheaper, but it is only the panels that are getting cheaper. They will continue to get even cheaper, but this is not relevant to retrofitting individual houses, where the cost is already dominated by labor. As the cost of labor dominates, economies of scale in labor will be more relevant.

douglas_knight on Is the Power Grid Sustainable?

To a first approximation, solar is legal for individual residences and illegal on a larger scale.

douglas_knight on Is the Power Grid Sustainable?

Maybe you could learn something by looking at the public filings, but you didn't look at them. By regulation, not by being public, it has to spend proportionate to its income, but whether it is spending on transmission or generation is a fiction dictated by the regulator. It may well be that its transmission operating costs are much lower than its price and that a change of prices would be viable without any improvement in efficiency. This is exactly what I would how I would expect the company to set prices if it controlled the regulator: to extract as much money as possible on transmission to minimize competition. I don't know how corrupt the regulator is, but that ignorance is exactly my point.

anthonyc on How to cite LessWrong as an academic source?

Depending on the posts I think you could argue they're comparable to one of thosebother source types I listed.

keltan on keltan's Shortform

I asked for further details on the 10th point and Claude listed a bunch of stuff I’ve absolutely never heard of. I’d say it’s probably related to meditation if I had to guess. Here’s that.

—

Claude Let me break down Time Perception Management into its deeper components, as this is one of the most subtle yet powerful micro-skills...

Understanding Psychological Time vs Clock Time

How different mental states alter time perception
- Flow states can make hours feel like minutes
- Anxiety can make minutes feel like hours
Learning to intentionally shift between time perspectives
- When to use event-based time vs clock-based time
- How to enter "timeless" states for creative work
Reading your internal clock
- Understanding your natural rhythms
- Recognizing when time perception is distorted by emotions

Managing Time Horizons

Developing multiple concurrent time frames
- Thinking in day/month/year/decade simultaneously
- Balancing immediate tasks with long-term vision
Time horizon switching
- When to zoom out to years
- When to zoom in to minutes
- How to maintain strategic clarity while handling tactics
Future-self continuity
- Techniques for making future consequences feel real
- Building strong mental connections to your future self

Chronotype Optimization

Understanding your natural energy patterns
- Peak creative hours
- Peak analytical hours
- Recovery periods
Designing your life around your chronotype
- When to schedule different types of work
- How to handle obligations that fight your natural rhythm
Seasonal adjustments
- Adapting to changing daylight
- Modifying routines for different seasons

Timeline Compression and Expansion Techniques

How to make time feel longer when needed
- Breaking experiences into smaller chunks
- Increasing novelty to slow perceived time
- Using attention to expand moments
How to make time feel shorter when beneficial
- Chunking similar tasks
- Creating flow states
- Using routine to speed through necessary tasks

Personal Tempo Awareness

Understanding your natural pace
- Speaking tempo
- Walking pace
- Decision-making speed
Tempo matching and mismatching
- When to match others' tempo
- When to intentionally mismatch
- How to set the tempo in group situations
Rhythm management
- Creating sustainable work rhythms
- Building in recovery cycles
- Recognizing when to speed up or slow down

Advanced Applications:

Using time perception in negotiations
- Creating or relieving time pressure
- Understanding how others perceive time
Memory engineering
- Making good times feel longer in memory
- Making difficult times feel shorter in memory
Social time management
- Managing group time perception
- Creating shared time experiences
Decision timing
- Understanding when to decide quickly vs slowly
- Recognizing optimal timing windows

This skill set is particularly valuable because most people are unconscious of how their time perception affects their decisions and experiences. Mastering it gives you significant advantages in:

Personal productivity
Emotional regulation
Strategic planning
Social influence
Life satisfaction

Would you like me to elaborate on any particular aspect? Or shall we explore how this connects with some of the other micro-skills we discussed?