LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-09-19T16:13:55.835Z · comments (1)

[question] What should OpenAI do that it hasn't already done, to stop their vacancies from being advertised on the 80k Job Board?
WitheringWeights (EZ97) · 2024-10-21T13:57:30.934Z · answers+comments (0)

[question] What is the alpha in one bit of evidence?
J Bostock (Jemist) · 2024-10-22T21:57:09.056Z · answers+comments (13)

Lab governance reading list
Zach Stein-Perlman · 2024-10-25T18:00:28.346Z · comments (3)

Gell-Mann checks
Cleo Scrolls (cleo-scrolls) · 2024-09-26T22:45:43.569Z · comments (7)

Filled Cupcakes
jefftk (jkaufman) · 2024-11-26T03:20:08.504Z · comments (1)

Gwerns
Tomás B. (Bjartur Tómas) · 2024-11-16T14:31:57.791Z · comments (2)

Text Posts from the Kids Group: 2018
jefftk (jkaufman) · 2024-11-23T12:50:05.325Z · comments (0)

How Often Does Taking Away Options Help?
niplav · 2024-09-21T21:52:40.822Z · comments (6)

[link] Towards the Operationalization of Philosophy & Wisdom
Thane Ruthenis · 2024-10-28T19:45:07.571Z · comments (2)

My decomposition of the alignment problem
Daniel C (harper-owen) · 2024-09-02T00:21:08.359Z · comments (22)

[link] Does natural selection favor AIs over humans?
cdkg · 2024-10-03T18:47:43.517Z · comments (1)

[question] Programmers, How Bad Is It out There?
Tomás B. (Bjartur Tómas) · 2024-11-20T00:57:16.802Z · answers+comments (4)

[link] Anthropic is being sued for copying books to train Claude
Remmelt (remmelt-ellen) · 2024-08-31T02:57:27.092Z · comments (4)

The Queen’s Dilemma: A Paradox of Control
Daniel Murfet (dmurfet) · 2024-11-27T10:40:14.346Z · comments (11)

[link] Compression Moves for Prediction
adamShimi · 2024-09-14T17:51:12.004Z · comments (0)

AI Can be “Gradient Aware” Without Doing Gradient hacking.
Sodium · 2024-10-20T21:02:10.754Z · comments (0)

Musings on Text Data Wall (Oct 2024)
Vladimir_Nesov · 2024-10-05T19:00:21.286Z · comments (2)

[link] Mechanistic Interpretability of Llama 3.2 with Sparse Autoencoders
PaulPauls · 2024-11-24T05:45:20.124Z · comments (2)

A necessary Membrane formalism feature
ThomasCederborg · 2024-09-10T21:33:09.508Z · comments (6)

Simon DeDeo on Explore vs Exploit in Science
Elizabeth (pktechgirl) · 2024-09-10T03:40:08.311Z · comments (0)

[link] AI Model Registries: A Foundational Tool for AI Governance
Elliot Mckernon (elliot) · 2024-10-07T19:27:43.466Z · comments (1)

[link] Chess As The Model Game
criticalpoints · 2024-11-17T19:45:26.499Z · comments (0)

Why Reflective Stability is Important
Johannes C. Mayer (johannes-c-mayer) · 2024-09-05T15:28:19.913Z · comments (2)

[link] To Be Born in a Bag
Niko_McCarty (niko-2) · 2024-10-06T17:21:00.605Z · comments (1)

[link] Fragile, Robust, and Antifragile Preference Satisfaction
adamShimi · 2024-11-02T17:25:55.986Z · comments (0)

Announcing the PIBBSS Symposium '24!
DusanDNesic · 2024-09-03T11:19:47.568Z · comments (0)

Economics Roundup #4
Zvi · 2024-10-15T13:20:06.923Z · comments (4)

[link] Update on the Mysterious Trump Buyers on Polymarket
Annapurna (jorge-velez) · 2024-11-04T19:22:06.540Z · comments (9)

Review: “The Case Against Reality”
David Gross (David_Gross) · 2024-10-29T13:13:29.643Z · comments (9)

How likely is brain preservation to work?
Andy_McKenzie · 2024-11-18T16:58:54.632Z · comments (3)

Why I'm bearish on mechanistic interpretability: the shards are not in the network
tailcalled · 2024-09-13T17:09:25.407Z · comments (40)

D/acc AI Security Salon
Allison Duettmann (allison-duettmann) · 2024-10-19T22:17:57.067Z · comments (0)

Bridging the VLM and mech interp communities for multimodal interpretability
Sonia Joseph (redhat) · 2024-10-28T14:41:41.969Z · comments (5)

Word Spaghetti
Gordon Seidoh Worley (gworley) · 2024-10-23T05:39:20.105Z · comments (9)

A few questions about recent developments in EA
Peter Berggren (peter-berggren) · 2024-11-23T02:36:25.728Z · comments (12)

[link] AI & Liability Ideathon
Kabir Kumar (kabir-kumar) · 2024-11-26T13:54:01.820Z · comments (2)

[link] Should Sports Betting Be Banned?
Maxwell Tabarrok (maxwell-tabarrok) · 2024-09-21T14:13:35.404Z · comments (2)

In the Name of All That Needs Saving
pleiotroth · 2024-11-07T15:26:12.252Z · comments (2)

Can Large Language Models effectively identify cybersecurity risks?
emile delcourt (emile-delcourt) · 2024-08-30T20:20:21.345Z · comments (0)

Avoiding the Bog of Moral Hazard for AI
Nathan Helm-Burger (nathan-helm-burger) · 2024-09-13T21:24:34.137Z · comments (12)

"Real AGI"
Seth Herd · 2024-09-13T14:13:24.124Z · comments (20)

Advisors for Smaller Major Donors?
jefftk (jkaufman) · 2024-11-06T14:30:06.187Z · comments (2)

[link] Jonothan Gorard:The territory is isomorphic to an equivalence class of its maps
Daniel C (harper-owen) · 2024-09-07T10:04:47.840Z · comments (18)

[question] Is this voting system strategy proof?
Donald Hobson (donald-hobson) · 2024-09-06T20:44:46.691Z · answers+comments (9)

[link] Why Swiss watches and Taylor Swift are AGI-proof
Kevin Kohler (KevinKohler) · 2024-09-05T13:23:27.033Z · comments (11)

[link] Four Levels of Voting Methods
hive · 2024-09-26T18:15:00.565Z · comments (3)

Proposal to increase fertility: University parent clubs
Fluffnutt (Pear) · 2024-11-18T04:21:26.346Z · comments (3)

Automating LLM Auditing with Developmental Interpretability
htlou · 2024-09-04T15:50:04.337Z · comments (0)

[link] Instruction Following without Instruction Tuning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-09-24T13:49:09.078Z · comments (0)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

shardphoenix on Bogdan Ionut Cirstea's Shortform

I think they meant that as an analogy to how developed/sophisticated it was (ie they're saying that it's still early days for reasoning models and to expect rapid improvement), not that the underlying model size is similar.

mrtreasure on Bogdan Ionut Cirstea's Shortform

There have been comments from OAI staff that o1 is "GPT-2 level" so I wonder if it's a similar size?

benito on Lighthaven Sequences Reading Group #12 (Tuesday 11/26)

I've updated future posts to have start time at 6:30 and doors open at 6pm.

benito on Repeal the Jones Act of 1920

Well that escalated quickly (at the very end).

czynski on Lighthaven Sequences Reading Group #12 (Tuesday 11/26)

That was true this week, but the first time I attended (the 12th) I believe it wasn't, I arrived at what I think was 6:20-6:25 and found everything had already started.

benito on Repeal the Jones Act of 1920

cabotage

I assumed this was a typo for 'sabotage' the first time I saw it. For those wondering, here's a definition from google.

restriction of the operation of sea, air, or other transport services within or into a particular country to that country's own transport services.

bhauth on a space habitat design

As a "physicist and dabbler in writing fantasy/science fiction" I assume you took the 10 seconds to do the calculation and found that a 1km radius cylinder would have ~100 kW of losses per person from roller bearings supporting it, for the mass per person of the ISS. But I guess I don't understand how you expect to generate that power or dissipate that heat.

benito on Repeal the Jones Act of 1920

By contrast, a report by the pro-Jones Act American Maritime Partnership claims ‘the Jones Act is responsible for’ 13,000 jobs and adding $3.3 billion to the economy, which means that is currently the value to Hawaii of all shipborne trade with America.

Noob question: is this supposed to be low or high? Or is this just a list of datapoints regardless of how they fall [LW · GW]?

feel_love on You are not too "irrational" to know your preferences.

I appreciate and largely agree with the content of this post, but question the framing. I would argue that "wanting" and "preferring" are not useful behaviors in the first place.

Noticing that eating ice cream tends to be a sequence of pleasant sensations is to discover one's personal taste for ice cream. The fact stands without any analysis of why that is the case. I agree that this empirical observation is not a matter of rationality.

However, a decision to eat ice cream can be based on true or untrue beliefs, sound or unsound reasoning.

Perhaps I eat ice cream because I believe tasting its sweetness will cause me to become happier. But if I frequently taste the sweetness of ice cream and fail to become happier, I should notice that my decisions have been based on a world model that leads to poor predictions. Failing to better inform future decisions by updating the model is irrational.

Alternatively, I might eat ice cream because I believe doing so will reduce feelings of hunger that distract me from important work. If eating ice cream indeed enables me to better perform the task at hand, maintaining this understanding is rational.

What about eating ice cream for no reason at all? This would be mindless, random behavior that does not correspond to any world model or process for updating beliefs. Rationality is not possible if actions are purely impulsive and independent of mental processes.

Finally, there's merely wanting as a potential basis of decision-making: "Why should I eat ice cream? Because I notice that I want to eat it, and I do whatever I want." This line of reasoning has nothing to do with ice cream. Rather, it supposes that wanting is somehow a useful process in itself. It is not.

Wanting is a superfluous mental activity that causes unnecessary suffering in the form of a feeling of lack, a debt between a hypothetical preferred experience and the actual experience of life. There is no need to be at odds with reality this way in order to decide and act.

Ice cream may have a pleasant taste, and that's fine. But wanting to experience that particular sensation is an extra, wasteful step of the mind that distracts from perceiving what is happening and places one's emotional state at the mercy of uncontrollable, ever-changing events that can never provide lasting satisfaction. The desired indulgence may never come, and if it does, it will surely end.

Wants and preferences are not things we possess; they are processes we do at an opportunity cost. The world responds lawfully to our actions, but it cares nothing for our wanting, preferring, or fantasizing in general.

benito on Repeal the Jones Act of 1920

Emergency Case Study: Salt Shipment to NJ in the Winter of 2013-2014
In Benefits of the Jones Act, this was used as an example of a problem that the paper claimed was falsely blamed on the Jones Act.
The defense says, look at all the salt, almost all of it, that was indeed delivered as ordered and on time.

Am I expected to read the entire linked book-chapter to find out what exactly happened in New Jersey in 2013? I feel like the post should have a brief summary here.