LessWrong 2.0 Reader

AI #82: The Governor Ponders
Zvi · 2024-09-19T13:30:04.863Z · comments (8)

GPT-2030 and Catastrophic Drives: Four Vignettes
jsteinhardt · 2023-11-10T07:30:06.480Z · comments (5)

Apply to the Conceptual Boundaries Workshop for AI Safety
Chipmonk · 2023-11-27T21:04:59.037Z · comments (0)

The Shortest Path Between Scylla and Charybdis
Thane Ruthenis · 2023-12-18T20:08:34.995Z · comments (8)

Scenario Forecasting Workshop: Materials and Learnings
elifland · 2024-03-08T02:30:46.517Z · comments (3)

Goal-Completeness is like Turing-Completeness for AGI
Liron · 2023-12-19T18:12:29.947Z · comments (26)

Altman firing retaliation incoming?
trevor (TrevorWiesinger) · 2023-11-19T00:10:15.645Z · comments (23)

Transfer learning and generalization-qua-capability in Babbage and Davinci (or, why division is better than Spanish)
RP (Complex Bubble Tea) · 2024-02-09T07:00:45.825Z · comments (6)

AI #52: Oops
Zvi · 2024-02-22T21:50:07.393Z · comments (9)

Toy models of AI control for concentrated catastrophe prevention
Fabien Roger (Fabien) · 2024-02-06T01:38:19.865Z · comments (2)

[link] A starter guide for evals
Marius Hobbhahn (marius-hobbhahn) · 2024-01-08T18:24:23.913Z · comments (2)

Vipassana Meditation and Active Inference: A Framework for Understanding Suffering and its Cessation
Benjamin Sturgeon (benjamin-sturgeon) · 2024-03-21T12:32:22.475Z · comments (8)

n of m ring signatures
DanielFilan · 2023-12-04T20:00:06.580Z · comments (7)

Gemini 1.0
Zvi · 2023-12-07T14:40:05.243Z · comments (7)

On Overhangs and Technological Change
Roko · 2023-11-05T22:58:51.306Z · comments (19)

Bounty: Diverse hard tasks for LLM agents
Beth Barnes (beth-barnes) · 2023-12-17T01:04:05.460Z · comments (31)

Notes on control evaluations for safety cases
ryan_greenblatt · 2024-02-28T16:15:17.799Z · comments (0)

The Broken Screwdriver and other parables
bhauth · 2024-03-04T03:34:38.807Z · comments (1)

Job listing: Communications Generalist / Project Manager
Gretta Duleba (gretta-duleba) · 2023-11-06T20:21:03.721Z · comments (7)

[question] why did OpenAI employees sign
bhauth · 2023-11-27T05:21:28.612Z · answers+comments (23)

[link] Chapter 1 of How to Win Friends and Influence People
gull · 2024-01-28T00:32:52.865Z · comments (5)

They are made of repeating patterns
quetzal_rainbow · 2023-11-13T18:17:43.189Z · comments (4)

Wrong answer bias
lukehmiles (lcmgcd) · 2024-02-01T20:05:38.573Z · comments (24)

Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
Felix Hofstätter · 2023-11-08T11:37:43.997Z · comments (0)

Should rationalists be spiritual / Spirituality as overcoming delusion
Kaj_Sotala · 2024-03-25T16:48:08.397Z · comments (57)

Interoperable High Level Structures: Early Thoughts on Adjectives
johnswentworth · 2024-08-22T21:12:38.223Z · comments (1)

An issue with training schemers with supervised fine-tuning
Fabien Roger (Fabien) · 2024-06-27T15:37:56.020Z · comments (12)

AI #67: Brief Strange Trip
Zvi · 2024-06-06T18:50:03.514Z · comments (6)

AI #58: Stargate AGI
Zvi · 2024-04-04T13:10:06.342Z · comments (9)

[link] On scalable oversight with weak LLMs judging strong LLMs
zac_kenton (zkenton) · 2024-07-08T08:59:58.523Z · comments (18)

The Dunning-Kruger of disproving Dunning-Kruger
kromem · 2024-05-16T10:11:33.108Z · comments (0)

[link] DM Parenting
Shoshannah Tekofsky (DarkSym) · 2024-07-16T08:50:08.144Z · comments (4)

[LDSL#0] Some epistemological conundrums
tailcalled · 2024-08-07T19:52:55.688Z · comments (10)

[link] in defense of Linus Pauling
bhauth · 2024-06-03T21:27:43.962Z · comments (8)

Sherlockian Abduction Master List
Cole Wyeth (Amyr) · 2024-07-11T20:27:00.000Z · comments (63)

[link] Anthropic announces interpretability advances. How much does this advance alignment?
Seth Herd · 2024-05-21T22:30:52.638Z · comments (4)

Book Review: Righteous Victims - A History of the Zionist-Arab Conflict
Yair Halberstadt (yair-halberstadt) · 2024-06-24T11:02:03.490Z · comments (8)

So you want to work on technical AI safety
gw · 2024-06-24T14:29:57.481Z · comments (3)

Low Probability Estimation in Language Models
Gabriel Wu (gabriel-wu) · 2024-10-18T15:50:05.947Z · comments (0)

[link] Book review: Xenosystems
jessicata (jessica.liu.taylor) · 2024-09-16T20:17:56.670Z · comments (18)

Interested in Cognitive Bootcamp?
Raemon · 2024-09-19T22:12:13.348Z · comments (0)

Seeking Collaborators
abramdemski · 2024-11-01T17:13:36.162Z · comments (11)

[question] If I wanted to spend WAY more on AI, what would I spend it on?
Logan Zoellner (logan-zoellner) · 2024-09-15T21:24:46.742Z · answers+comments (16)

[question] Could orcas be (trained to be) smarter than humans? 
Towards_Keeperhood (Simon Skade) · 2024-11-04T23:29:26.677Z · answers+comments (5)

AI and the Technological Richter Scale
Zvi · 2024-09-04T14:00:08.625Z · comments (8)

The Fragility of Life Hypothesis and the Evolution of Cooperation
KristianRonn · 2024-09-04T21:04:49.878Z · comments (6)

Evaluating the truth of statements in a world of ambiguous language.
Hastings (hastings-greer) · 2024-10-07T18:08:09.920Z · comments (19)

On the lethality of biased human reward ratings
Eli Tyre (elityre) · 2023-11-17T18:59:02.303Z · comments (10)

“Why can’t you just turn it off?”
Roko · 2023-11-19T14:46:18.427Z · comments (25)

Highlights from Lex Fridman’s interview of Yann LeCun
Joel Burget (joel-burget) · 2024-03-13T20:58:13.052Z · comments (15)


Recent comments

adam_scholl on adam_scholl's Shortform

It seems the pro-Trump Polymarket whale may have had a real edge after all. Wall Street Journal reports (paywalled link, screenshot) that he’s a former professional trader, who commissioned his own polls from a major polling firm using an alternate methodology—the neighbor method, i.e. asking respondents who they expect their neighbors will vote for—he thought would be less biased by preference falsification.

I didn't bet against him, though I strongly considered it; feeling glad this morning that I didn't.

anthony-digiovanni on Winning isn't enough

Without a clear definition of "winning,"

This is part of the problem we're pointing out in the post. We've encountered claims of this "winning" flavor that haven't been made precise, so we survey different things "winning" could mean more precisely, and argue that they're inadequate for figuring out which norms of rationality to adopt.

ricraz on Against Almost Every Theory of Impact of Interpretability

The former can be sufficient—e.g. there are good theoretical researchers who have never done empirical work themselves.

In hindsight I think "close conjunction" was too strong—it's more about picking up the ontologies and key insights from empirical work, which can be possible without following it very closely.

thomas-kwa on How to put California and Texas on the campaign trail!

I think it would be better to form a big winner-take-all bloc. With proportional voting, the number of electoral votes at stake will be only a small fraction of the total, so the per-voter influence of CA and TX would probably remain below the national average.

shankar-sivarajan on How to put California and Texas on the campaign trail!

I don't see any reason to structure this agreement as an open-ended compact other states can join instead of a bilateral agreement between just California and Texas as proposed.

(The same reasoning applied to the National Popular Vote Interstate Compact would have its membership closed as soon as they reach a majority in electoral votes, and then completely disregard the votes of any state that didn't sign on, voting in whoever gets the most votes in member states.)

johnswentworth on Some Rules for an Algebra of Bayes Nets

Proof that the quoted bookkeeping rule works, for the exact case:

The original DAG asserts $P[X] = \prod_{i} P[X_{i} \mid X_{\mathrm{pa}^{G}(i)}]$.
If $G'$ just adds an edge from $j$ to $k$, then $G'$ says $P[X] = P[X_{k} \mid X_{\mathrm{pa}^{G}(k)}, X_{j}] \prod_{i \neq k} P[X_{i} \mid X_{\mathrm{pa}^{G}(i)}]$.
The original DAG's assertion $P[X] = \prod_{i} P[X_{i} \mid X_{\mathrm{pa}^{G}(i)}]$ also implies $P[X_{k} \mid X_{\mathrm{pa}^{G}(k)}, X_{j}] = P[X_{k} \mid X_{\mathrm{pa}^{G}(k)}]$, and therefore implies $G'$'s assertion $P[X] = P[X_{k} \mid X_{\mathrm{pa}^{G}(k)}, X_{j}] \prod_{i \neq k} P[X_{i} \mid X_{\mathrm{pa}^{G}(i)}]$.

The approximate case then follows by the new-and-improved Bookkeeping Theorem.

Not sure where the disconnect/confusion is.
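For readers who want to check the exact case numerically, here is a minimal sketch in Python. It assumes a three-variable binary chain $X_0 \to X_1 \to X_2$ as the original DAG $G$, with $G'$ adding the edge $0 \to 2$ (so $j = 0$, $k = 2$). The probability tables and all function names are hypothetical, chosen only for illustration; they do not come from the post.

```python
from itertools import product

# Hypothetical conditional probability tables for the chain X0 -> X1 -> X2.
p0 = {0: 0.3, 1: 0.7}
p1_given_0 = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.2, 1: 0.8}}    # indexed [x0][x1]
p2_given_1 = {0: {0: 0.6, 1: 0.4}, 1: {0: 0.25, 1: 0.75}}  # indexed [x1][x2]


def joint(x0, x1, x2):
    """Joint distribution defined by the original DAG's factorization."""
    return p0[x0] * p1_given_0[x0][x1] * p2_given_1[x1][x2]


def marginal(fixed):
    """Sum the joint over every assignment consistent with {index: value}."""
    total = 0.0
    for xs in product((0, 1), repeat=3):
        if all(xs[i] == v for i, v in fixed.items()):
            total += joint(*xs)
    return total


def cond_x2_given_x1_x0(x2, x1, x0):
    """P[X2 | X1, X0], computed from the joint rather than read off a table."""
    return marginal({0: x0, 1: x1, 2: x2}) / marginal({0: x0, 1: x1})


def max_factorization_gap():
    """Largest gap between the joint and the factorization asserted by G'."""
    gap = 0.0
    for x0, x1, x2 in product((0, 1), repeat=3):
        g_prime = p0[x0] * p1_given_0[x0][x1] * cond_x2_given_x1_x0(x2, x1, x0)
        gap = max(gap, abs(g_prime - joint(x0, x1, x2)))
    return gap
```

In this toy setup the extra parent drops out of the conditional, so `max_factorization_gap()` should be zero up to floating-point error. This only illustrates the exact case; it says nothing about the approximate case.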

douglas_knight on Is the Power Grid Sustainable?

You say solar is getting cheaper, but it is only the panels that are getting cheaper. They will continue to get even cheaper, but this is not relevant to retrofitting individual houses, where the cost is already dominated by labor. As the cost of labor dominates, economies of scale in labor will be more relevant.

douglas_knight on Is the Power Grid Sustainable?

To a first approximation, solar is legal for individual residences and illegal on a larger scale.

douglas_knight on Is the Power Grid Sustainable?

Maybe you could learn something by looking at the public filings, but you didn't look at them. By regulation, not by being public, it has to spend in proportion to its income, but whether it is spending on transmission or generation is a fiction dictated by the regulator. It may well be that its transmission operating costs are much lower than its price and that a change of prices would be viable without any improvement in efficiency. This is exactly how I would expect the company to set prices if it controlled the regulator: to extract as much money as possible on transmission to minimize competition. I don't know how corrupt the regulator is, but that ignorance is exactly my point.

anthonyc on How to cite LessWrong as an academic source?

Depending on the posts, I think you could argue they're comparable to one of those other source types I listed.