LessWrong 2.0 Reader

A Poem Is All You Need: Jailbreaking ChatGPT, Meta & More
Sharat Jacob Jacob (sharat-jacob-jacob) · 2024-10-29T12:41:30.337Z · comments (0)

Learn to Develop Your Advantage
ReverendBayes (vedernikov-andrei) · 2025-01-29T22:06:00.641Z · comments (0)

LLM Psychometrics and Prompt-Induced Psychopathy
Korbinian K. (korbinian-koch) · 2024-10-18T18:11:24.256Z · comments (2)

[link] Looking for humanness in the world wide social
Itay Dreyfus (itay-dreyfus) · 2025-01-15T14:50:54.966Z · comments (0)

Conversational Signposts—An Antidote to Dull Social Interactions
Declan Molony (declan-molony) · 2024-10-22T05:37:56.175Z · comments (6)

Substituting Talkbox for Breath Controller
jefftk (jkaufman) · 2024-10-27T19:10:03.768Z · comments (0)

[link] LLMs Do Not Think Step-by-step In Implicit Reasoning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-28T09:16:57.463Z · comments (0)

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward type
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-23T12:45:01.067Z · comments (0)

Rethinking Laplace's Rule of Succession
Cleo Nardo (strawberry calm) · 2024-11-22T18:46:25.156Z · comments (5)

What does success look like?
Raymond D · 2025-01-23T17:48:35.618Z · comments (0)

[question] Where should one post to get into the training data?
keltan · 2025-01-15T00:41:19.405Z · answers+comments (4)

Updating the NAO Simulator
jefftk (jkaufman) · 2024-10-30T13:50:06.908Z · comments (0)

[link] The Computational Complexity of Circuit Discovery for Inner Interpretability
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-10-17T13:18:46.378Z · comments (2)

[link] Progress links and short notes, 2024-12-27: Clinical trial abundance, grid-scale fusion, permitting vs. compliance, crossword mania, and more
jasoncrawford · 2024-12-27T23:34:43.807Z · comments (0)

[link] How to Do a PhD (in AI Safety)
Lewis Hammond (lewis-hammond-1) · 2025-01-05T16:57:35.409Z · comments (0)

7. Iterate the Game: Racing Where?
Allison Duettmann (allison-duettmann) · 2025-01-02T19:06:22.165Z · comments (0)

LDT (and everything else) can be irrational
Christopher King (christopher-king) · 2024-11-06T04:05:36.932Z · comments (7)

[link] Picking favourites is hard
dkl9 · 2024-12-04T20:46:47.470Z · comments (3)

Spooky Recommendation System Scaling
phdead · 2024-10-31T22:00:51.728Z · comments (0)

My Mental Model of AI Optimist Opinions
tailcalled · 2025-01-29T18:44:36.485Z · comments (2)

Apply now to SPAR!
agucova · 2024-12-19T22:29:58.963Z · comments (0)

[link] Uncontrollable: A Surprisingly Good Introduction to AI Risk
PeterMcCluskey · 2025-01-24T04:30:37.499Z · comments (0)

Last Line of Defense: Minimum Viable Shelters for Mirror Bacteria
Ulrik Horn (ulrik-horn) · 2024-12-21T08:28:14.860Z · comments (25)

Rethink Wellbeing’s Year 2 Update: Foster Sustainable High Performance for Ambitious Altruists
Inga G. (inga-g) · 2024-12-08T14:32:39.902Z · comments (1)

Alignment ideas
qbolec · 2025-01-18T12:43:49.384Z · comments (1)

Untrusted monitoring insights from watching ChatGPT play coordination games
jwfiredragon · 2025-01-29T04:53:33.125Z · comments (0)

[link] OpenAI’s cybersecurity is probably regulated by NIS Regulations
Adam Jones (domdomegg) · 2024-10-25T11:06:38.392Z · comments (2)

[link] My Mental Model of AI Creativity – Creativity Kiki
Adam Newgas (BorisTheBrave) · 2024-12-09T22:24:23.096Z · comments (0)

How I'd like alignment to get done (as of 2024-10-18)
TristanTrim · 2024-10-18T23:39:03.107Z · comments (4)

[link] Forecast With GiveWell
ChristianWilliams · 2024-12-11T17:52:32.293Z · comments (0)

[question] How counterfactual are logical counterfactuals?
Donald Hobson (donald-hobson) · 2024-12-15T21:16:40.515Z · answers+comments (10)

Launching Applications for the Global AI Safety Fellowship 2025!
Aditya_SK (team-ai-safety) · 2024-11-30T14:02:16.537Z · comments (5)

Contra Dances Getting Shorter and Earlier
jefftk (jkaufman) · 2025-01-23T23:30:03.595Z · comments (0)

[Cross-post] Welcome to the Essay Meta
davekasten · 2025-01-16T23:36:49.152Z · comments (2)

Panology
JenniferRM · 2024-12-23T21:40:14.540Z · comments (8)

Fundamental Uncertainty: Chapter 9 - How do we live with uncertainty?
Gordon Seidoh Worley (gworley) · 2024-11-07T18:15:45.049Z · comments (2)

[link] Anthropic - The case for targeted regulation
anaguma · 2024-11-05T07:07:48.174Z · comments (0)

[link] The Philosophical Glossary of AI
David Gross (David_Gross) · 2025-01-14T17:36:37.241Z · comments (0)

The Three Warnings of the Zentradi
Trevor Hill-Hand (Jadael) · 2024-11-21T20:28:45.567Z · comments (1)

Doing a self-randomized study of the impacts of glycine on sleep (Science is hard)
thedissonance.net · 2025-01-17T18:49:30.989Z · comments (5)

Do you need a better map of your myriad of maps to the territory?
CstineSublime · 2024-12-24T02:00:30.426Z · comments (2)

Orange and Strawberry Truffles
jefftk (jkaufman) · 2025-01-05T01:50:01.587Z · comments (1)

Low Temperature Solomonoff Induction
dil-leik-og (samuel-buteau) · 2024-12-06T18:55:08.948Z · comments (4)

Why We Wouldn't Build Aligned AI Even If We Could
Snowyiu · 2024-11-16T20:19:59.324Z · comments (7)

Fundamental Uncertainty: Epilogue
Gordon Seidoh Worley (gworley) · 2024-11-16T00:57:48.823Z · comments (0)

[link] Proposing the Conditional AI Safety Treaty (linkpost TIME)
otto.barten (otto-barten) · 2024-11-15T13:59:01.050Z · comments (8)

Proactive 'If-Then' Safety Cases
Nathan Helm-Burger (nathan-helm-burger) · 2024-11-18T21:16:37.237Z · comments (0)

Americans are fat and sick—and it’s their fault…right?
Declan Molony (declan-molony) · 2024-11-19T06:41:36.648Z · comments (6)

The Quantum Mars Teleporter: An Empirical Test Of Personal Identity Theories
avturchin · 2025-01-22T11:48:46.071Z · comments (18)

[question] Has Someone Checked The Cold-Water-In-Left-Ear Thing?
Maloew (maloew-valenar) · 2024-12-28T20:15:35.951Z · answers+comments (0)


Recent comments

alvin-anestrand on The Case Against AI Control Research

I can think of a few reasons someone might think AI Control research should receive very high priority, apart from what is mentioned in the post or in Buck's comment:

You hope/expect early transformative AI to be used for provable safety approaches, using formal verification methods.
You think AI control research is more tractable than other research agendas, or will have useful results faster, before they are too late to apply.
Our only chance of aligning a superintelligence is to delegate the problem to AIs, either because it is too hard for humans, or it will arrive sooner than the proper alignment techniques can feasibly be developed.
You expect a significant fraction of total AI safety research over all time to be done by early transformative AI, so control research has high leverage value in improving the probability of successfully getting the AI to do valuable safety research, even if slop is quite likely.

I agree with basically everything in the post but put enough probability on these points to think that control research has really high expected value anyway.

fdrocha on Hzn's Shortform

One quick observation about NVDA dividends that not many people might be aware of: NVDA pays a quarterly dividend of exactly one cent ($0.01) per share. They don't do this for the "usual" reason companies pay dividends (returning money to shareholders) but because by paying a non-zero dividend at all NVDA becomes part of dividend-paying company indexes and that means that ETFs that follow those indexes will buy NVDA shares. So they technically pay a dividend but for the purposes of valuation you should think of it as a non-dividend-paying stock.

Regarding the more general question of valuation, if you want to value a company based on how much they are currently distributing to shareholders you need to consider not only dividends but also share buybacks. Buybacks are effectively just a more tax-efficient form of paying dividends. I am not sure what the total numbers are for 2024, but in August for instance NVDA announced a $50 billion buyback.

And of course, the proper measure is not current distribution, but total expected discounted distributions over all time. That's hard to estimate, but for a company experiencing explosive growth it is surely higher than current distributions.
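As a rough illustration of why discounted distributions over all time can far exceed what current distributions suggest, here is a minimal sketch. The function name `present_value` and every number in it (the 8% discount rate, the 20% growth rate, the 30-year horizon) are hypothetical choices for illustration, not estimates for NVDA or any real company.

```python
def present_value(first_payment, growth, discount, years):
    """Sum yearly payments that grow at `growth`, each discounted at `discount`.

    All inputs are illustrative assumptions, not real company data.
    """
    total = 0.0
    payment = first_payment
    for year in range(1, years + 1):
        # A payment received `year` years from now is worth less today.
        total += payment / (1 + discount) ** year
        payment *= 1 + growth
    return total


# Same payment in year one, two different growth assumptions.
flat = present_value(1.0, growth=0.0, discount=0.08, years=30)
growing = present_value(1.0, growth=0.20, discount=0.08, years=30)
print(f"flat distributions:    {flat:.2f}")
print(f"growing distributions: {growing:.2f}")
```

Under these made-up inputs the growing stream is worth many times the flat one, even though both start from the same current distribution.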

mateusz-baginski on The Manhattan Trap: Why a Race to Artificial Superintelligence is Self-Defeating

It seems to me that by saying this the authors wanted to communicate "this is not a place to discuss this". But I agree that the phrasing used may inaccurately (?) communicate that the authors are more uncertain/agnostic about this issue than they really are (or that they believe something like "both sides have comparably good arguments"), so I'd suggest replacing it with something like:

The likelihood of loss of control is beyond the scope of this report (for discussion, see: [sources]).

ben-lang on Hzn's Shortform

I freely admit to not really understanding how shares are priced. To me it seems like the value of a share should be related to the expected dividend pay-out of that share over the remaining lifetime of the company, with a discount rate applied on pay-outs that are expected to happen further in the future (IE dividend yields 100 years from now are valued much less than equivalent payments this year). By this measure, justifying the current price sounds hard.

Google says that the annual dividend on Nvidia shares is 0.032%. (Yes, the leading digits are 0.0). So, right now, you get a much better rate of return just leaving your money in your bank's current account. So, at least by this measure, Nvidia shares are ludicrously over-priced. You could argue that future Nvidia pay outs might be much larger than the historical ones due to some big AI related profits. But, I don't find this argument convincing. Are future pay outs going to be 100x bigger? It would require a 100-fold yield increase for it to just be competitive with a savings account. If you time discount a little (say those 100-fold increases don't materialise for 3 years) then it looks even worse.
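The arithmetic behind the 100-fold figure can be spelled out in a couple of lines. This is only a sketch: the 0.032% yield is the number quoted above, while the helper name `required_multiplier` and the 3.2% comparison rate (the rate that a 100-fold increase would imply) are assumptions added for illustration.

```python
def required_multiplier(current_yield_pct, target_yield_pct):
    """How many times larger pay-outs must become to reach the target yield."""
    return target_yield_pct / current_yield_pct


dividend_yield_pct = 0.032   # figure quoted in the comment
savings_rate_pct = 3.2       # hypothetical savings rate, implied by "100-fold"

multiplier = required_multiplier(dividend_yield_pct, savings_rate_pct)
print(f"pay-outs would need to grow about {multiplier:.0f}x")
```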

Now, clearly the world doesn't value shares according to the same heuristics that make sense to a non-expert like me. For example, the method "time integrate future expected dividend pay outs with some kind of time discounting" tells us that cryptocurrencies are worthless, because they are like shares with zero dividends. But, people clearly do put a nonzero value on bitcoin - and there is no plausible way that many people are that wrong. So they are grasping something that I am missing, and that same thing is probably what allows company shares to be priced so high relative to the dividends.

hzn on Hzn's Shortform

Does Deepseek actually mean that Nvidia is over valued?

FYI I wrote this a few days ago but could not post it due to being rate limited. The one change I made is adding actually to the 1st line.

To be clear I have no intention whatsoever of shorting NVDA

Epistemic status -- I have no idea what I'm talking about

Epistemic status -- very speculative but not quite a DMT hallucination

Super human AI will run on computers not much more expensive than personal computers but perhaps with highly specialized chips maybe even specialized for the task of running a single AI instance

Investment in AI proper will be small relative to AI directed production

There will be a period of increasing marginal returns from AI; but this will eventually become diminishing marginal returns

Even during the period of increasing marginal returns more $$$ will go to AI directed production than AI proper

Companies that most successfully transition to AI will blow the competition away; some of these companies will have a moat & continue to make high profits. But how can such high profits be justified? Maybe the government needs to take 50% of the shares & create trust funds for its citizens.

Companies that buy up the right kinds of land & natural resources will also do well

Companies that are least affected by AI will benefit b/c of the Baumol effect

So what are the get rich quick schemes? Specialized chips, incorporating AI into production systems for non AI goods, strategically buying up the right land & land rights, AI resistant industries!?

Interesting times maybe too interesting

In conclusion I'm agnostic as to whether Nvidia is or is not over valued but other companies may benefit even more as AI advances. I think it's more about leadership & seizing opportunities than a few companies having an overwhelmingly dominant position.

dakara on What if Alignment is Not Enough?

Thanks for the reply!

The only general remarks that I want to make
are in regards to your question about
the model of 150 year long vaccine testing
on/over some sort of sample group and control group.
I notice that there is nothing exponential assumed
about this test object, and so therefore, at most,
the effects are probably multiplicative, if not linear.
Therefore, there are lots of questions about power dynamics
that we can overall safely ignore, as a simplification,
which is in marked contrast to anything involving ASI.
If we assume, as you requested, "no side effects" observed,
in any test group, for any of those things
that we happened to be thinking of, to even look for,
then for any linear system, that is probably "good enough".

I am not sure I understand the distinction between linear and exponential in the vaccine context. By linear do you mean that only a few people die? By exponential do you mean that a lot of people die?

If so, then I am not so sure that vaccine effects could only be linear. For example, there might be some change in our complex environment that would prompt the vaccine to act differently than it did in the past.

More generally, our vaccine can lead to catastrophic outcomes if there is something about its future behavior that we didn't predict. And if that turns out to be true, then things could get ugly really fast.

And the extent of the damage can be truly big. "Scientifically proven" cancer vaccine that passed the tests is like the holy grail of medicine. "Curing cancer" is often used by parents as an example of the great things their children could achieve. This is combined with the fact that cancer has been with us for a long time and the fact that the current treatment is very expensive and painful.

All of these factors combined tell us that in a relatively short period of time a large percentage of the total population will get this vaccine. At that point, the amount of damage that can be done only depends on what thing we overlooked, which we, by definition, have no control over.

If there is some long future problem that crops up,
the company can say "we never looked for that"
and "we are not responsible for the unexpected",
because the people who made the deployment choices
have taken their profits and their pleasure in life,
and are now long dead. "Not my Job".
"Don't blame us for the sins of our forefathers".
Similarly, no one is going to ever admit or concede
any point, of any argument, on pain of ego death.

This same excuse would surely be used by companies manufacturing the vaccine. They would argue that they shouldn't be blamed for something that the researchers overlooked. They would say that they merely manufactured the product in order to prevent the needless suffering of countless people.

For all we know, by the time that the overlooked thing happens, the original researchers (who developed and tested the vaccine) are long dead, having lived a life of praise and glory for their ingenious invention (not to mention all the money that they received).

ejt on You should read Hobbes, Locke, Hume, and Mill via EarlyModernTexts.com

Nice post! There can be some surprising language-barriers between early modern writers and today's readers. I remember as an undergrad getting very confused by a passage from Locke in which he often used the word 'sensible.' I took him to mean 'prudent' and only later discovered he meant 'can be sensed'!

anthony-digiovanni on Winning isn't enough

does that require you to either have the ability to commit to a plan or the inclination to consistently pick your plan from some prior epistemic perspective

You aren't required to take an action (/start acting on a plan) that is worse from your current perspective than some alternative. Let maximality-dominated mean "w.r.t. each distribution in my representor, worse in expectation than some alternative." (As opposed to "dominated" in the sense of "worse than an alternative with certainty".) Then, in general you would need^[1] to ask, "Among the actions/plans that are not maximality-dominated from my current perspective, which of these are dominated from my prior perspective?" And rule those out.

[1] If you care about diachronic norms of rationality, that is.
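One way to make the definition of maximality-dominated concrete is the small sketch below. It reads the definition as "some single alternative is better in expectation under every distribution in the representor", which is an interpretive assumption about the quantifier order. The function names, the toy actions `a`, `b`, `c`, and all the numbers are hypothetical.

```python
def expected_value(action, distribution, utilities):
    """Expected utility of `action` under one probability distribution over states."""
    return sum(p * utilities[action][state] for state, p in distribution.items())


def maximality_dominated(action, actions, representor, utilities):
    """True if some alternative beats `action` in expectation under every distribution."""
    for alternative in actions:
        if alternative == action:
            continue
        if all(
            expected_value(alternative, d, utilities)
            > expected_value(action, d, utilities)
            for d in representor
        ):
            return True
    return False


# Toy example: two states, a representor with two distributions.
representor = [{"s1": 0.3, "s2": 0.7}, {"s1": 0.7, "s2": 0.3}]
utilities = {
    "a": {"s1": 1.0, "s2": 0.0},
    "b": {"s1": 0.0, "s2": 1.0},
    "c": {"s1": -1.0, "s2": -1.0},
}
actions = list(utilities)

permissible = [
    x for x in actions
    if not maximality_dominated(x, actions, representor, utilities)
]
print(permissible)
```

Here `c` is ruled out because `a` beats it under both distributions, while `a` and `b` each win under one distribution and so neither dominates the other. The second filtering step described above would run the same check from the prior perspective's representor.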

dr_s on Fertility Will Never Recover

What? If every couple had only one child, the population would halve at each generation. That's what they mean. Replacement rate requires more than just one child.
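The halving can be checked with a few lines of arithmetic. This is a deliberately simplified sketch: it assumes everyone pairs up into couples and ignores mortality and overlapping generations, and the function name `generation_sizes` and the starting population of 1000 are made up for illustration.

```python
def generation_sizes(initial, children_per_couple, generations):
    """Size of each successive generation when every couple has the same number of children."""
    sizes = [initial]
    for _ in range(generations):
        # Two parents are replaced by `children_per_couple` children.
        sizes.append(sizes[-1] * children_per_couple / 2)
    return sizes


print(generation_sizes(1000, 1, 3))  # one child per couple
print(generation_sizes(1000, 2, 3))  # two children per couple
```

With one child per couple each generation is half the size of the previous one; with two, the size holds steady under these simplifying assumptions.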

dr_s on Fertility Will Never Recover

I mean, the whole point was "how can we have fertility but also not be a dystopia". You just described a dystopia. It's also kind of telling that the only way to make people have children, something that is supposedly a joyous experience, you can think of is "have a tyrannical dictator make it very clear that they'll make sure the alternative is even worse". That's pretty telling. Someone thinking this way is part of the problem more than they are of the solution.