LessWrong 2.0 Reader

Pausing AI Developments Isn't Enough. We Need to Shut it All Down
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2023-04-08T00:36:47.702Z · comments (39)

The 101 Space You Will Always Have With You
Screwtape · 2023-11-29T04:56:40.240Z · comments (20)

My Assessment of the Chinese AI Safety Community
Lao Mein (derpherpize) · 2023-04-25T04:21:19.274Z · comments (94)

Failures in Kindness
silentbob · 2024-03-26T21:30:11.052Z · comments (27)

The case for ensuring that powerful AIs are controlled
ryan_greenblatt · 2024-01-24T16:11:51.354Z · comments (66)

New Scaling Laws for Large Language Models
1a3orn · 2022-04-01T20:41:17.665Z · comments (22)

Munk AI debate: confusions and possible cruxes
Steven Byrnes (steve2152) · 2023-06-27T14:18:47.694Z · comments (21)

[link] "No-one in my org puts money in their pension"
Tobes (tobias-jolly) · 2024-02-16T18:33:28.996Z · comments (7)

[link] I hired 5 people to sit behind me and make me productive for a month
Simon Berens (sberens) · 2023-02-05T01:19:39.182Z · comments (81)

How "Discovering Latent Knowledge in Language Models Without Supervision" Fits Into a Broader Alignment Scheme
Collin (collin-burns) · 2022-12-15T18:22:40.109Z · comments (39)

My views on “doom”
paulfchristiano · 2023-04-27T17:50:01.415Z · comments (34)

Jailbreaking ChatGPT on Release Day
Zvi · 2022-12-02T13:10:00.860Z · comments (77)

Common misconceptions about OpenAI
Jacob_Hilton · 2022-08-25T14:02:26.257Z · comments (142)

Book Review: Going Infinite
Zvi · 2023-10-24T15:00:02.251Z · comments (109)

Yes, It's Subjective, But Why All The Crabs?
johnswentworth · 2023-07-28T19:35:36.741Z · comments (15)

A Quick Guide to Confronting Doom
Ruby · 2022-04-13T19:30:48.580Z · comments (33)

Working With Monsters
johnswentworth · 2021-07-20T15:23:20.762Z · comments (54)

The Plan - 2022 Update
johnswentworth · 2022-12-01T20:43:50.516Z · comments (37)

Alignment Implications of LLM Successes: a Debate in One Act
Zack_M_Davis · 2023-10-21T15:22:23.053Z · comments (50)

Slow motion videos as AI risk intuition pumps
Andrew_Critch · 2022-06-14T19:31:13.616Z · comments (41)

My Model Of EA Burnout
LoganStrohl (BrienneYudkowsky) · 2023-01-25T17:52:42.770Z · comments (49)

Thoughts on the impact of RLHF research
paulfchristiano · 2023-01-25T17:23:16.402Z · comments (101)

Contra Hofstadter on GPT-3 Nonsense
rictic · 2022-06-15T21:53:30.646Z · comments (24)

Announcing Balsa Research
Zvi · 2022-09-25T22:50:00.626Z · comments (64)

Concentration of Force
[DEACTIVATED] Duncan Sabien (Duncan_Sabien) · 2021-11-06T08:20:18.991Z · comments (23)

The shard theory of human values
Quintin Pope (quintin-pope) · 2022-09-04T04:28:11.752Z · comments (66)

An Observation of Vavilov Day
Elizabeth (pktechgirl) · 2022-01-03T21:10:02.107Z · comments (42)

[link] More information about the dangerous capability evaluations we did with GPT-4 and Claude.
Beth Barnes (beth-barnes) · 2023-03-19T00:25:39.707Z · comments (54)

Editing Advice for LessWrong Users
JustisMills · 2022-04-11T16:32:17.530Z · comments (14)

The Feeling of Idea Scarcity
johnswentworth · 2022-12-31T17:34:04.306Z · comments (22)

Deep Deceptiveness
So8res · 2023-03-21T02:51:52.794Z · comments (58)

My Clients, The Liars
ymeskhout · 2024-03-05T21:06:36.669Z · comments (85)

UFO Betting: Put Up or Shut Up
RatsWrongAboutUAP · 2023-06-13T04:05:32.652Z · comments (207)

Policy discussions follow strong contextualizing norms
Richard_Ngo (ricraz) · 2023-04-01T23:51:36.588Z · comments (61)

Introduction to abstract entropy
Alex_Altair · 2022-10-20T21:03:02.486Z · comments (78)

[link] Zoe Curzi's Experience with Leverage Research
Ilverin the Stupid and Offensive (Ilverin) · 2021-10-13T04:44:49.020Z · comments (261)

Self-driving car bets
paulfchristiano · 2023-07-29T18:10:01.112Z · comments (41)

You Don't Exist, Duncan
[DEACTIVATED] Duncan Sabien (Duncan_Sabien) · 2023-02-02T08:37:01.049Z · comments (107)

[link] Sum-threshold attacks
TsviBT · 2023-09-08T17:13:37.044Z · comments (52)

Lessons On How To Get Things Right On The First Try
johnswentworth · 2023-06-19T23:58:09.605Z · comments (56)

(briefly) RaDVaC and SMTM, two things we should be doing
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2022-01-12T06:20:35.555Z · comments (79)

[link] AGI in sight: our look at the game board
Andrea_Miotti (AndreaM) · 2023-02-18T22:17:44.364Z · comments (135)

AGI Safety FAQ / all-dumb-questions-allowed thread
Aryeh Englander (alenglander) · 2022-06-07T05:47:13.350Z · comments (526)

[link] ARC's first technical report: Eliciting Latent Knowledge
paulfchristiano · 2021-12-14T20:09:50.209Z · comments (90)

Replacing Karma with Good Heart Tokens (Worth $1!)
Ben Pace (Benito) · 2022-04-01T09:31:34.332Z · comments (173)

Whole Brain Emulation: No Progress on C. elegans After 10 Years
niconiconi · 2021-10-01T21:44:37.397Z · comments (87)

Catching the Eye of Sauron
Casey B. (Zahima) · 2023-04-07T00:40:46.556Z · comments (68)

Brute Force Manufactured Consensus is Hiding the Crime of the Century
Roko · 2024-02-03T20:36:59.806Z · comments (156)

Announcing MIRI’s new CEO and leadership team
Gretta Duleba (gretta-duleba) · 2023-10-10T19:22:11.821Z · comments (52)

What Do GDP Growth Curves Really Mean?
johnswentworth · 2021-10-07T21:58:15.121Z · comments (64)

Recent comments

ape-in-the-coat on Another Non-Anthropic Paradox: The Unsurprising Rareness of Rare Events

If this was true, then we could not observe the event „any other coin sequence“ as well since this event is by definition not being tracked.

When you are tracking event A you are automatically tracking its complement.

In fact, in order to detect a correspondence between a coin sequence that we have in mind and the actual sequence, our brain has to compare them to decide if there is a match. I can hardly imagine how this comparison could work without observing the specific actual sequence in the first place. That we classify and perceive a specific sequence as „any other sequence“ can be the result of the comparison, but is not its starting point.

Oh sure, you are of course completely correct here. But this doesn't contradict what I'm saying.

The thing is, we observe a particular outcome and then we see which event(s) it corresponds to. Let's take an example: a series of 3 coin tosses.

So, in the beginning you have a sample space, which consists of all the elementary outcomes:

$\{HHH, TTT, HHT, TTH, HTH, THT, THH, HTT\}$

And an event space, some sigma-algebra of the sample space, which depends on your precommitments. Normally, it would look something like this:

$\{\emptyset, \{HHH, TTT\}, \{HHT, TTH, HTH, THT, THH, HTT\}, \{HHH, TTT, HHT, TTH, HTH, THT, THH, HTT\}\}$

Because you are intuitively paying attention to whether there are all Heads or all Tails in a row. So your event space groups individual outcomes in this particular way, separating the event you are tracking and its complement.

When a particular combination, say $THH$, is realized in an iteration of the experiment, your mind works like this:

Outcome $T H H$ is realized
Therefore every event from the event space which includes $T H H$ is realized.
Events $\{HHT, TTH, HTH, THT, THH, HTT\}$ and $\{HHH, TTT, HHT, TTH, HTH, THT, THH, HTT\}$ are realized.
$P(HHT \text{ or } TTH \text{ or } HTH \text{ or } THT \text{ or } THH \text{ or } HTT) = 6/8 = 3/4$
This isn't a rare event and so you are not particularly surprised

So, as you see, you do indeed observe an actual sequence; it's just that observing this sequence isn't necessarily an event in itself.
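To make the bookkeeping concrete, here is a minimal Python sketch of the three-toss example. It assumes fair, independent coins (so all eight outcomes are equiprobable), and the names `sample_space`, `event_space`, `probability` and `realized_events` are purely illustrative.

```python
from fractions import Fraction
from itertools import product

# Sample space: all 8 equiprobable outcomes of three fair coin tosses.
sample_space = {"".join(tosses) for tosses in product("HT", repeat=3)}

# The event being tracked ("all tosses are the same") and its complement.
all_same = {"HHH", "TTT"}
mixed = sample_space - all_same

# Event space: the sigma-algebra generated by the tracked event.
event_space = [set(), all_same, mixed, sample_space]


def probability(event):
    """Probability of an event, assuming equiprobable outcomes."""
    return Fraction(len(event), len(sample_space))


def realized_events(outcome):
    """Every event in the event space that contains the observed outcome."""
    return [event for event in event_space if outcome in event]


# Observing THH realizes only the "mixed" event and the whole sample space.
for event in realized_events("THH"):
    print(sorted(event), probability(event))
```

The outcome `THH` is observed, but the only events it realizes are the six-outcome complement and the full sample space; "exactly THH" never appears in `event_space`, so it is not something this observer can be surprised by.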

shoshannah-tekofsky on Dyslucksia

This is really good! Thank you for sharing ^_^ Competition drive and wanting to achieve certain things are great motivations, and I think in any learning process the motivation one can tap into is at least as important as the actual learning technique. I'm glad you had access to that.

I tend to feel a little confused about the concept of "intelligence", as I guess my post already illustrated, haha. I think the word as we use it is very imprecise for cases like this. I'd roughly expect people with higher general intelligence to be much faster and more successful at finding workarounds for their language processing issues, but I'd also expect the variance in this to be so high that plotting your general intelligence against "how quickly did you tame your dyslexia" wouldn't make much sense.

Then again, I do agree with a comment somewhere else here that Typical Minding is a thing, and my intuitions here may be wrong cause I'm failing to understand what it's like for other minds and I might have overcorrected due to 25 years of incorrectly concluding I was kind of dumb. Lol.

ben-lang on We might be missing some key feature of AI takeoff; it'll probably seem like "we could've seen this coming"

I like this framework.

Often when thinking about a fictional setting (reading a book, or worldbuilding) there will be aspects that stand out as not feeling like they make sense [1]. I think you have a good point that extrapolating out a lot of trends might give you something that at first glance seems like a good prediction, but if you tried to write that world as a setting, without any reference to how it got there, just writing it how you think it ends up, then the weirdness jumps out.

[1] E.g., in Dune, lasers and shields have an interaction that produces an unpredictably large nuclear explosion. To which the setting posits the equilibrium "no one uses lasers, it could set off an explosion". With only the facts we are given, and the fact that the setting is swarming with honourless killers and martyrdom-loving religious warriors, it seems like an implausible equilibrium. Obviously it could be explained with further details.

martin-vlach on You Can Face Reality

a worthy platitude(?)

ape-in-the-coat on Beauty and the Bets

Sure, I don't deny that. What I am saying is that your probability model doesn't tell you which probability you have to base a certain decision on

It says which probability you have, based on what you've observed. If you observed that it's Monday, you are supposed to use the probability conditional on the fact that it's Monday; if you didn't observe that it's Monday, you can't lawfully use the probability conditional on the fact that it's Monday. Simple as that.

There is a possible confusion where people may think that they have observed "this specific thing happened" while actually they observed "any thing from some group of things happened", which is what the technicolor and rare event cases are about.

Suppose a simple experiment where the experimenter flips a fair coin and you have to guess if Tails or Heads, but you are only rewarded for the correct decision if the coin comes up Tails. Then, of course, you should still entertain unconditional probabilities P(Heads)=P(Tails)=1/2. But this uncertainty is completely irrelevant to your decision.

Here you are confusing probability and utility. The fact that P(Heads)=P(Tails)=1/2 is very much relevant to our decision making! The correct reasoning goes like this:

P(Heads) = 1/2

P(Tails) = 1/2

U(Heads) = 0

U(Tails) = X

E(Tails) = P(Tails)U(Tails) - P(Heads)U(Heads) = X/2 - 0

Solving E(Tails) = 0 for X:

X = 0

Which means that betting on Tails breaks even at X = 0 and pays off for any X > 0, so you shouldn't bet on Heads at any odds.
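The same calculation can be sketched in a few lines of Python. This is only an illustration of the setup quoted above (a fair coin, with a reward paid only when the coin lands Tails and the guess is correct); the function name `expected_utility` and the reward value `10` are assumptions made for the example.

```python
from fractions import Fraction

# Unconditional probabilities of the fair coin.
P = {"Heads": Fraction(1, 2), "Tails": Fraction(1, 2)}


def expected_utility(guess, reward):
    """Expected utility of always making `guess`.

    The reward is paid only when the coin lands Tails and the guess
    was Tails; a correct Heads guess pays nothing.
    """
    utility = {
        "Heads": 0,
        "Tails": reward if guess == "Tails" else 0,
    }
    return sum(P[outcome] * utility[outcome] for outcome in P)


X = 10  # hypothetical reward for a correct Tails guess
print("guess Tails:", expected_utility("Tails", X))
print("guess Heads:", expected_utility("Heads", X))
```

Both strategies use the same P(Heads) = P(Tails) = 1/2; only the utilities differ, and that difference is what singles out guessing Tails.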

What is relevant, however, is P(Tails/Tails)=1 and P(Heads/Tails)=0, concluding you should follow the strategy always guessing Tails.

And why did you happen to decide that it's P(Tails|Tails) = 1 and P(Heads|Tails) = 0 instead of

P(Heads|Heads) = 1 and P(Tails|Heads) = 0, which are "relevant" for your decision making?

You seem to just decide the "relevance" of probabilities post hoc, after you've already calculated the correct answer the proper way. I don't think you can formalize this line of thinking, so that you had a way to systematically correctly solve decision theory problems, which you do not yet know the answer to. Otherwise, we wouldn't need utilities as a concept.

Another way to arrive at this strategy is to calculate expected utilities setting U(Heads)=0 as you would propose. But this is not the only reasonable solution. It’s just a different route of reasoning to take into account the experimental condition that your decision counts only if the coin lands Tails.

This is not "another way". This is the right way. It has the proper formalization and actually allows us to arrive at the correct answer even if we do not yet know it.

If the optimal betting scheme requires you to rely on P(Heads/Red or Blue)=1/2 when receiving evidence Blue, then the betting scheme demands you to ignore your total evidence.

You do not "ignore your total evidence" - you are never supposed to do that. It's just that you didn't actually receive the evidence in the first place. You can observe the fact that the room is blue in the experiment only if you put your mind in a state where you distinguish blue in particular. Until then your event space doesn't even include "Blue", only "Blue or Red".

But I suppose it's better to go to the comment section of Another Non-Anthropic Paradox for this particular crux.

vanessa-kosoy on Dating Roundup #3: Third Time’s the Charm

FWIW, from glancing at your LinkedIn profile, you seem very dateable :)

gunnar_zarncke on Dating Roundup #3: Third Time’s the Charm

I said die, not kill. Let the predators continue to use the dating platforms if they want. It will keep them away from other more wholesome places.

alexander-gietelink-oldenziel on Thomas Kwa's Shortform

This seems valuable! I'd be curious to hear more!!

keltan on Dyslucksia

I think it would be correct to say that therapy was effective for my reading. By the end of primary school I could read at a normal level. However, my reading out loud ability seems not to have improved too much since then. I hadn’t realised until just now. But I still have to memorise how to say new words. I can, with a small effort, look at a simple word I have never encountered and pronounce it. Though, the word has to be quite simple. I host trivia as a side gig, and any question with a name that isn’t spelled traditionally trips me up badly. It can be pretty embarrassing trying to say “Sarrah” and not realising it’s just pronounced “Sarah”.

That’s the thing that leads me to think that, at least with reading out loud, I have to explicitly memorise a word’s pronunciation before I can say it, instead of doing what I assume others can do: just look at a word and know how to say it.

In writing, it was necessity and cultural pressure. By the time I was reading out loud alright I was still writing like “i fond how to Mack a YouTube account” “ken i”. That’s a real quote my mother sent me a few weeks ago. When I realised I wasn’t getting what I wanted, (Winning MC battles, Reddit upvotes, winning Facebook wars, girls would comment on my spelling and I didn’t want them to) I would look around at the way others were writing things and cargo cult type copy whatever they were doing. Actually, that’s still what I do.

I don’t think it was high intelligence that caused me to notice these fixes. It took far too long to be intelligence. Instead, I think I’m really competitive and like showing off. Eventually I found methods that got the results I was going for.

I also watched a lot of JacksFilms YGS https://youtu.be/NARxgXEdlzs?si=1rGyQMAnMxQo0x-2

shoshannah-tekofsky on Dyslucksia

Interesting! Thank you for sharing! I'd love to know the answer as well.

Anecdotally, I can say that I did try to learn Japanese a little, and I found Kanji far easier to learn than words in hiragana or katakana, cause relating a "picture" to a word seemed far easier for me to parse and remember than to remember "random phonetic encodings". I'm using quotation marks to indicate my internal experience, cause I'm a little mistrustful by now if I'm even understanding how other people parse words and language.

Either way, that anecdote would point to my pictorial->meaning wiring being stronger than my phoneme-encoding->meaning wiring. Which might explain why processing language as drawings helped me. I really have no idea how much this would generalize. But I agree people must run into this when learning new alphabets.