LessWrong 2.0 Reader


Beliefs and Disagreements about Automating Alignment Research
Ian McKenzie (naimenz) · 2022-08-24T18:37:00.419Z · comments (4)
[link] Perplexity wins my AI race
Elizabeth (pktechgirl) · 2024-08-24T19:20:10.859Z · comments (12)
Oliver Sipple
KatjaGrace · 2021-02-19T07:00:18.788Z · comments (13)
[link] Most smart and skilled people are outside of the EA/rationalist community: an analysis
titotal (lombertini) · 2024-07-12T12:13:56.215Z · comments (36)
Consider your appetite for disagreements
Adam Zerner (adamzerner) · 2022-10-08T23:25:44.096Z · comments (18)
Programmatic backdoors: DNNs can use SGD to run arbitrary stateful computation
Fabien Roger (Fabien) · 2023-10-23T16:37:45.611Z · comments (3)
Petrov Day Retrospective: 2022
Ruby · 2022-09-28T22:16:20.325Z · comments (41)
The Overton Window widens: Examples of AI risk in the media
Akash (akash-wasil) · 2023-03-23T17:10:14.616Z · comments (24)
Plans Are Predictions, Not Optimization Targets
johnswentworth · 2022-10-20T21:17:07.000Z · comments (20)
The bads of ads
KatjaGrace · 2020-10-23T05:50:00.634Z · comments (38)
Clarifying METR's Auditing Role
Beth Barnes (beth-barnes) · 2024-05-30T18:41:56.029Z · comments (1)
[link] Paper: On measuring situational awareness in LLMs
Owain_Evans · 2023-09-04T12:54:20.516Z · comments (16)
Understanding Conjecture: Notes from Connor Leahy interview
Akash (akash-wasil) · 2022-09-15T18:37:51.653Z · comments (23)
[question] How do you feel about LessWrong these days? [Open feedback thread]
jacobjacob · 2023-12-05T20:54:42.317Z · answers+comments (281)
[link] What's Wrong with Social Science and How to Fix It: Reflections After Reading 2578 Papers
habryka (habryka4) · 2020-09-12T01:46:07.349Z · comments (22)
2023 in AI predictions
jessicata (jessica.liu.taylor) · 2024-01-01T05:23:42.514Z · comments (35)
Lives of the Cambridge polymath geniuses
Owain_Evans · 2022-01-25T04:45:17.756Z · comments (40)
"Liquidity" vs "solvency" in bank runs (and some notes on Silicon Valley Bank)
rossry · 2023-03-12T09:16:45.630Z · comments (27)
The Darwin Game - Rounds 0 to 10
lsusr · 2020-10-24T02:17:43.343Z · comments (34)
Imitative Generalisation (AKA 'Learning the Prior')
Beth Barnes (beth-barnes) · 2021-01-10T00:30:35.976Z · comments (15)
Demons in Imperfect Search
johnswentworth · 2020-02-11T20:25:19.655Z · comments (21)
The alignment problem from a deep learning perspective
Richard_Ngo (ricraz) · 2022-08-10T22:46:46.752Z · comments (15)
Skills I'd like my collaborators to have
Raemon · 2024-02-09T08:20:37.686Z · comments (9)
Picking Mentors For Research Programmes
Raymond D · 2023-11-10T13:01:14.197Z · comments (8)
"No evidence" as a Valley of Bad Rationality
Adam Zerner (adamzerner) · 2020-03-28T23:45:44.927Z · comments (21)
Avoid Unnecessarily Political Examples
Raemon · 2021-01-11T05:41:56.439Z · comments (42)
[link] the QACI alignment plan: table of contents
Tamsin Leake (carado-1) · 2023-03-21T20:22:00.865Z · comments (1)
The first future and the best future
KatjaGrace · 2024-04-25T06:40:04.510Z · comments (12)
One Day Sooner
Screwtape · 2023-11-02T19:00:58.427Z · comments (7)
"If You're Not a Holy Madman, You're Not Trying"
abramdemski · 2021-02-28T18:56:19.560Z · comments (26)
[Crosspost] On Hreha On Behavioral Economics
Scott Alexander (Yvain) · 2021-08-31T18:14:39.075Z · comments (6)
Gradient hacking
evhub · 2019-10-16T00:53:00.735Z · comments (39)
Conflict Theory of Bounded Distrust
Zack_M_Davis · 2023-02-12T05:30:30.760Z · comments (29)
Funding is All You Need: Getting into Grad School by Hacking the NSF GRFP Fellowship
hapanin · 2022-09-22T21:39:15.399Z · comments (9)
Danger, AI Scientist, Danger
Zvi · 2024-08-15T22:40:06.715Z · comments (9)
Trying to disambiguate different questions about whether RLHF is “good”
Buck · 2022-12-14T04:03:27.081Z · comments (47)
200 Concrete Open Problems in Mechanistic Interpretability: Introduction
Neel Nanda (neel-nanda-1) · 2022-12-28T21:06:53.853Z · comments (0)
Demystifying "Alignment" through a Comic
milanrosko · 2024-06-09T08:24:22.454Z · comments (19)
Why I'm doing PauseAI
Joseph Miller (Josephm) · 2024-04-30T16:21:54.156Z · comments (16)
New LessWrong feature: Dialogue Matching
jacobjacob · 2023-11-16T21:27:16.763Z · comments (22)
Relationship Advice Repository
Ruby · 2022-06-20T14:39:36.548Z · comments (36)
In favour of exploring nagging doubts about x-risk
owencb · 2024-06-25T23:52:01.322Z · comments (2)
Consider Joining the UK Foundation Model Taskforce
Zvi · 2023-07-10T13:50:05.097Z · comments (12)
[link] A case for AI alignment being difficult
jessicata (jessica.liu.taylor) · 2023-12-31T19:55:26.130Z · comments (56)
Caution when interpreting Deepmind's In-context RL paper
Sam Marks (samuel-marks) · 2022-11-01T02:42:06.766Z · comments (8)
[link] My emotional reaction to the current funding situation
Sam F. Brown (sam-4) · 2022-09-09T22:02:46.301Z · comments (36)
I don't think MIRI "gave up"
Raemon · 2023-02-03T00:26:07.552Z · comments (64)
How to Play a Support Role in Research Conversations
johnswentworth · 2021-04-23T20:57:50.075Z · comments (4)
Call for research on evaluating alignment (funding + advice available)
Beth Barnes (beth-barnes) · 2021-08-31T23:28:49.121Z · comments (11)
The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks
Lucius Bushnaq (Lblack) · 2024-05-20T17:53:25.985Z · comments (4)