LessWrong 2.0 Reader

View: New · Old · Top

next page (older posts) →

Gradient hacking: definitions and examples
Richard_Ngo (ricraz) · 2022-06-29T21:35:37.166Z · comments (2)

[link] Progress links and tweets, 2022-06-29
jasoncrawford · 2022-06-29T21:33:40.285Z · comments (0)

[question] Correcting human error vs doing exactly what you're told - is there literature on this in context of general system design?
Jan Czechowski (przemyslaw-czechowski) · 2022-06-29T21:30:05.753Z · answers+comments (0)

Latent Adversarial Training
Adam Jermyn (adam-jermyn) · 2022-06-29T20:04:00.249Z · comments (12)

Game Review: This Merchant Life
Zvi · 2022-06-29T18:30:00.816Z · comments (0)

Limits to Legibility
Jan_Kulveit · 2022-06-29T17:42:19.338Z · comments (11)

Will Capabilities Generalise More?
Ramana Kumar (ramana-kumar) · 2022-06-29T17:12:56.255Z · comments (39)

Kevin Kelly's "103 Bits of Advice," Expanded
Dalton Mabery (dalton-mabery) · 2022-06-29T13:36:13.160Z · comments (0)

The table of different sampling assumptions in anthropics
avturchin · 2022-06-29T10:41:18.872Z · comments (5)

Can We Align AI by Having It Learn Human Preferences? I’m Scared (summary of last third of Human Compatible)
apollonianblues · 2022-06-29T04:09:06.213Z · comments (3)

[link] Kurzgesagt – The Last Human (Youtube)
habryka (habryka4) · 2022-06-29T03:28:44.213Z · comments (7)

[question] Literature on How to Maximize Preferences
josh (soren-d) · 2022-06-28T22:41:38.152Z · answers+comments (0)

Challenge: A Much More Alien Message
kman · 2022-06-28T21:50:59.877Z · comments (7)

It’s Probably Not Lithium
Natália (Natália Mendonça) · 2022-06-28T21:24:10.246Z · comments (186)

Reflections on Living in "Guess Culture"
Dalton Mabery (dalton-mabery) · 2022-06-28T21:00:39.680Z · comments (1)

[question] What is the LessWrong Logo(?) Supposed to Represent?
DragonGod · 2022-06-28T20:20:52.321Z · answers+comments (6)

What Are You Tracking In Your Head?
johnswentworth · 2022-06-28T19:30:06.164Z · comments (81)

[link] Why is so much political commentary misleading?
contrarianbrit · 2022-06-28T17:10:58.743Z · comments (5)

CFAR Handbook: Introduction
CFAR!Duncan (CFAR 2017) · 2022-06-28T16:53:53.312Z · comments (12)

Units of Exchange
CFAR!Duncan (CFAR 2017) · 2022-06-28T16:53:53.069Z · comments (28)

[link] Scott Aaronson and Steven Pinker Debate AI Scaling
Liron · 2022-06-28T16:04:58.515Z · comments (7)

A physicist's approach to Origins of Life
pchvykov · 2022-06-28T15:23:23.310Z · comments (6)

[link] What success looks like
Marius Hobbhahn (marius-hobbhahn) · 2022-06-28T14:38:42.758Z · comments (4)

Four reasons I find AI safety emotionally compelling
KatWoods (ea247) · 2022-06-28T14:10:35.216Z · comments (3)

Some alternative AI safety research projects
Michele Campolo · 2022-06-28T14:09:27.661Z · comments (0)

Doom doubts - is inner alignment a likely problem?
Crissman · 2022-06-28T12:42:16.197Z · comments (7)

Low-Friction MBTA Predictions
jefftk (jkaufman) · 2022-06-28T12:30:01.714Z · comments (0)

What Diet Books Don't Teach: A book review and a request for more reading
Lone Pine (conor-sullivan) · 2022-06-28T12:27:04.847Z · comments (34)

Assessing AlephAlphas Multimodal Model
p.b. · 2022-06-28T09:28:10.921Z · comments (5)

[question] Is there any way someone could post about public policy relating to abortion access (or another sensitive subject) on LessWrong without getting super downvoted?
Evan_Gaensbauer · 2022-06-28T05:45:17.831Z · answers+comments (20)

[Test Post Please Ignore] Testing polling features
Lone Pine (conor-sullivan) · 2022-06-28T04:35:09.467Z · comments (5)

Yann LeCun, A Path Towards Autonomous Machine Intelligence [link]
Bill Benzon (bill-benzon) · 2022-06-27T23:29:55.384Z · comments (1)

Limits of Bodily Autonomy
jefftk (jkaufman) · 2022-06-27T19:50:01.813Z · comments (18)

[question] Systems Biology for self study
Ulisse Mini (ulisse-mini) · 2022-06-27T19:36:32.707Z · answers+comments (2)

[link] [Yann Lecun] A Path Towards Autonomous Machine Intelligence
DragonGod · 2022-06-27T19:24:50.543Z · comments (13)

Exploring Mild Behaviour in Embedded Agents
Megan Kinniment (megan-kinniment) · 2022-06-27T18:56:34.794Z · comments (4)

Epistemic modesty and how I think about AI risk
Aryeh Englander (alenglander) · 2022-06-27T18:47:35.827Z · comments (4)

Deliberation Everywhere: Simple Examples
Oliver Sourbut · 2022-06-27T17:26:20.848Z · comments (3)

Deliberation, Reactions, and Control: Tentative Definitions and a Restatement of Instrumental Convergence
Oliver Sourbut · 2022-06-27T17:25:45.986Z · comments (0)

[question] Are long-form dating profiles productive?
AABoyles · 2022-06-27T17:03:35.266Z · answers+comments (32)

[link] Custom iPhone Widget to Encourage Less Wrong Use
Will Payne (will-payne) · 2022-06-27T16:14:50.141Z · comments (2)

Announcing the Inverse Scaling Prize ($250k Prize Pool)
Ethan Perez (ethan-perez) · 2022-06-27T15:58:19.135Z · comments (14)

[link] Announcing Epoch: A research organization investigating the road to Transformative AI
Jsevillamol · 2022-06-27T13:55:51.451Z · comments (2)

Air Conditioner Repair
Zvi · 2022-06-27T12:40:01.514Z · comments (34)

[question] Why Are Posts in the Sequences Tagged [Personal Blog] Instead of [Frontpage]?
DragonGod · 2022-06-27T09:35:26.778Z · answers+comments (2)

Contest: An Alien Message
DaemonicSigil · 2022-06-27T05:54:54.144Z · comments (100)

[link] Robin Hanson asks "Why Not Wait On AI Risk?"
Gunnar_Zarncke · 2022-06-26T23:32:19.436Z · comments (4)

Sex Fairy Lore
pchvykov · 2022-06-26T20:42:38.636Z · comments (10)

[link] King David's %: Establishing a new symbol for Bayesian probability.
Paul Logan (paul-logan) · 2022-06-26T19:47:57.047Z · comments (1)

Do You Care Whether There Are "Successful" Rationalists?
UtilityMonster (Matt Goldwater) · 2022-06-26T18:53:37.316Z · comments (8)

next page (older posts) →

Archive

Recent comments

lechmazur on Are the LLM "intelligence" tests publicly available for humans to take?

You can go through an archive of NYT Connections puzzles I used in my leaderboard. The scoring I use allows only one try and gives partial credit, so if you fail after getting 1 line correct, that's 0.25 for the puzzle. Top humans get near 100%. Top LLMs score around 30%. Timing is not taken into account.

nonveumann on Thoughts on seed oil

This is shockingly similar to what I'm going through. And the fries that fucked me up the other night are indeed fried in canola oil. I'm cautiously optimistic but I know how complicated these things can be -_-. Will report back!

lukehmiles on Is being a trans woman +20 IQ?

Someone on a subreddit said "free testosterone" is what matters and they usually just measure uh "regular testosterone" in blood or something. I have no idea if that's true. Know what those studies measured?

Wildly guessing here, but my intuition is that estrogen would have a greater impact on neuroticism than testosterone. Although I can't even say which direction.

lukehmiles on Is being a trans woman +20 IQ?

Like what exactly? That seems unlikely to me. I suppose we will have results from the ongoing gender transitions soon.

lukehmiles on Is being a trans woman +20 IQ?

I only linked the U-shaped study to mention that someone had said something vaguely similar. Notice my words "people have posited a U-shaped curve...". Study indeed seems like garbage. Perhaps i should've said that explicitly.

But it still doesn't really prove the causality - lots of things presumably influence intelligence, and I wouldn't be surprised if some of them influence T as well.

Yes so the experiment is that a million people are starting up in taking hormones/blockers now. I don't think proper results are in but what I have myself observed seems like strong evidence that blocking T preserves or raises intelligence on the margin.

slapstick on Thoughts on seed oil

I would consider adding salt to something to be making that thing less healthy. If adding salt is essential to making something edible, I think it would be healthier to opt for something that doesn't require added salt. That's speaking generally though, someone might not be getting enough sodium, but typically there is adequate sodium in a diet of whole foods.

We often combine foods to make nutrients more accessible, like adding oil to greens with fat-soluble vitamins.

I would disagree that adding refined oil to greens would be healthy overall.

Not sure how much oil we're talking, but a tablespoon of oil has more calories than an entire pound of greens. Even if the oil increases the availability of vitamins, I am very sceptical that it would be healthier than greens or other whole plants with an equivalent caloric content to the added oil. I believe it's also the case that fats from whole foods can offer similar bioavailability effects.

At the same time, as far as I'm aware some kinds of vinegar might sometimes be a healthy addition to a meal, despite it's processing being undoubtedly contrary to the general guidelines I'm defending, so even if I don't agree about the oil I think the point still stands.

I do think you're offering some valid points that confound my idea of simple guidelines somewhat, but I still don't think they're very significant exceptions to my main point.

Appreciate the dialogue:)

nisan on Use the Try Harder, Luke

It is a fiction.

ablue on Magic by forgetting

Is this an independent reinvention of the law of attraction? There doesn't seem to be anything special about "stop having a disease by forgetting about it" compared to the general "be in a universe by adopting a mental state compatible with that universe." That said, becoming completely convinced I'm a billionaire seems more psychologically involved than forgetting I have some disease, and the ratio of universes where I'm a billionaire versus I've deluded myself into thinking I'm a billionaire seems less favorable as well.

Anyway, this doesn't seem like a good solution since even for every "me" that gets into a better universe, another just gets booted into the worse one. As far as the interests of the whole cohort go it'd be a waste of effort.

carl-feynman on Examples of Highly Counterfactual Discoveries?

Wegener’s theory of continental drift was decades ahead of its time. He published in the 1920s, but plate tectonics didn’t take over until the 1960s. His theory was wrong in important ways, but still.

christiankl on Morpheus's Shortform

Practicing grammar and correcting grammar on the fly seem to be two different things.

If you want to improve, then I would prompt GPT-4 with something like "I'm a student looking to improve my writing and grammar ability, here's an essay I wrote. Given that writing, please teach me about grammar."