LessWrong 2.0 Reader

[question] Which of our online writings was used to train GPT-3?
Mati_Roy (MathieuRoy) · 2021-10-30T21:52:08.706Z · answers+comments (3)

Why the Problem of the Criterion Matters
Gordon Seidoh Worley (gworley) · 2021-10-30T20:44:00.143Z · comments (9)

Budapest Less Wrong/ SSC
Timothy Underwood (timothy-underwood-1) · 2021-10-30T18:27:27.045Z · comments (0)

Quick general thoughts on suffering and consciousness
Rob Bensinger (RobbBB) · 2021-10-30T18:05:59.612Z · comments (46)

Must true AI sleep?
YimbyGeorge (mardukofbabylon) · 2021-10-30T16:47:46.234Z · comments (1)

How Much is a Sweet?
jefftk (jkaufman) · 2021-10-30T15:50:05.758Z · comments (6)

God Is Great
Mahdi Complex (mahdi-complex) · 2021-10-30T13:03:20.998Z · comments (7)

We Live in a Post-Scarcity Society
lsusr · 2021-10-30T12:05:35.267Z · comments (22)

Tell the Truth
lsusr · 2021-10-30T10:27:42.996Z · comments (40)

A Roadmap to a Post-Scarcity Economy
lorepieri (lorenzo-rex) · 2021-10-30T09:04:29.479Z · comments (3)

Start with a Title
lsusr · 2021-10-30T08:59:08.208Z · comments (4)

SSC/Lesswrong San Diego Meetup
CitizenTen · 2021-10-30T00:15:41.324Z · comments (1)

Unlock the Door
lincolnquirk · 2021-10-29T23:45:43.273Z · comments (5)

[link] Naval Ravikant and Chris Dixon Didn't Explain Any Web3 Use Cases
Liron · 2021-10-29T21:54:50.184Z · comments (0)

[TL;DR] "Training for the New Alpinism" by Steve House and Scott Johnston
lsusr · 2021-10-29T21:20:00.451Z · comments (1)

True Stories of Algorithmic Improvement
johnswentworth · 2021-10-29T20:57:13.638Z · comments (7)

Goodhart's Imperius
[DEACTIVATED] Duncan Sabien (Duncan_Sabien) · 2021-10-29T20:19:42.291Z · comments (6)

A system of infinite ethics
Chantiel · 2021-10-29T19:37:42.828Z · comments (60)

Stuart Russell and Melanie Mitchell on Munk Debates
Alex Flint (alexflint) · 2021-10-29T19:13:58.244Z · comments (4)

A very crude deception eval is already passed
Beth Barnes (beth-barnes) · 2021-10-29T17:57:29.475Z · comments (6)

On the Universal Distribution
Joe Carlsmith (joekc) · 2021-10-29T17:50:15.849Z · comments (4)

[link] Google announces Pathways: new generation multitask AI Architecture
Ozyrus · 2021-10-29T11:55:21.797Z · comments (1)

I Really Don't Understand Eliezer Yudkowsky's Position on Consciousness
J Bostock (Jemist) · 2021-10-29T11:09:20.559Z · comments (120)

Leadership
lsusr · 2021-10-29T07:29:54.610Z · comments (4)

Truthful and honest AI
abergal · 2021-10-29T07:28:36.225Z · comments (1)

Interpretability
abergal · 2021-10-29T07:28:02.650Z · comments (13)

Techniques for enhancing human feedback
abergal · 2021-10-29T07:27:46.700Z · comments (0)

Measuring and forecasting risks
abergal · 2021-10-29T07:27:32.836Z · comments (0)

Request for proposals for projects in AI alignment that work with deep learning systems
abergal · 2021-10-29T07:26:58.754Z · comments (0)

My current thinking on money and low carb diets
Adam Zerner (adamzerner) · 2021-10-29T06:50:38.543Z · comments (17)

[question] What are fiction stories related to AI alignment?
Mati_Roy (MathieuRoy) · 2021-10-29T02:59:52.920Z · answers+comments (22)

[question] How to generate idea/solutions to solve a problem?
warrenjordan · 2021-10-29T00:53:15.941Z · answers+comments (5)

[link] Forecasting progress in language models
Matthew Barnett (matthew-barnett) · 2021-10-28T20:40:59.897Z · comments (6)

[AN #168]: Four technical topics for which Open Phil is soliciting grant proposals
Rohin Shah (rohinmshah) · 2021-10-28T17:20:03.387Z · comments (0)

Better and Worse Ways of Stating SIA
dadadarren · 2021-10-28T16:04:22.333Z · comments (0)

Recommending Understand, a Game about Discerning the Rules
MondSemmel · 2021-10-28T14:53:16.901Z · comments (53)

Covid 10/28: An Unexpected Victory
Zvi · 2021-10-28T14:50:01.072Z · comments (37)

An Unexpected Victory: Container Stacking at the Port of Long Beach
Zvi · 2021-10-28T14:40:00.497Z · comments (41)

Save the kid, ruin the suit; Acceptable utility exchange rates; Distributed utility calculations; Civic duties matter
spkoc · 2021-10-28T11:51:52.057Z · comments (8)

Voting for people harms people
CraigMichael · 2021-10-28T08:29:13.075Z · comments (6)

[link] Selfishness, preference falsification, and AI alignment
jessicata (jessica.liu.taylor) · 2021-10-28T00:16:47.051Z · comments (28)

Ruling Out Everything Else
[DEACTIVATED] Duncan Sabien (Duncan_Sabien) · 2021-10-27T21:50:39.545Z · comments (51)

[link] They don't make 'em like they used to
jasoncrawford · 2021-10-27T19:44:47.098Z · comments (84)

Hegel vs. GPT-3
Bezzi · 2021-10-27T05:55:18.296Z · comments (21)

[link] Everything Studies on Cynical Theories
DanielFilan · 2021-10-27T01:31:20.608Z · comments (5)

Harry Potter and the Methods of Psychomagic | Chapter 2: The Global Neuronal Workspace
Henry Prowbell · 2021-10-26T18:54:49.386Z · comments (8)

X-Risk, Anthropics, & Peter Thiel's Investment Thesis
Jackson Wagner · 2021-10-26T18:50:03.300Z · comments (1)

[question] Would the world be a better place if we all agreed to form a world government next Monday?
idontwanttodie · 2021-10-26T18:14:17.432Z · answers+comments (5)

Don't Use the "God's-Eye View" in Anthropic Problems.
dadadarren · 2021-10-26T13:47:53.386Z · comments (1)

Impressive vs honest signaling
Adam Zerner (adamzerner) · 2021-10-26T07:16:24.478Z · comments (12)

Recent comments

slapstick on Thoughts on seed oil

I am perhaps not speaking as precisely as I should be. I appreciate your comments.

I believe it's correct to say that if you consider all of the food/energy we consumed in the past 50+ million years, it's virtually all plants.

The past 2-2.5 million years had us introducing more animal products to greater or lesser extents. Some were able to subsist on mostly animal products. Some consumed them very rarely.

In that sense it is a relatively recent introduction. My main point is that given our evolutionary history, the idea that plants would be healthier for us than animal products when we have both in abundance, and the idea that plants are more suitable to maintaining health long past reproductive age, aren't immediately/obviously unreasonable ideas.

dzoldzaya on Thoughts on seed oil

I think your intuitions are generally correct, and as I say, it's usually a good heuristic to avoid overly processed food. In the absence of other evidence, if you're in a food market where everything is edible, you should probably opt for the less processed option. I also don't disagree with it playing a role in national health guidelines.

But it's a very imprecise heuristic, and I think LessWrong-ers with aspirations to understand the world more accurately should feel a bit uncomfortable with it, especially when benign and beneficial processes are lumped together with those with much clearer mechanisms for harm.

interstice on Is being a trans woman +20 IQ?

performance gap of trans women over women

The post is about the performance gap of trans women over men, not women.

leon-lang on Examples of Highly Counterfactual Discoveries?

I guess (but don't know) that most people who downvote Garrett's comment overupdated on intuitive explanations of singular learning theory, not realizing that entire books with novel and nontrivial mathematical theory have been written on it.

eggsyntax on eggsyntax's Shortform

Maybe by the time we cotton on properly, they're somewhere past us at the top end.

Great point. I agree that there are lots of possible futures where that happens. I'm imagining a couple of possible cases where this would matter:

Humanity decides to stop AI capabilities development or slow it way down, so we have sub-ASI systems for a long time (which could be at various levels of intelligence, from current to ~human). I'm not too optimistic about this happening, but there's certainly been a lot of increasing AI governance momentum in the last year.
Alignment is sufficiently solved that even > AGI systems are under our control. On many alignment approaches, this wouldn't necessarily mean that those systems' preferences were taken into account.

We can't "just ask" an LLM about its interests and expect the answer to soundly reflect its actual interests.

I agree entirely. I'm imagining (though I could sure be wrong!) that any future systems which were sentient would be ones that had something more like a coherent, persistent identity, and were trying to achieve goals.

LLMs specifically have a 'drive' to generate reasonable-sounding text

(not very important to the discussion, feel free to ignore, but) I would quibble with this. In my view LLMs aren't well-modeled as having goals or drives. Instead, generating distributions over tokens is just something they do in a fairly straightforward way because of how they've been shaped (in fact the only thing they do or can do), and producing reasonable text is an artifact of how we choose to use them (i.e., picking a likely output, adding it onto the context, and running it again). Simulacra like the assistant character can be reasonably viewed (to a limited degree) as being goal-ish, but I think the network itself can't.

That may be overly pedantic, and I don't feel like I'm articulating it very well, but the distinction seems useful to me since some other types of AI are well-modeled as having goals or drives.
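To make the "pick a likely output, add it onto the context, run it again" loop concrete, here is a minimal sketch. Everything in it is an assumption for illustration: `toy_model` is a hypothetical lookup table standing in for a network, not a real LLM, and `generate` uses greedy selection, which is only one of several ways to pick a likely output.

```python
from typing import Callable, Dict, List

Distribution = Dict[str, float]

def toy_model(context: List[str]) -> Distribution:
    """Hypothetical stand-in for a network: returns a distribution over next tokens."""
    table = {
        "the": {"cat": 0.6, "dog": 0.4},
        "cat": {"sat": 0.7, "ran": 0.3},
        "dog": {"ran": 0.8, "sat": 0.2},
    }
    last = context[-1] if context else ""
    # Unknown contexts end the sequence in this toy example.
    return table.get(last, {"<eos>": 1.0})

def generate(model: Callable[[List[str]], Distribution],
             prompt: List[str], max_steps: int = 10) -> List[str]:
    """The usage loop: pick a likely token, append it to the context, run again."""
    context = list(prompt)
    for _ in range(max_steps):
        dist = model(context)            # the only thing the model itself does
        token = max(dist, key=dist.get)  # greedy choice (an assumption)
        if token == "<eos>":
            break
        context.append(token)
    return context

print(generate(toy_model, ["the"]))  # ['the', 'cat', 'sat']
```

Note that the loop, the choice rule, and the stopping condition all live in `generate`, outside the model, which is roughly the distinction I'm pointing at.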

tailcalled on Examples of Highly Counterfactual Discoveries?

Newton's Universal Law of Gravitation was the first highly accurate model of things falling down that generalized beyond the earth, and it is also the second-most computationally applicable model of things falling down that we have today.

Are you saying that singular learning theory was the first highly accurate model of breadth of optima, and that it's one of the most computationally applicable ones we have?

johannes-c-mayer on Johannes C. Mayer's Shortform

The point is that you are just given some graph. This graph is expected to have subgraphs which are lattice graphs. But you don't know where they are. And the graph is so big that you can't iterate the entire graph to find these lattices. Therefore you need a way to embed the graph without traversing it fully.

johannes-c-mayer on Johannes C. Mayer's Shortform

—The realization that I have a systematic distortion in my mental evaluation of plans, making actions seem less promising than they are. When I’m deciding whether to do stuff, I can apply a conscious correction to this, to arrive at a properly calibrated judgment.

—The realization that, in general, my thinking can have systematic distortions, and that I shouldn’t believe everything I think. This is basic less-wrong style rationalism, but it took years to work through all the actual consequences on me.

This is useful. Now that I think about it, I do this. Specifically, I have extremely unrealistic assumptions about how much I can do, such that these are impossible to accomplish. And then I feel bad for not accomplishing the thing.

I haven't tried to be mindful of that. The problem is that this is, I think, mainly subconscious. I basically never think things like "I am dumb" or "I am a failure", at least not in explicit language. I might have accidentally suppressed these thoughts and concluded that I had succeeded in not being harsh to myself. But maybe I only moved them to the subconscious level, where they are harder to debug.

johannes-c-mayer on Planning in a Lattice Graph

I might not understand exactly what you are saying. Are you saying that the problem is easy when you have a function that gives you the coordinates of an arbitrary node? Isn't that exactly the embedding function? So are you not therefore assuming that you have an embedding function?

I agree that once you have such a function the problem is easy, but I am confused about how you are getting that function in the first place. If you are not given it, then I don't think it is super easy to get.

In the OP I was assuming that I have that function, but I was saying that this is not a valid assumption in general. You can imagine you are just given a set of vertices and edges. Now you want to compute the embedding such that you can do the vector planning described in the article.

I agree that you probably can do better, though I don't understand how your proposal helps.
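For concreteness, here is a rough sketch of the easy half, where the embedding function is simply given. It assumes a 2D integer lattice; `embed`, `plan`, and the node names are hypothetical and not taken from the article. The hard half, computing `embed` from a bare set of vertices and edges without traversing the whole graph, is not addressed here.

```python
from typing import Callable, Hashable, List, Tuple

Coord = Tuple[int, int]

def plan(embed: Callable[[Hashable], Coord],
         start: Hashable, goal: Hashable) -> List[Coord]:
    """Return unit moves from start to goal on a 2D lattice.

    Assumes embed(node) gives integer lattice coordinates for any node.
    """
    (x0, y0), (x1, y1) = embed(start), embed(goal)
    dx, dy = x1 - x0, y1 - y0            # the plan is just the difference vector
    step_x = (1 if dx > 0 else -1, 0)
    step_y = (0, 1 if dy > 0 else -1)
    # Decompose the difference vector into unit steps along each axis.
    return [step_x] * abs(dx) + [step_y] * abs(dy)

# Hypothetical embedding given as a lookup table.
coords = {"a": (0, 0), "b": (2, -1)}
print(plan(coords.__getitem__, "a", "b"))  # [(1, 0), (1, 0), (0, -1)]
```

No search happens anywhere in `plan`, which is why everything hinges on where `embed` comes from.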

no77e-noi on The first future and the best future

From a purely utilitarian standpoint, I'm inclined to think that the cost of delaying is dwarfed by the number of future lives saved by getting a better outcome, assuming that delaying does increase the chance of a better future.

That said, after we know there's "no chance" of extinction risk, I don't think delaying would likely yield better future outcomes. On the contrary, I suspect getting the coordination necessary to delay means it's likely that we're giving up freedoms in a way that may reduce the value of the median future and increase the chance of stuff like totalitarian lock-in, which decreases the value of the average future overall.

I think you're correct that the "other existential risks exist" consideration also has to be balanced in the calculation, although I don't expect it to be clear-cut.