LessWrong 2.0 Reader

← previous page (newer posts) · next page (older posts) →

The Sinews of Sudan’s Latest War
Tim Liptrot (rockthecasbah) · 2023-08-04T18:17:27.860Z · comments (12)
[link] Read More Books but Pretend to Read Even More
Arjun Panickssery (arjun-panickssery) · 2023-08-05T00:07:48.671Z · comments (12)
[link] Announcing Squiggle Hub
ozziegooen · 2023-08-05T01:00:17.739Z · comments (4)
[question] What are the best published papers from outside the alignment community that are relevant to Agent Foundations?
Stephen Fowler (LosPolloFowler) · 2023-08-05T03:02:33.003Z · answers+comments (4)
Meet Hyperion on Sunday Aug 6?
duck_master · 2023-08-05T04:36:02.462Z · comments (0)
ACX Paris Meetup - August 11 2023
PoignardAzur · 2023-08-05T09:44:05.717Z · comments (0)
A Naive Proposal for Constructing Interpretable AI
Chris_Leong · 2023-08-05T10:32:05.446Z · comments (6)
[Linkpost] Applicability of scaling laws to vision encoding models
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2023-08-05T11:10:35.599Z · comments (2)
video games > IQ tests
bhauth · 2023-08-05T13:27:54.697Z · comments (35)
[link] Stomach Ulcers and Dental Cavities
Metacelsus · 2023-08-05T14:08:15.263Z · comments (7)
[link] Join AISafety.info's Writing & Editing Hackathon (Aug 25-28) (Prizes to be won!)
smallsilo (monstrologies) · 2023-08-05T14:08:19.639Z · comments (3)
AISafety.info's Writing & Editing Hackathon
smallsilo (monstrologies) · 2023-08-05T17:14:45.292Z · comments (0)
Seattle Astral Codex Ten Monthly Social
a7x · 2023-08-05T17:55:25.884Z · comments (0)
[link] Ground-Truth Label Imbalance Impairs the Performance of Contrast-Consistent Search (and Other Contrast-Pair-Based Unsupervised Methods)
Tom Angsten (tom-angsten) · 2023-08-05T17:55:46.569Z · comments (2)
Summary of Improving Global Decision Making (around AI)
Will_Pearson · 2023-08-05T18:46:44.268Z · comments (0)
how 2 tell if ur input is out of distribution given only model weights
dkirmani · 2023-08-05T22:45:20.250Z · comments (10)
Aligning my web server with devops practices: part 2 (security)
VipulNaik · 2023-08-06T01:30:35.005Z · comments (0)
Exploring the Multiverse of Large Language Models
franky · 2023-08-06T02:38:02.784Z · comments (0)
The Benevolent Ruler’s Handbook (Part 1): The Policy Problem
FCCC · 2023-08-06T03:46:31.594Z · comments (3)
Safety-First Agents/Architectures Are a Promising Path to Safe AGI
Brendon_Wong · 2023-08-06T08:02:30.072Z · comments (2)
[question] On being in a bad place and too stubborn to leave.
TeaTieAndHat (Augustin Portier) · 2023-08-06T11:45:49.771Z · answers+comments (14)
[link] Model-Based Policy Analysis under Deep Uncertainty
Max Reddel (max-reddel) · 2023-08-06T14:07:36.079Z · comments (1)
[link] Rebooting AI Governance: An AI-Driven Approach to AI Governance
Max Reddel (max-reddel) · 2023-08-06T14:19:50.180Z · comments (1)
Reducing the risk of catastrophically misaligned AI by avoiding the Singleton scenario: the Manyton Variant
GravitasGradient (Bll) · 2023-08-06T14:24:04.774Z · comments (0)
[Linkpost] Will AI avoid exploitation?
cdkg · 2023-08-06T14:28:29.166Z · comments (1)
[link] ‘We’re changing the clouds.’ An unforeseen test of geoengineering is fueling record ocean warmth
Annapurna (jorge-velez) · 2023-08-06T20:58:51.838Z · comments (6)
Computational Thread Art
CallumMcDougall (TheMcDouglas) · 2023-08-06T21:42:30.306Z · comments (2)
[link] Yann LeCun on AGI and AI Safety
Chris_Leong · 2023-08-06T21:56:52.644Z · comments (13)
Problems with Robin Hanson's Quillette Article On AI
DaemonicSigil · 2023-08-06T22:13:43.654Z · comments (33)
Drinks at a bar
yakimoff · 2023-08-07T02:52:19.388Z · comments (0)
The second act: Beginning epistemic rigor at 30
hiAndrewQuinn (hiandrewquinn) · 2023-08-07T09:34:20.923Z · comments (0)
[link] Overview of how AI might exacerbate long-running catastrophic risks
Hauke Hillebrandt (hauke-hillebrandt) · 2023-08-07T11:53:29.171Z · comments (0)
Strengthening the Argument for Intrinsic AI Safety: The S-Curves Perspective
avturchin · 2023-08-07T13:13:42.635Z · comments (0)
Monthly Roundup #9: August 2023
Zvi · 2023-08-07T13:20:03.522Z · comments (25)
[link] What I've been reading, July–August 2023
jasoncrawford · 2023-08-07T14:22:57.046Z · comments (0)
[link] Announcing the Clearer Thinking micro-grants program for 2023
spencerg · 2023-08-07T15:21:28.191Z · comments (1)
Optimisation Measures: Desiderata, Impossibility, Proposals
mattmacdermott · 2023-08-07T15:52:17.624Z · comments (9)
[question] Should I test myself for microplastics?
Augs · 2023-08-07T17:31:41.656Z · answers+comments (2)
[link] Growing Bonsai Networks with RNNs
ameo (ameobea) · 2023-08-07T17:34:15.713Z · comments (5)
Feedbackloop-first Rationality
Raemon · 2023-08-07T17:58:56.349Z · comments (65)
[link] An interactive introduction to grokking and mechanistic interpretability
Adam Pearce (adam-pearce) · 2023-08-07T19:09:19.422Z · comments (3)
[question] Tips for reducing thinking branching factor
Simon Berens (sberens) · 2023-08-07T20:21:43.298Z · answers+comments (6)
A plea for more funding shortfall transparency
porby · 2023-08-07T21:33:11.912Z · comments (4)
[question] How do I find all the items on LW that I've *favorited* or upvoted?
Alex K. Chen (parrot) (alex-k-chen) · 2023-08-07T23:51:05.711Z · answers+comments (3)
Perpetually Declining Population?
jefftk (jkaufman) · 2023-08-08T01:30:00.897Z · comments (29)
Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research
evhub · 2023-08-08T01:30:10.847Z · comments (26)
Notice your everything
metachirality · 2023-08-08T02:38:39.974Z · comments (1)
4 types of AGI selection, and how to constrain them
Remmelt (remmelt-ellen) · 2023-08-08T10:02:53.921Z · comments (3)
My Trial Period as an Independent Alignment Researcher
Bart Bussmann (Stuckwork) · 2023-08-08T14:16:35.122Z · comments (1)
[question] Beginner's question about RLHF
[deleted] · 2023-08-08T15:48:24.118Z · answers+comments (3)