LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

A bet for Samo Burja
Nathan Helm-Burger (nathan-helm-burger) · 2024-09-05T16:01:35.440Z · comments (2)

[link] Diffusion Guided NLP: better steering, mostly a good thing
Nathan Helm-Burger (nathan-helm-burger) · 2024-08-10T19:49:50.963Z · comments (0)

What can we learn from insecure domains?
Logan Zoellner (logan-zoellner) · 2024-11-01T23:53:30.066Z · comments (1)

[question] Looking for intuitions to extend bargaining notions
ProgramCrafter (programcrafter) · 2024-08-24T05:00:13.995Z · answers+comments (0)

[link] Metaculus Is Open Source
ChristianWilliams · 2024-10-07T19:55:31.035Z · comments (0)

Chevy Bolt Review
jefftk (jkaufman) · 2024-09-26T13:40:05.456Z · comments (2)

A Case for Conscious Significance rather than Free Will.
James Stephen Brown (james-brown) · 2024-10-25T23:20:30.834Z · comments (2)

Editing at the Take Level
jefftk (jkaufman) · 2024-09-24T11:30:04.914Z · comments (1)

Shutting down all competing AI projects might not buy a lot of time due to Internal Time Pressure
ThomasCederborg · 2024-10-03T00:01:34.011Z · comments (7)

SAE Probing: What is it good for? Absolutely something!
Subhash Kantamneni (subhashk) · 2024-11-01T19:23:55.418Z · comments (0)

Apartment Price Map Discontinuity
jefftk (jkaufman) · 2024-08-19T15:30:05.386Z · comments (0)

[question] If AI is in a bubble and the bubble bursts, what would you do?
Remmelt (remmelt-ellen) · 2024-08-19T10:56:03.948Z · answers+comments (12)

LLM Psychometrics and Prompt-Induced Psychopathy
Korbinian K. (korbinian-koch) · 2024-10-18T18:11:24.256Z · comments (2)

Critique of 'Many People Fear A.I. They Shouldn't' by David Brooks.
Axel Ahlqvist (axelahlqvist1995@gmail.com) · 2024-08-15T18:38:13.437Z · comments (8)

Amoeba roles in tech
Sindhu Shivaprasad (sindhu-shivaprasad) · 2024-10-04T17:25:46.568Z · comments (0)

Live Machinery: Interface Design Workshop for AI Safety @ EA Hotel
Sahil · 2024-11-01T17:24:09.957Z · comments (0)

Source Control for Prototyping and Analysis
jefftk (jkaufman) · 2024-09-26T01:50:04.145Z · comments (0)

[link] How to Fake Decryption
ohmurphy · 2024-09-05T09:18:41.586Z · comments (0)

Do you want to do a debate on youtube? I'm looking for polite, truth-seeking participants.
Nathan Young · 2024-10-10T09:32:59.162Z · comments (0)

Lenses of Control
WillPetillo · 2024-10-22T07:51:06.355Z · comments (0)

Contextual Constitutional AI
aksh-n · 2024-09-28T23:24:43.529Z · comments (1)

Binary encoding as a simple explicit construction for superposition
tailcalled · 2024-10-12T21:18:31.731Z · comments (0)

[link] AI & wisdom 3: AI effects on amortised optimisation
L Rudolf L (LRudL) · 2024-10-28T21:08:56.604Z · comments (0)

[link] AI & wisdom 2: growth and amortised optimisation
L Rudolf L (LRudL) · 2024-10-28T21:07:39.449Z · comments (0)

[link] AI Prejudices: Practical Implications
PeterMcCluskey · 2024-10-19T02:19:58.695Z · comments (0)

How I'd like alignment to get done (as of 2024-10-18)
TristanTrim · 2024-10-18T23:39:03.107Z · comments (2)

[Cross-post] Book Review: Bureaucracy, by James Q Wilson
davekasten · 2024-08-19T13:57:10.872Z · comments (0)

SYSTEMA ROBOTICA
Ali Ahmed (roboticali) · 2024-08-12T20:34:45.879Z · comments (2)

Clarifying Alignment Fundamentals Through the Lens of Ontology
eternal/ephemera · 2024-10-07T20:57:33.238Z · comments (4)

[link] GPT-4o Guardrails Gone: Data Poisoning & Jailbreak-Tuning
ChengCheng (ccstan99) · 2024-11-01T00:10:50.718Z · comments (0)

Beyond Defensive Technology
ejk64 · 2024-10-14T11:34:24.595Z · comments (1)

[link] Intention-to-Treat (Re: How harmful is music, really?)
kqr · 2024-09-18T18:44:41.128Z · comments (0)

Conversational Signposts—An Antidote to Dull Social Interactions
Declan Molony (declan-molony) · 2024-10-22T05:37:56.175Z · comments (6)

What You Can Give Instead of Advice
Karl Faulks (karl-faulks) · 2024-10-24T23:10:48.014Z · comments (2)

Restructuring Pop Songs for Contra
jefftk (jkaufman) · 2024-08-18T14:10:04.029Z · comments (0)

[unlisted] Beneficial applications for current-level AI in human information systems? More likely than you'd think!
mako yass (MakoYass) · 2024-08-16T20:49:57.582Z · comments (0)

Motte-and-Bailey: a Short Explanation
Lorec · 2024-10-23T22:29:55.074Z · comments (0)

[link] AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety?
Corin Katzke (corin-katzke) · 2024-08-21T18:09:33.284Z · comments (0)

Substituting Talkbox for Breath Controller
jefftk (jkaufman) · 2024-10-27T19:10:03.768Z · comments (0)

Tall tales and long odds
Solenoid_Entity · 2024-08-10T15:22:16.958Z · comments (0)

Switching to a Yamaha P-121 Keyboard
jefftk (jkaufman) · 2024-10-02T02:20:02.284Z · comments (0)

Palisade is hiring: Exec Assistant, Content Lead, Ops Lead, and Policy Lead
Charlie Rogers-Smith (charlie.rs) · 2024-10-09T00:04:03.837Z · comments (0)

We Don't Just Let People Die—So What Next?
James Stephen Brown (james-brown) · 2024-08-03T01:04:49.756Z · comments (8)

Four Phases of AGI
Gabe M (gabe-mukobi) · 2024-08-05T13:15:23.406Z · comments (3)

[link] Mechanistic Anomaly Detection Research Update
Nora Belrose (nora-belrose) · 2024-08-06T10:33:26.031Z · comments (0)

Updating the NAO Simulator
jefftk (jkaufman) · 2024-10-30T13:50:06.908Z · comments (0)

What is Randomness?
martinkunev · 2024-09-27T17:49:42.704Z · comments (2)

Can startups be impactful in AI safety?
Esben Kran (esben-kran) · 2024-09-13T19:00:33.306Z · comments (0)

[link] AISafety.info: What are Inductive Biases?
Algon · 2024-09-19T17:26:24.581Z · comments (4)

Self location for LLMs by LLMs: Self-Assessment Checklist.
weightt an (weightt-an) · 2024-09-26T19:57:31.707Z · comments (0)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

johnswentworth on Ryan Kidd's Shortform

Y'know, you probably have the data to do a quick-and-dirty check here. Take a look at the GRE/SAT scores on the applications (both for applicant pool and for accepted scholars). If most scholars have much-less-than-perfect scores, then you're probably not hiring the top tier (standardized tests have a notoriously low ceiling). And assuming most scholars aren't hitting the test ceiling, you can also test the hypothesis about different domains by looking at the test score distributions for scholars in the different areas.

saidachmiz on JargonBot Beta Test

I in fact don’t use Google very much these days, and don’t particularly recommend that anyone else do so, either.

(If by “google” you meant “search engines in general”, then that’s a bit different, of course. But then, the analogy here would be to something like “carefully select which LLM products you use, try to minimize their use, avoid the popular ones, and otherwise take all possible steps to ensure that LLMs affect what you see and do as little as possible”.)

saidachmiz on JargonBot Beta Test

The most important thing is “There is a small number of individuals who are paying attention, who you can argue with, and if you don’t like what they’re doing, I encourage you to write blogposts or comments complaining about it. And if your arguments make sense to me/us, we might change our mind. If they don’t make sense, but there seems to be some consensus that the arguments are true, we might lose the Mandate of Heaven or something.”

There’s not, like, anything necessarily wrong with this, on its own terms, but… this is definitely not what “being held accountable” is.

It happening at all already constitutes “going wrong”.

This particular sort of comment doesn’t particularly move me.

All this really means is that you’ll just do with this whatever you feel like doing. Which, again, is not necessarily “wrong”, and really it’s the default scenario for, like… websites, in general… I just really would like to emphasize that “being held accountable” has approximately nothing to do with anything that you’re describing.

As far as the specifics go… well, the bad effect here is that instead of the site being a way for me to read the ideas and commentary of people whose thoughts and writings I find interesting, it becomes just another purveyor of AI “extruded writing product”. I really don’t know why I’d want more of that than there already is, all over the internet. I mean… it’s a bad thing. Pretty straightforwardly. If you don’t think so then I don’t know what to tell you.

All I can say is that this sort of thing drastically reduces my interest in participating here. But then, my participation level has already been fairly low for a while, so… maybe that doesn’t matter very much, either. On the other hand, I don’t think that I’m the only one who has this opinion of LLM outputs.

mindprison on The Cartesian Crisis

All sources are cited within here - https://www.mindprison.cc/p/the-cartesian-crisis

norimori1992 on D&D Sci Coliseum: Arena of Data

Dang, I missed seeing this before the solution was posted. And oh dear, it's high-complexity. Oh well, I'll give it a shot anyway!

Edit: Hah, I spent an hour checking one thing (which went nowhere), and then ran out of steam, and now I can no longer resist checking the answer. So much for that 😅 Next time I'll try to check my notifications more often so I see the next one before the answer is up, maybe that'll give me more drive to keep at it.

d0themath on dirk's Shortform

Yes, the field is definitely more funding constrained than talent constrained right now

danielfilan on 2024 Unofficial LW Community Census, Request for Comments

Oh I misread it as "eighty percent of the effort" oops.

abandon on dirk's Shortform

people sometimes talk about whether you should go into policy, ai research, ea direct work, etc. but afaict all of those fields work like normal careers where actually you have to spend several years resume-building before painstakingly convincing people you're worth hiring for paid work. so imo these are not actually high-leverage paths to impact and the fields are not in fact short on people.

vaniver on JargonBot Beta Test

Consider the reaction my comment from three months [LW(p) · GW(p)] ago got.

elizabeth-1 on Why I quit effective altruism, and why Timothy Telleen-Lawton is staying (for now)

There’s a lot here and if my existing writing didn’t answer your questions, I’m not optimistic another comment will help^[1]. Instead, how about we find something to bet on? It’s difficult to identify something both cruxy and measurable, but here are two ideas:

I see a pattern of:
1. CEA takes some action with the best of intentions
2. It takes a few years for the toll to come out, but eventually there’s a negative consensus on it.
3. A representative of CEA agrees the negative consensus is deserved, but since it occurred under old leadership, doesn’t think anyone should draw conclusions about new leadership from it.
4. CEA announces new program with the best of intentions.

So I would bet that within 3 years, a CEA representative will repudiate a major project occurring under Zach’s watch.

I would also bet on more posts similar to Bad Omens in Current Community Building or University Groups Need Fixing coming out in a few years, talking about 2024 recruiting.

^{^}
Although you might like Change my mind: Veganism entails trade-offs, and health is one of the axes [LW · GW] (the predecessor to EA Vegan Advocacy is not Truthseeking) and Truthseeking when your disagreements lie in moral philosophy [LW · GW] and Love, Reverence, and Life [LW · GW] (dialogues with a vegan commenter on the same post)