LessWrong 2.0 Reader

Maximizing Communication, not Traffic
jefftk (jkaufman) · 2025-01-05T13:00:02.280Z · comments (10)
Current safety training techniques do not fully transfer to the agent setting
Simon Lermen (dalasnoin) · 2024-11-03T19:24:51.537Z · comments (9)
Ironing Out the Squiggles
Zack_M_Davis · 2024-04-29T16:13:00.371Z · comments (36)
EIS XIII: Reflections on Anthropic’s SAE Research Circa May 2024
scasper · 2024-05-21T20:15:36.502Z · comments (16)
I make several million dollars per year and have hundreds of thousands of followers—what is the straightest line path to utilizing these resources to reduce existential-level AI threats?
shrimpy · 2025-03-16T16:52:42.177Z · comments (25)
[question] things that confuse me about the current AI market.
DMMF · 2024-08-28T13:46:56.908Z · answers+comments (27)
Formal verification, heuristic explanations and surprise accounting
Jacob_Hilton · 2024-06-25T15:40:03.535Z · comments (11)
[question] Have LLMs Generated Novel Insights?
abramdemski · 2025-02-23T18:22:12.763Z · answers+comments (36)
Subskills of "Listening to Wisdom"
Raemon · 2024-12-09T03:01:18.706Z · comments (29)
o3
Zach Stein-Perlman · 2024-12-20T18:30:29.448Z · comments (164)
The Incredible Fentanyl-Detecting Machine
sarahconstantin · 2024-06-28T22:10:01.223Z · comments (26)
Dyslucksia
Shoshannah Tekofsky (DarkSym) · 2024-05-09T19:21:33.874Z · comments (45)
It's been ten years. I propose HPMOR Anniversary Parties.
Screwtape · 2025-02-16T01:43:14.586Z · comments (3)
OpenAI: Exodus
Zvi · 2024-05-20T13:10:03.543Z · comments (26)
Reducing LLM deception at scale with self-other overlap fine-tuning
Marc Carauleanu (Marc-Everin Carauleanu) · 2025-03-13T19:09:43.620Z · comments (40)
"It's a 10% chance which I did 10 times, so it should be 100%"
egor.timatkov · 2024-11-18T01:14:27.738Z · comments (59)
Statistical Challenges with Making Super IQ babies
Jan Christian Refsgaard (jan-christian-refsgaard) · 2025-03-02T20:26:22.103Z · comments (26)
A Rocket–Interpretability Analogy
plex (ete) · 2024-10-21T13:55:18.184Z · comments (31)
Liability regimes for AI
Ege Erdil (ege-erdil) · 2024-08-19T01:25:01.006Z · comments (34)
[link] Arithmetic is an underrated world-modeling technology
dynomight · 2024-10-17T14:00:22.475Z · comments (33)
My takes on SB-1047
leogao · 2024-09-09T18:38:37.799Z · comments (8)
Priors and Prejudice
MathiasKB (MathiasKirkBonde) · 2024-04-22T15:00:41.782Z · comments (31)
[link] Daniel Dennett has died (1942-2024)
kave · 2024-04-19T16:17:04.742Z · comments (5)
“Alignment Faking” frame is somewhat fake
Jan_Kulveit · 2024-12-20T09:51:04.664Z · comments (13)
[link] Quotes from the Stargate press conference
Nikola Jurkovic (nikolaisalreadytaken) · 2025-01-22T00:50:14.793Z · comments (7)
OpenAI #10: Reflections
Zvi · 2025-01-07T17:00:07.348Z · comments (7)
[link] The Checklist: What Succeeding at AI Safety Will Involve
Sam Bowman (sbowman) · 2024-09-03T18:18:34.230Z · comments (49)
[link] Self-fulfilling misalignment data might be poisoning our AI models
TurnTrout · 2025-03-02T19:51:14.775Z · comments (25)
The Sorry State of AI X-Risk Advocacy, and Thoughts on Doing Better
Thane Ruthenis · 2025-02-21T20:15:11.545Z · comments (51)
OpenAI o1
Zach Stein-Perlman · 2024-09-12T17:30:31.958Z · comments (41)
[link] Conceptual Rounding Errors
Jan_Kulveit · 2025-03-26T19:00:31.549Z · comments (15)
Methods for strong human germline engineering
TsviBT · 2025-03-03T08:13:49.414Z · comments (28)
Levels of Friction
Zvi · 2025-02-10T13:10:07.224Z · comments (7)
Capital Ownership Will Not Prevent Human Disempowerment
beren · 2025-01-05T06:00:23.095Z · comments (18)
[link] Stanislav Petrov Quarterly Performance Review
Ricki Heicklen (bayesshammai) · 2024-09-26T21:20:11.646Z · comments (3)
Repeal the Jones Act of 1920
Zvi · 2024-11-27T15:00:06.801Z · comments (24)
[link] Decomposing Agency — capabilities without desires
owencb · 2024-07-11T09:38:48.509Z · comments (32)
Don’t ignore bad vibes you get from people
Kaj_Sotala · 2025-01-18T09:20:17.397Z · comments (50)
AI companies are unlikely to make high-assurance safety cases if timelines are short
ryan_greenblatt · 2025-01-23T18:41:40.546Z · comments (5)
LLMs for Alignment Research: a safety priority?
abramdemski · 2024-04-04T20:03:22.484Z · comments (24)
[link] Power Lies Trembling: a three-book review
Richard_Ngo (ricraz) · 2025-02-22T22:57:59.720Z · comments (7)
Activation space interpretability may be doomed
bilalchughtai (beelal) · 2025-01-08T12:49:38.421Z · comments (32)
The Information: OpenAI shows 'Strawberry' to feds, races to launch it
Martín Soto (martinsq) · 2024-08-27T23:10:18.155Z · comments (15)
0. CAST: Corrigibility as Singular Target
Max Harms (max-harms) · 2024-06-07T22:29:12.934Z · comments (12)
[link] Nursing doubts
dynomight · 2024-08-30T02:25:36.826Z · comments (23)
The "Think It Faster" Exercise
Raemon · 2024-12-11T19:14:10.427Z · comments (35)
Value Claims (In Particular) Are Usually Bullshit
johnswentworth · 2024-05-30T06:26:21.151Z · comments (18)
[link] Fields that I reference when thinking about AI takeover prevention
Buck · 2024-08-13T23:08:54.950Z · comments (16)
[link] China Hawks are Manufacturing an AI Arms Race
garrison · 2024-11-20T18:17:51.958Z · comments (44)
When is a mind me?
Rob Bensinger (RobbBB) · 2024-04-17T05:56:38.482Z · comments (130)