LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Dialogue introduction to Singular Learning Theory
Olli Järviniemi (jarviniemi) · 2024-07-08T16:58:10.108Z · comments (14)

Explaining a Math Magic Trick
Robert_AIZI · 2024-05-05T19:41:52.048Z · comments (10)

Key takeaways from our EA and alignment research surveys
Cameron Berg (cameron-berg) · 2024-05-03T18:10:41.416Z · comments (10)

Deceptive AI ≠ Deceptively-aligned AI
Steven Byrnes (steve2152) · 2024-01-07T16:55:13.761Z · comments (19)

[link] Ilya Sutskever created a new AGI startup
harfe · 2024-06-19T17:17:17.366Z · comments (35)

[link] Explaining Impact Markets
Saul Munn (saul-munn) · 2024-01-31T09:51:27.587Z · comments (2)

I am the Golden Gate Bridge
Zvi · 2024-05-27T14:40:03.216Z · comments (6)

Catching AIs red-handed
ryan_greenblatt · 2024-01-05T17:43:10.948Z · comments (21)

Counting arguments provide no evidence for AI doom
Nora Belrose (nora-belrose) · 2024-02-27T23:03:49.296Z · comments (188)

[link] Compact Proofs of Model Performance via Mechanistic Interpretability
LawrenceC (LawChan) · 2024-06-24T19:27:21.214Z · comments (3)

[link] Almost everyone I’ve met would be well-served thinking more about what to focus on
Henrik Karlsson (henrik-karlsson) · 2024-01-05T21:01:27.861Z · comments (8)

Kids or No kids
Kids or no kids (grosseholz.f@gmail.com) · 2023-11-14T18:37:02.799Z · comments (10)

[link] Ideological Bayesians
Kevin Dorst · 2024-02-25T14:17:25.070Z · comments (4)

On Claude 3.5 Sonnet
Zvi · 2024-06-24T12:00:05.719Z · comments (14)

[question] How to get nerds fascinated about mysterious chronic illness research?
riceissa · 2024-05-27T22:58:29.707Z · answers+comments (50)

[link] Uncovering Deceptive Tendencies in Language Models: A Simulated Company AI Assistant
Olli Järviniemi (jarviniemi) · 2024-05-06T07:07:05.019Z · comments (13)

[link] MIRI's April 2024 Newsletter
Harlan · 2024-04-12T23:38:20.781Z · comments (0)

OpenAI's Sora is an agent
CBiddulph (caleb-biddulph) · 2024-02-16T07:35:52.171Z · comments (25)

Refactoring cryonics as structural brain preservation
Andy_McKenzie · 2024-09-11T18:36:30.285Z · comments (14)

[link] the Giga Press was a mistake
bhauth · 2024-08-21T04:51:24.150Z · comments (26)

Sparsify: A mechanistic interpretability research agenda
Lee Sharkey (Lee_Sharkey) · 2024-04-03T12:34:12.043Z · comments (22)

[link] I found >800 orthogonal "write code" steering vectors
Jacob G-W (g-w1) · 2024-07-15T19:06:17.636Z · comments (19)

[link] Things You’re Allowed to Do: University Edition
Saul Munn (saul-munn) · 2024-02-06T00:36:11.690Z · comments (13)

Access to powerful AI might make computer security radically easier
Buck · 2024-06-08T06:00:19.310Z · comments (14)

[link] RAND report finds no effect of current LLMs on viability of bioterrorism attacks
StellaAthena · 2024-01-25T19:17:30.493Z · comments (14)

[link] Against Aschenbrenner: How 'Situational Awareness' constructs a narrative that undermines safety and threatens humanity
GideonF · 2024-07-15T18:37:40.232Z · comments (17)

It's time for a self-reproducing machine
Carl Feynman (carl-feynman) · 2024-08-07T21:52:22.819Z · comments (68)

Towards a Less Bullshit Model of Semantics
johnswentworth · 2024-06-17T15:51:06.060Z · comments (44)

A Solomonoff Inductor Walks Into a Bar: Schelling Points for Communication
johnswentworth · 2024-07-26T00:33:42.000Z · comments (1)

Apollo Research 1-year update
Marius Hobbhahn (marius-hobbhahn) · 2024-05-29T17:44:32.484Z · comments (0)

[link] Sabotage Evaluations for Frontier Models
David Duvenaud (david-duvenaud) · 2024-10-18T22:33:14.320Z · comments (11)

You can, in fact, bamboozle an unaligned AI into sparing your life
David Matolcsi (matolcsid) · 2024-09-29T16:59:43.942Z · comments (171)

Notes on Dwarkesh Patel’s Podcast with Demis Hassabis
Zvi · 2024-03-01T16:30:08.687Z · comments (0)

[question] Am I confused about the "malign universal prior" argument?
nostalgebraist · 2024-08-27T23:17:22.779Z · answers+comments (33)

Takeoff speeds presentation at Anthropic
Tom Davidson (tom-davidson-1) · 2024-06-04T22:46:35.448Z · comments (0)

SB 1047: Final Takes and Also AB 3211
Zvi · 2024-08-27T22:10:07.647Z · comments (11)

On attunement
Joe Carlsmith (joekc) · 2024-03-25T12:47:34.856Z · comments (8)

OpenAI: The Board Expands
Zvi · 2024-03-12T14:00:04.110Z · comments (1)

[link] The Soul Key
Richard_Ngo (ricraz) · 2023-11-04T17:51:53.176Z · comments (9)

The case for unlearning that removes information from LLM weights
Fabien Roger (Fabien) · 2024-10-14T14:08:04.775Z · comments (14)

Defining alignment research
Richard_Ngo (ricraz) · 2024-08-19T20:42:29.279Z · comments (23)

Everything Wrong with Roko's Claims about an Engineered Pandemic
WitheringWeights (EZ97) · 2024-02-22T15:59:08.439Z · comments (10)

New page: Integrity
Zach Stein-Perlman · 2024-07-10T15:00:41.050Z · comments (3)

Meaning & Agency
abramdemski · 2023-12-19T22:27:32.123Z · comments (17)

Quotes from Leopold Aschenbrenner’s Situational Awareness Paper
Zvi · 2024-06-07T11:40:03.981Z · comments (10)

Announcing Neuronpedia: Platform for accelerating research into Sparse Autoencoders
Johnny Lin (hijohnnylin) · 2024-03-25T21:17:58.421Z · comments (7)

Just admit that you’ve zoned out
joec · 2024-06-04T02:51:27.594Z · comments (22)

How to train your own "Sleeper Agents"
evhub · 2024-02-07T00:31:42.653Z · comments (11)

Circular Reasoning
abramdemski · 2024-08-05T18:10:32.736Z · comments (36)

[link] Finishing The SB-1047 Documentary In 6 Weeks
Michaël Trazzi (mtrazzi) · 2024-10-28T20:17:47.465Z · comments (5)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

morpheus on The Compendium, A full argument about extinction risk from AGI

Typo in the linked document:

There is no one is coming to save us.

richard_kennaway on JargonBot Beta Test

Both. I do not want to have AI content added to my post without my knowledge or consent.

In fact, thinking further about it, I do not want AI content added to anyone's post without their knowledge or consent, anywhere, not just on LessWrong.

Such content could be seen as just automating what people can do anyway with an LLM open in another window. I've no business trying to stop people doing that. However, someone doing that knows what they are doing. If the stuff pops up automatically amidst the author's original words, will they be so aware of its source and grok that the author had nothing to do with it? I do not think that the proposed discreet "AI-generated" label is enough to make it clear that such content is third-party commentary, for which the author carries no responsibility.

But then, who does carry that responsibility? No-one. An AI's words are news from nowhere. No-one's reputation is put on the line by uttering them. For it is written, the fundamental question of rationality is "What do I think I know and how do I think I know it?" But these AI popovers cannot be questioned.

And also, I do not personally want to be running into any writing that AI had a hand in.

(Oh, hey, you're the one who wrote Please do not use AI to write for you [LW · GW])

I am that person, and continue to be.

sarahconstantin on sarahconstantin's Shortform

links 11/01/2024: https://roamresearch.com/#/app/srcpublic/page/11-01-2024

https://en.m.wikipedia.org/wiki/Neats_and_scruffies a typology of AI researchers
https://notes.andymatuschak.org/About_these_notes Andy Matuschak's working notes, mostly about educational technology (but not educational games!)
- https://notes.andymatuschak.org/zUVBJdPc4kBud5fsLmPFpbw
https://notes.manjarinarayan.org/ Manjari Narayan's notes, mostly about statistics
https://www.washingtonpost.com/health/2024/05/06/ultrasound-addiction-treatment/ ultrasound being used as an addiction treatment -- the full study results aren't published yet, but the anecdotes suggest very dramatic effects.
all drugs for neuropathic pain have poor success rates.
https://pubmed.ncbi.nlm.nih.gov/24291734/ lots of people -- maybe 6-10% of the world population -- have neuropathic pain.
https://pmc.ncbi.nlm.nih.gov/articles/PMC3201926/ chronic pain generally affects about 20% of adults worldwide.
roughly half of opioid addicts treated with buprenorphine or methadone manage to abstain for 30 days after treatment: https://pubmed.ncbi.nlm.nih.gov/26599131/
https://www.whitehouse.gov/ondcp/briefing-room/2021/05/28/biden-harris-administration-calls-for-historic-levels-of-funding-to-prevent-and-treat-addiction-and-overdose/ the Biden-Harris administration has allocated $41B to preventing and treating drug addiction; hard to extract from that exactly how much is spent on rehab/treatment vs. anti-drug campaigns or law enforcement
- https://www.forbes.com/sites/danmunro/2015/04/27/inside-the-35-billion-addiction-treatment-industry/ US addiction treatment spending was estimated at $35B/year back in 2015
Vampire Weekend's Ezra Koenig:
- their latest album Only God Was Above Us is wrenching and it's kind of getting to me lately.
  - most of the commentary in interviews is about how Koening, now 40 with a 5-year-old kid, has matured and found peace (though if you listen to the lyrics it's an extremely nihilistic sort of being "at peace" with a terrible world and giving up on trying to change it)
  - nobody is remarking on what I see as pretty explicit themes like:
    - last album's "Harmony Hall" was about a sense of betrayal regarding Ivy-League antisemitism
    - this album is pretty clearly a rejection of the backlash, the Gen-X ("Gen X Cops"), ex-Eastern-Bloc ("Pravda"), or specifically Jewish (in the [[Bari Weiss]]/Tablet-mag vein) "vibe shift".
      - there's a lot of reflection on heritage and generation gaps, there's the sense that someone (his elders? his family?) is pushing him in a direction and he doesn't want to go that way, he thinks it doesn't make sense in his generation, in this era, but he does care enough to be conflicted and to yearn over the pain of people still (mistakenly, he thinks) struggling ("Capricorn").
- https://en.wikipedia.org/wiki/Ezra_Koenig
- https://people.com/vampire-weekend-ezra-koenig-finally-feels-adult-exclusive-8625179
- https://www.theguardian.com/us-news/2016/jun/20/bernie-sanders-vampire-weekend-grizzly-bear-endorsements
- https://www.theguardian.com/music/2024/mar/23/ezra-koenig-vampire-weekend-interview
- https://www.thejc.com/life-and-culture/music/vampire-weekend-dont-call-us-white-c3xbezac

nathan-helm-burger on johnswentworth's Shortform

My own attempt is much less well written and comprehensive, but I think I hit on some points that theirs misses: https://www.lesswrong.com/posts/NRZfxAJztvx2ES5LG/a-path-to-human-autonomy [LW · GW]

lorec on Lorec's Shortform

Recently, Raginrayguns [LW · GW] and Philosophy Bear both [presumably] read "Cargo Cult Science" [not necessarily for the first time] on /r/slatestarcodex. I follow both of them, so I looked into it. And TIL that's where "cargo-culting" comes from. He doesn't say why it's wrong, he just waves his hands and says it doesn't work and it's silly. Well, now I feel silly. I've been cargo-culting "cargo-culting". I'm a logical decision theorist. Cargo cults work. If they work unreliably, so do reductionistic methods.

notfnofn on you should probably eat oatmeal sometimes

My diet is heavily, heavily oatmeal (out of laziness). Like on most days I just eat oatmeal + peanut butter + walnuts microwaved in water until I get home (and I do ~45 minutes of HIIT or strength training every morning so it's not like I have a very low BMR). Maybe I should track my exact diet and bloodwork over time and report it here.

raemon on JargonBot Beta Test

To doublecheck/clarify: do you feel strongly (or, weakly) that you don't want autogenerated jargon to exist on your posts for people who click the "opt into non-author-endorsed AI content" for that post? Or simply that you don't personally want to be running into it?

(Oh, hey, you're the one who wrote Please do not use AI to write for you [LW · GW])

avturchin on The Compendium, A full argument about extinction risk from AGI

Your central argument seems to be a metaphor: We caused the Holocene extinction of animals, so godlike AI will kill us.

The problem with metaphorical arguments is that they can be reversed. As humans have become more intelligent, we've started to value animals, created zoos, natural reserves and now even work on the resurrection of extinct animals like mammoths. See more examples of such reversal by Gwern https://gwern.net/modus

Presenting weak arguments is evidence that there are no strong arguments, and this is obvious to outside readers.

The main problem is that we can't predict what superintelligent AI will do, and thus we can't 100 percent prove that it will necessarily kill us. But we shouldn't have to.

Instead, we should show that superintelligence will disempower us and that it may want to kill us for some reasons.

sable on What TMS is like

I do think there's something to that idea - physical injury and pain is a very universal and visible experience, whereas mental illness is difficult to parse for those who've never experienced it. I also think there's some sense in which 'treatment' and 'cure' are treated differently for mental and physical illness.

A doctor wouldn't just prescribe painkillers for a broken arm and call it a day because your symptoms have been dealt with; they'd want to actually fix the problem. Depression, on the other hand, doctors seem perfectly fine with merely mitigating the symptoms. Perhaps because that's all they're confident they can do?

avturchin on What TMS is like

BTW, memantine is weak (but legal) analog of ketamine and helped me to cure my depression.