LessWrong 2.0 Reader

Electric Grid Cyberattack: An AI-Informed Threat Model
moonlightmaze · 2024-11-11T21:34:17.190Z · comments (0)

New Funding Category Open in Foresight's AI Safety Grants
Allison Duettmann (allison-duettmann) · 2024-11-06T22:59:41.065Z · comments (0)

Two arguments against longtermist thought experiments
momom2 (amaury-lorin) · 2024-11-02T10:22:11.311Z · comments (5)

[link] Levers for Biological Progress - A Response to "Machines of Loving Grace"
Niko_McCarty (niko-2) · 2024-11-01T16:35:08.221Z · comments (0)

LifeKeeper Diaries: Exploring Misaligned AI Through Interactive Fiction
Tristan Tran (tristan-tran) · 2024-11-09T20:58:09.182Z · comments (5)

Chaos Theory in Ecology
Elizabeth (pktechgirl) · 2024-11-09T17:50:01.727Z · comments (2)

2024 NYC Secular Solstice & Megameetup
Joe Rogero · 2024-11-12T17:46:18.674Z · comments (0)

Current Attitudes Toward AI Provide Little Data Relevant to Attitudes Toward AGI
Seth Herd · 2024-11-12T18:23:53.533Z · comments (2)

[question] How can we prevent AGI value drift?
Dakara (chess-ice) · 2024-11-20T18:19:24.375Z · answers+comments (4)

[link] What if muscle tension is sometimes signal jamming?
Chipmonk · 2024-11-04T21:08:47.800Z · comments (1)

Aligning AI Safety Projects with a Republican Administration
Deric Cheng (deric-cheng) · 2024-11-21T22:12:27.502Z · comments (0)

AXRP Episode 38.0 - Zhijing Jin on LLMs, Causality, and Multi-Agent Systems
DanielFilan · 2024-11-14T07:00:06.977Z · comments (0)

Secular Solstice Songbook Update
jefftk (jkaufman) · 2024-11-17T17:30:07.404Z · comments (1)

[link] Disentangling Representations through Multi-task Learning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-24T13:10:26.307Z · comments (1)

[link] AI & wisdom 3: AI effects on amortised optimisation
L Rudolf L (LRudL) · 2024-10-28T21:08:56.604Z · comments (0)

What can we learn from insecure domains?
Logan Zoellner (logan-zoellner) · 2024-11-01T23:53:30.066Z · comments (21)

[link] AI & wisdom 2: growth and amortised optimisation
L Rudolf L (LRudL) · 2024-10-28T21:07:39.449Z · comments (0)

Dance Differentiation
jefftk (jkaufman) · 2024-11-15T02:30:07.694Z · comments (0)

Registrations Open for 2024 NYC Secular Solstice & Megameetup
Joe Rogero · 2024-11-12T17:50:10.827Z · comments (0)

[question] Why is Gemini telling the user to die?
Burny · 2024-11-18T01:44:12.583Z · answers+comments (1)

[link] The lying p value
kqr · 2024-11-12T06:12:59.934Z · comments (6)

Paraddictions: unreasonably compelling behaviors and their uses
Michael Cohn (michael-cohn) · 2024-11-22T20:53:59.479Z · comments (0)

[link] I, Token
Ivan Vendrov (ivan-vendrov) · 2024-11-25T02:20:35.629Z · comments (2)

Curriculum of Ascension
andrew sauer (andrew-sauer) · 2024-11-07T23:54:18.983Z · comments (0)

[link] [Linkpost] Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Gunnar_Zarncke · 2024-11-04T10:15:35.550Z · comments (0)

Goal: Understand Intelligence
Johannes C. Mayer (johannes-c-mayer) · 2024-11-03T21:20:02.900Z · comments (19)

AXRP Episode 38.1 - Alan Chan on Agent Infrastructure
DanielFilan · 2024-11-16T23:30:09.098Z · comments (0)

A Poem Is All You Need: Jailbreaking ChatGPT, Meta & More
Sharat Jacob Jacob (sharat-jacob-jacob) · 2024-10-29T12:41:30.337Z · comments (0)

ML4Good (AI Safety Bootcamp) - Experience report
JanEbbing · 2024-11-05T01:18:43.554Z · comments (0)

The current state of RSPs
Zach Stein-Perlman · 2024-11-04T16:00:42.630Z · comments (0)

GPT-4o Can In Some Cases Solve Moderately Complicated Captchas
dirk (abandon) · 2024-11-09T04:04:37.782Z · comments (2)

Fundamental Uncertainty: Chapter 9 - How do we live with uncertainty?
Gordon Seidoh Worley (gworley) · 2024-11-07T18:15:45.049Z · comments (2)

[link] Anthropic - The case for targeted regulation
anaguma · 2024-11-05T07:07:48.174Z · comments (0)

Why We Wouldn't Build Aligned AI Even If We Could
Snowyiu · 2024-11-16T20:19:59.324Z · comments (7)

Don't Dismiss on Epistemics
ggex · 2024-11-19T00:44:05.329Z · comments (3)

Spooky Recommendation System Scaling
phdead · 2024-10-31T22:00:51.728Z · comments (0)

Updating the NAO Simulator
jefftk (jkaufman) · 2024-10-30T13:50:06.908Z · comments (0)

Substituting Talkbox for Breath Controller
jefftk (jkaufman) · 2024-10-27T19:10:03.768Z · comments (0)

Arthropod (non) sentience
Arturo Macias (arturo-macias) · 2024-11-25T16:01:58.514Z · comments (6)

The Three Warnings of the Zentradi
Trevor Hill-Hand (Jadael) · 2024-11-21T20:28:45.567Z · comments (0)

Festival Stats 2024
jefftk (jkaufman) · 2024-11-12T02:00:04.831Z · comments (0)

Expected Utility, Geometric Utility, and Other Equivalent Representations
StrivingForLegibility · 2024-11-20T23:28:21.826Z · comments (0)

[link] Proposing the Conditional AI Safety Treaty (linkpost TIME)
otto.barten (otto-barten) · 2024-11-15T13:59:01.050Z · comments (8)

[question] Using hex to get murder advice from GPT-4o
Laurence Freeman (laurence-freeman) · 2024-11-13T18:30:23.475Z · answers+comments (5)

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward type
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-23T12:45:01.067Z · comments (0)

Sideloading: creating a model of a person via LLM with very large prompt
avturchin · 2024-11-22T16:41:28.293Z · comments (4)

A Sober Look at Steering Vectors for LLMs
Joschka Braun (joschka-braun) · 2024-11-23T17:30:00.745Z · comments (0)

Fundamental Uncertainty: Epilogue
Gordon Seidoh Worley (gworley) · 2024-11-16T00:57:48.823Z · comments (0)

[link] Book Review: Replacing Guilt - On Having Something to Fight For
Cole Killian (cole-killian) · 2024-11-03T19:47:35.093Z · comments (0)

Prediction markets and Taxes
Edmund Nelson (edmund-nelson) · 2024-11-01T17:39:35.191Z · comments (7)

Recent comments

cousin_it on a space habitat design

The thing is, for a given amount of centrifugal force, required material strength is proportional to radius.

Yeah. I wondered what if we have a series of concentric cylinders connected by spokes or walls, then the required material strength might be lower? The amount of material grows with radius, and you can't use all floors as living space because they will have different g's, but maybe some floors could be industry or infrastructure. And maybe the structure could be safer as it wouldn't have so many moving parts in contact.
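
A rough worked version of the scaling claim quoted above, assuming an idealized thin, self-supporting rotating hoop of uniform density. The symbols (rho for density, omega for angular velocity, a for rim acceleration, sigma for hoop stress) are illustrative and do not come from the original comment:

```latex
\begin{align}
a      &= \omega^2 r        && \text{(centripetal acceleration at the rim)} \\
\sigma &= \rho\,\omega^2 r^2 && \text{(hoop stress in a thin rotating ring)} \\
       &= \rho\, a\, r      && \text{(substituting } \omega^2 = a / r \text{)}
\end{align}
```

At fixed a, the required specific strength sigma/rho therefore grows linearly with r. This sketch ignores payload, spokes, and any other non-structural mass, so it only illustrates the proportionality, not a real design margin.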

rom on Locally optimal psychology

If there were no downsides to resolving a persistent issue, then why has it lasted so long??

If I understand correctly, your claim is that when we see long-standing issues like depression, chronic neck pain, or patterns of emotional avoidance persisting for years, the cause is more likely than not some sort of adaptive coping strategy (essentially a way the mind or body protects itself from harm); otherwise the issue would have been resolved.

Why do you think this is more likely than a mundane explanation such as "bad luck in the genetic lottery, no obvious levers to pull"?

lao-mein on DeepSeek beats o1-preview on math, ties on coding; will release weights

Is there a reason why every LLM tokenizer I've seen excludes slurs? It seems like a cheap way to train for AI assistant behavior.

Also notable that numbers are tokenized individually - I assume this greatly improves its performance in basic arithmetic tasks as compared to GPTs.
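
To make "tokenized individually" concrete, here is a minimal sketch of the difference between per-digit and grouped-digit splitting. It is only a pre-tokenization illustration under assumed regex rules. It is not DeepSeek's or any GPT model's actual tokenizer, and the function names are hypothetical:

```python
import re

def split_per_digit(text: str) -> list[str]:
    """Split so that every digit becomes its own piece (assumed scheme)."""
    # \d        -> exactly one digit per piece
    # [^\d\s]+  -> runs of non-digit, non-space characters
    # \s+       -> runs of whitespace
    return re.findall(r"\d|[^\d\s]+|\s+", text)

def split_grouped_digits(text: str) -> list[str]:
    """Split digits greedily into chunks of up to three (assumed scheme)."""
    return re.findall(r"\d{1,3}|[^\d\s]+|\s+", text)

print(split_per_digit("12345"))       # ['1', '2', '3', '4', '5']
print(split_grouped_digits("12345"))  # ['123', '45']
```

With per-digit splitting, the same digit always maps to the same piece regardless of its neighbours, which is the property the comment speculates helps with arithmetic.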

mondsemmel on Sinclair Chen's Shortform

Re: moral patienthood, I understand the Sam Harris position (paraphrased by him here as "Morality and values depend on the existence of conscious minds—and specifically on the fact that such minds can experience various forms of well-being and suffering in this universe.") as saying that anything else that supposedly matters, only matters because conscious minds care about it. Like, a painting has no more intrinsic value in the universe than any other random arrangement of atoms like a rock; its value stems purely from conscious minds caring about it. Same with concepts like beauty and virtue and biodiversity and anything else that's not directly about conscious minds.

And re: caring more about one's close circle: well, everyone in your close circle has their own close circle they care about, and if you repeat that exercise often enough, the vast majority of people in the world are in someone's close circle.

exmateriae on How to use bright light to improve your life.

I remember reading about SAD treatment by lumens in Inadequate Equilibria, though I did not finish the book.

cousin_it on Passages I Highlighted in The Letters of J.R.R.Tolkien

Every teacher knows that. How quickly an intelligent woman can be taught, grasp his ideas, see his point – and how (with rare exceptions) they can go no further

To me there seems almost an anticorrelation between being a diligent student and going further. It's not gender specific, I've noticed it in musicians in general, and it puzzles and frustrates me. It seems the more diligent they are about learning technique and proper ways and so on, the less willing they are to write their own music. I've known conservatory folks who are literally scared of it. While the messy amateurish folks often do compose, and it's occasionally good. Maybe you have to be a little bit independent-minded to go further than others.

There's a little bit of contradiction in there, in that it's not enough to be independent - you also need some amount of good technique. But you almost need to luck into it, acquire it in your own way. If you get it by being too much of a good student, then that mindset in itself will limit you.

lao-mein on Passages I Highlighted in The Letters of J.R.R.Tolkien

The older I get, and the more I learn about Tolkien, the more he disgusts me.

He is the inverse of all I value and all I find good in the world.

dakara on Thoughts on “AI is easy to control” by Pope & Belrose

You've mentioned that you are on the optimists' Discord server. Is there any way I can join this server to ask a few questions about misuse risk there?

aproteinengine on Ultralearning in 80 days

CS: Thanks! Although I've done a lot of CS over the past four years (ML, apps, published papers, work in labs at MIT, etc.), I've never formally immersed myself in theory by watching lectures or reading CS books. Since MIT OCW approximates a flexible and structured curriculum, I thought it best (the fact that the MIT Challenge exists and that I have friends taking the actual classes at MIT were no small factors either).

Sleep: My sleep schedule has been messy for the past two years, but I'm trying to make it a habit to be asleep by 9 (10 at the latest) to ensure I get a steady 8 hours.

Writing: I hope to be able to write blog posts (such as this one) better. I struggled to sketch out what I wanted to say and found putting it on paper to be Herculean. It's a bit hard for me to illustrate what exactly I mean by "better," but writing closer to the style of William Zinsser and Paul Graham is what I'm targeting right now. I'm going about this as Ben Franklin did, and I'll modify my approach as I go. The current goal for writing is to be able to write something like Not Boring for protein design.

“One such exercise he documents was taking a favorite magazine of his, The Spectator, and taking notes on articles that appeared there. He would then leave the notes for a few days and come back to them, trying to reconstruct the original argument from memory. After finishing, he “compared my Spectator with the original, discovered some of my faults, and corrected them.” Realizing that his vocabulary was limited, he developed another strategy. By turning the prose into verse, he could replace words with synonyms that matched in meter or rhyme. To improve his sense of the rhetorical flow of an essay, he tried his imitation approach again, but this time he jumbled up the hints so he would have to determine the correct order of the sequence of ideas as he wrote again. Once he had established some of the mechanics of writing, he moved on to the more difficult task of writing in a style that would persuade. When reading an English grammar book, he was exposed to the idea of the Socratic method, of challenging another’s ideas through probing questions rather than direct contradiction. He then went to work, carefully avoiding “abrupt contradiction and positive argumentation,” instead focusing[…]” - Ultralearning

Practice: I'm working through the CFAR handbook right now. (I understand it isn't a substitute for the actual camp, but the Atlas Fellowship's gone). I'm picking one concept from it, committing it to memory (SRS), executing it every chance I get during the day, and journalling the results at night. I review them in the morning and make notes on improvement. I'm going to apply for ESPR when it opens up again.

michael-roe on Crosspost: Developing the middle ground on polarized topics

I will take “actually, it’s even more complicated” as a reasonable response. Yes, it probably is.