LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Biology-Inspired AGI Timelines: The Trick That Never Works
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2021-12-01T22:35:28.379Z · comments (142)

Ironing Out the Squiggles
Zack_M_Davis · 2024-04-29T16:13:00.371Z · comments (36)

My thoughts on the social response to AI risk
Matthew Barnett (matthew-barnett) · 2023-11-01T21:17:08.184Z · comments (37)

What’s up with LLMs representing XORs of arbitrary features?
Sam Marks (samuel-marks) · 2024-01-03T19:44:33.162Z · comments (61)

[link] Gradient Routing: Masking Gradients to Localize Computation in Neural Networks
cloud · 2024-12-06T22:19:26.717Z · comments (12)

EIS XIII: Reflections on Anthropic’s SAE Research Circa May 2024
scasper · 2024-05-21T20:15:36.502Z · comments (16)

Language Models Model Us
eggsyntax · 2024-05-17T21:00:34.821Z · comments (55)

Deep Learning Systems Are Not Less Interpretable Than Logic/Probability/Etc
johnswentworth · 2022-06-04T05:41:56.713Z · comments (55)

Dear Self; We Need To Talk About Social Media
Elizabeth (pktechgirl) · 2021-12-07T00:40:01.949Z · comments (19)

[link] Comp Sci in 2027 (Short story by Eliezer Yudkowsky)
sudo · 2023-10-29T23:09:56.730Z · comments (22)

parenting rules
Dave Orr (dave-orr) · 2020-12-21T19:48:42.365Z · comments (9)

Formal verification, heuristic explanations and surprise accounting
Jacob_Hilton · 2024-06-25T15:40:03.535Z · comments (11)

Nonprofit Boards are Weird
HoldenKarnofsky · 2022-06-23T14:40:11.593Z · comments (26)

Password-locked models: a stress case for capabilities evaluation
Fabien Roger (Fabien) · 2023-08-03T14:53:12.459Z · comments (14)

Negative Feedback and Simulacra
Elizabeth (pktechgirl) · 2020-04-29T02:00:01.734Z · comments (24)

Current safety training techniques do not fully transfer to the agent setting
Simon Lermen (dalasnoin) · 2024-11-03T19:24:51.537Z · comments (8)

[link] The Death of Behavioral Economics
habryka (habryka4) · 2021-08-22T22:39:12.697Z · comments (24)

[link] Conjecture internal survey: AGI timelines and probability of human extinction from advanced AI
Maris Sala (maris-sala) · 2023-05-22T14:31:59.139Z · comments (5)

Introduction to Cartesian Frames
Scott Garrabrant · 2020-10-22T13:00:00.000Z · comments (32)

Meta Questions about Metaphilosophy
Wei Dai (Wei_Dai) · 2023-09-01T01:17:57.578Z · comments (78)

[question] things that confuse me about the current AI market.
DMMF · 2024-08-28T13:46:56.908Z · answers+comments (28)

[question] What are some beautiful, rationalist artworks?
jacobjacob · 2020-10-17T06:32:43.142Z · answers+comments (140)

Omicron Post #7
Zvi · 2021-12-16T17:30:01.676Z · comments (41)

Tips for Empirical Alignment Research
Ethan Perez (ethan-perez) · 2024-02-29T06:04:54.481Z · comments (4)

An Orthodox Case Against Utility Functions
abramdemski · 2020-04-07T19:18:12.043Z · comments (65)

Your posts should be on arXiv
JanB (JanBrauner) · 2022-08-25T10:35:12.087Z · comments (44)

Announcing Dialogues
Ben Pace (Benito) · 2023-10-07T02:57:39.005Z · comments (52)

[link] "Diamondoid bacteria" nanobots: deadly threat or dead-end? A nanotech investigation
titotal (lombertini) · 2023-09-29T14:01:15.453Z · comments (79)

o3
Zach Stein-Perlman · 2024-12-20T18:30:29.448Z · comments (155)

LessWrong Has Agree/Disagree Voting On All New Comment Threads
Ben Pace (Benito) · 2022-06-24T00:43:17.136Z · comments (217)

Matt Botvinick on the spontaneous emergence of learning algorithms
Adam Scholl (adam_scholl) · 2020-08-12T07:47:13.726Z · comments (87)

Emotionally Confronting a Probably-Doomed World: Against Motivation Via Dignity Points
TurnTrout · 2022-04-10T18:45:08.027Z · comments (7)

Curated conversations with brilliant rationalists
spencerg · 2021-05-28T14:23:30.631Z · comments (18)

A freshman year during the AI midgame: my approach to the next year
Buck · 2023-04-14T00:38:49.807Z · comments (15)

The Incredible Fentanyl-Detecting Machine
sarahconstantin · 2024-06-28T22:10:01.223Z · comments (26)

Sapir-Whorf for Rationalists
Duncan Sabien (Deactivated) (Duncan_Sabien) · 2023-01-25T07:58:46.794Z · comments (49)

Dyslucksia
Shoshannah Tekofsky (DarkSym) · 2024-05-09T19:21:33.874Z · comments (45)

[link] Will no one rid me of this turbulent pest?
Metacelsus · 2023-10-14T15:27:21.497Z · comments (23)

Potential Bottlenecks to Taking Over The World
johnswentworth · 2021-07-06T19:34:53.016Z · comments (22)

The Felt Sense: What, Why and How
Kaj_Sotala · 2020-10-05T15:57:50.545Z · comments (23)

New York Times, Please Do Not Threaten The Safety of Scott Alexander By Revealing His True Name
Zvi · 2020-06-23T12:20:00.788Z · comments (2)

Request: stop advancing AI capabilities
So8res · 2023-05-26T17:42:07.182Z · comments (24)

[link] ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks
Beth Barnes (beth-barnes) · 2023-08-01T18:30:57.068Z · comments (12)

Omicron Post #4
Zvi · 2021-12-06T17:00:01.470Z · comments (66)

Apologizing is a Core Rationalist Skill
johnswentworth · 2024-01-02T17:47:35.950Z · comments (42)

OpenAI: Exodus
Zvi · 2024-05-20T13:10:03.543Z · comments (26)

AI: Practical Advice for the Worried
Zvi · 2023-03-01T12:30:00.703Z · comments (48)

Nate Soares' Life Advice
CatGoddess · 2022-08-23T02:46:43.369Z · comments (41)

The Commitment Races problem
Daniel Kokotajlo (daniel-kokotajlo) · 2019-08-23T01:58:19.669Z · comments (56)

Staying Split: Sabatini and Social Justice
Duncan Sabien (Deactivated) (Duncan_Sabien) · 2022-06-08T08:32:58.633Z · comments (28)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

notfnofn on Everywhere I Look, I See Kat Woods

So I'm new here and this website is great because it doesn't have bite-sized oversimplifying propaganda. But isn't that common everywhere else? Those posts seem very typical for reddit and at least they're not outright misinformation.

Also I... don't hate these memes. They strike me as decent quality. Memes aren't supposed to make you think deeply about things.

abandon on Everywhere I Look, I See Kat Woods

I also dislike many of the posts you included here, but I feel like this is perhaps unfairly harsh on some of the matters that come down to subjective taste; while it's perfectly reasonable to find a post cringe or unfunny for your own part, not everyone will necessarily agree, and the opinions of those who enjoy this sort of content aren't incorrect per se.

As a note, since it seems like you're pretty frustrated with how many of her posts you're seeing, blocking her might be a helpful intervention; Reddit's help page says blocked users' posts are hidden from your feeds.

wassname on New, improved multiple-choice TruthfulQA

Owen, have you looked at the GitHub issues in your repo? There are other issues too. I submitted one here about wrong labels.

I really think it's worth making TruthfulQA 2.0, give the amount of usage it sees and the room for improvement.

wassname on Gaming TruthfulQA: Simple Heuristics Exposed Dataset Weaknesses

TruthfulQA is actually quite bad. I don't blame the authors, as no one has made anything better, but we really should. It's only ~800 samples. And many of them are badly labelled.

wassname on Nathan Helm-Burger's Shortform

I agree, it shows the ease of shoffy copying. But it doesn't show the ease of reverse engineering or parallel engineering.

It's just distillation, though. It doesn't reveal how o1 could be constructed, it just reveals how to efficiently copy from o1-like outputs (not from scratch). This recipe won't be able to make o1, unless o1 already exists. That means this method of copying lets someone catch up to the leader, but not surpass them.

There are some papers that attempt to replicate o1 though, and so far they don't quite get there, using distillation from a larger model (math-star, huggingface TTC) or not matching the results (see my post [LW(p) · GW(p)]). Maybe we will see open source replication in a couple of months? Which means only a short lag.

It's worth noting that Silicon Valley leaks like a sieve. And this is a feature, not a bug. Part of the reason it became the techno-VC centre of the world is because they banned non-competes. So you can take your competitor's trade secrets if you are willing to pay millions to poach some of their engineers. This is why some ML engineers get paid millions, it's not the skill, it's the trade secrets that competitors are paying for (and sometimes the brand-name). This has been great for tech and civilisation, but it's not so great for maintaining a technology lead.

christiankl on Unregulated Peptides: Does BPC-157 hold its promises?

That's not a good data point. If you want to provide anecdotal data, it would be good to provide more of the observations. How long did he have a should issue before taking BPC-157? How fast did it get away afterward?

benquo on Rough Sketch for Product to Enhance Citizen Participation in Politics

Your proposal is well-structured and interesting but has a fundamental flaw that needs to be addressed. Interest keyword-based filtering will primarily encourage politics-as-identity, which is actively harmful - it directs attention towards zero-sum thinking and performative identities, rather than creative problem solving. As Bryan Caplan demonstrates in The Myth of the Rational Voter, people already tend to vote to express identities and affiliations rather than to achieve better outcomes. We shouldn't build tools that further entrench this destructive pattern.

Instead, imagine a tool that:

Has users journal daily about their life - activities, hopes, problems, and worries
Uses AI to identify where their constraints are plausibly caused by or could be alleviated by government action, especially local government
Maps them to specific opportunities for formal recourse, with guidance on process, likely outcomes, and practical assistance (like drafting letters or legal documents)
For issues requiring collective action, connects users facing similar constraints and helps coordinate through mechanisms like dominant assurance contracts [LW · GW] where appropriate

This approach would ground political participation in the solving of one's own problems rather than identity expression. While technically more challenging to implement than interest-based filtering, it would generate higher-quality engagement that expands our collective problem-solving capacity rather than just reallocating political power between existing interest groups.

The patterns emerging from aggregated user experiences would naturally reveal systemic issues and preventive opportunities, especially in how regulations and policies interact to shape people's choices and planning horizons. While building reliable AI judgment about political causation is challenging, it's better to attempt something hard that would be beneficial if feasible, than to facilitate the destructive forces of identity-based politics simply because they're easier to implement.

waterlubber on Unregulated Peptides: Does BPC-157 hold its promises?

Anecdotal data point: an (online) friend of mine with EDS successfully used BPC-157 to treat shoulder ligament injury, although apparently it promoted scar tissue formation as well. He claims that it produced a significant improvement in his symptoms.

yonatan-cale-1 on Yonatan Cale's Shortform

More on starting early:

Imagine a lab starts working in an air gapped network, and one of the 1000 problems that comes up is working-from-home.

If that problem comes up now (early), then we can say "okay, working from home is allowed", and we'll add that problem to the queue of things that we'll prioritize and solve. We can also experiment with it: Maybe we can open another secure office closer to the employee's house, would they like that? If so, we could discuss fancy ways to secure the communication between the offices. If not, we can try something else.

If that problem comes up when security is critical (if we wait), then the solution will be "no more working from home, period". The security staff will be too overloaded with other problems to solve, not available to experiment with having another office nor to sign a deal with Cursor.

anthonyc on Passages I Highlighted in The Letters of J.R.R.Tolkien

Edit to add: Just thinking about the converse, you could also make it sound more ridiculous by rewriting it with more obscure parts of the legendarium, too.

Conquer Morgoth with Ungoliant. Turn Maiar into balrogs. Glamdring among the morgul-blades.