LessWrong 2.0 Reader

Thoughts on Evo-Bio Math and Mesa-Optimization: Maybe We Need To Think Harder About "Relative" Fitness?
Lorec · 2024-09-28T14:07:42.412Z · comments (6)

[link] SCP Foundation - Anti memetic Division Hub
landscape_kiwi · 2024-09-15T13:40:52.691Z · comments (1)

[link] [Linkpost] Interpretable Analysis of Features Found in Open-source Sparse Autoencoder (partial replication)
Fernando Avalos (fernando-avalos) · 2024-09-09T03:33:53.548Z · comments (1)

[link] Could Things Be Very Different?—How Historical Inertia Might Blind Us To Optimal Solutions
James Stephen Brown (james-brown) · 2024-09-11T09:53:07.474Z · comments (0)

[link] An "Observatory" For a Shy Super AI?
Sherrinford · 2024-09-27T21:22:40.296Z · comments (0)

'Chat with impactful research & evaluations' (Unjournal NotebookLMs)
david reinstein (david-reinstein) · 2024-09-28T00:32:16.845Z · comments (0)

Using LLM's for AI Foundation research and the Simple Solution assumption
Donald Hobson (donald-hobson) · 2024-09-24T11:00:53.658Z · comments (0)

Longevity and the Mind
George3d6 · 2024-09-16T09:43:09.700Z · comments (2)

Reinforcement Learning from Information Bazaar Feedback, and other uses of information markets
Abhimanyu Pallavi Sudhir (abhimanyu-pallavi-sudhir) · 2024-09-16T01:04:32.953Z · comments (1)

[link] Join the $10K AutoHack 2024 Tournament
Paul Bricman (paulbricman) · 2024-09-25T11:54:20.112Z · comments (0)

Seeking mentorship
Kevin Afachao (kevin-afachao) · 2024-09-21T16:54:58.353Z · comments (0)

[link] AI Safety Newsletter #41: The Next Generation of Compute Scale Plus, Ranking Models by Susceptibility to Jailbreaking, and Machine Ethics
Corin Katzke (corin-katzke) · 2024-09-11T19:14:08.274Z · comments (1)

Democracy beyond majoritarianism
Arturo Macias (arturo-macias) · 2024-09-03T15:10:56.284Z · comments (2)

[link] Universal basic income isn’t always AGI-proof
Kevin Kohler (KevinKohler) · 2024-09-05T15:39:18.389Z · comments (3)

Increasing the Span of the Set of Ideas
Jeffrey Heninger (jeffrey-heninger) · 2024-09-13T15:52:39.132Z · comments (1)

Contextual Constitutional AI
aksh-n · 2024-09-28T23:24:43.529Z · comments (0)

A Psychoanalytic Explanation of Sam Altman's Irrational Actions
Gabe · 2024-09-29T18:58:13.511Z · comments (3)

[question] How do you follow AI (safety) news?
PeterH · 2024-09-24T13:58:48.916Z · answers+comments (2)

Exploring Shard-like Behavior: Empirical Insights into Contextual Decision-Making in RL Agents
Alejandro Aristizabal (alejandro-aristizabal) · 2024-09-29T00:32:42.161Z · comments (0)

Of Birds and Bees
RussellThor · 2024-09-30T10:52:15.069Z · comments (2)

For Limited Superintelligences, Epistemic Exclusion is Harder than Robustness to Logical Exploitation
Lorec · 2024-09-15T20:49:06.370Z · comments (9)

[link] Climate Change And Global Warming
Zero Contradictions · 2024-09-25T19:13:09.508Z · comments (0)

Avoiding jailbreaks by discouraging their representation in activation space
Guido Bergman · 2024-09-27T17:49:20.785Z · comments (2)

[link] Models of life
Abhishaike Mahajan (abhishaike-mahajan) · 2024-09-29T19:24:40.060Z · comments (0)

[link] Linkpost: Hypocrisy standoff
Chris_Leong · 2024-09-29T14:27:19.175Z · comments (1)

On Measuring Intellectual Performance - personal experience and several thoughts
Alexander Gufan (alexander-gufan) · 2024-09-20T17:21:19.747Z · comments (2)

Endogenous Growth and Human Intelligence
Nicholas D. (nicholas-d) · 2024-09-18T14:05:54.567Z · comments (0)

MIT FutureTech are hiring for a Technical Associate role
peterslattery · 2024-09-09T20:16:49.299Z · comments (0)

[question] Is it Legal to Maintain Turing Tests using Data Poisoning, and would it work?
Double · 2024-09-05T00:35:39.504Z · answers+comments (6)

[question] Calibration training for 'percentile rankings'?
david reinstein (david-reinstein) · 2024-09-14T21:51:55.705Z · answers+comments (0)

Building Safer AI from the Ground Up: Steering Model Behavior via Pre-Training Data Curation
Antonio Clarke (antonio-clarke) · 2024-09-29T18:48:23.308Z · comments (0)

Collapsing “Collapsing the Belief/Knowledge Distinction”
Jeremias (jeremias-sur) · 2024-09-20T16:11:33.558Z · comments (0)

Toy Models of Superposition: Simplified by Hand
Axel Sorensen (axel-sorensen) · 2024-09-29T21:19:52.475Z · comments (0)

What bootstraps intelligence?
invertedpassion · 2024-09-10T07:11:21.819Z · comments (2)

[question] What are the effective utilitarian pros and cons of having children (in rich countries)?
SpectrumDT · 2024-09-02T10:01:06.405Z · answers+comments (4)

[question] Who looked into extreme nuclear meltdowns?
Remmelt (remmelt-ellen) · 2024-09-01T21:38:02.644Z · answers+comments (8)

Emotion-Informed Valuation Mechanism for Improved AI Alignment in Large Language Models
Javier Marin Valenzuela (javier-marin-valenzuela) · 2024-09-04T17:00:13.203Z · comments (4)

Moscow – ACX Meetups Everywhere Fall 2024
red-hara · 2024-09-20T23:03:16.028Z · comments (0)

Survey - Psychological Impact of Long-Term AI Engagement
Manuela García (manuela-garcia) · 2024-09-17T17:31:38.397Z · comments (0)

[question] Most capable publicly available agents?
Gabe · 2024-09-30T00:04:24.480Z · answers+comments (0)

[question] Searching for Impossibility Results or No-Go Theorems for provable safety.
Maelstrom · 2024-09-27T20:12:25.515Z · answers+comments (1)

San Francisco ACX Meetup “First Saturday”
Nate Sternberg (nate-sternberg) · 2024-09-01T04:48:55.095Z · comments (1)

San Francisco ACX Meetup “First Saturday”
Nate Sternberg (nate-sternberg) · 2024-09-29T03:13:34.615Z · comments (0)

Superposition through Active Learning Lens
akankshanc · 2024-09-17T17:32:56.583Z · comments (0)

Survey - Psychological Impact of Long-Term AI Engagement
Manuela García (manuela-garcia) · 2024-09-17T17:31:38.383Z · comments (1)

[question] What does it mean for an event or observation to have probability 0 or 1 in Bayesian terms?
Noosphere89 (sharmake-farah) · 2024-09-17T17:28:52.731Z · answers+comments (22)

Sampling Effects on Strategic Behavior in Supervised Learning Models
Phil Bland · 2024-09-24T07:44:41.677Z · comments (0)

AIS Hungary Operations Officer role, Deadline: 2024 October 6th
gergogaspar (gergo-gaspar) · 2024-09-25T13:54:25.077Z · comments (0)

Emergent Authorship: Creativity à la Communing
gswonk · 2024-09-14T19:02:07.635Z · comments (0)

Extending the Off-Switch Game: Toward a Robust Framework for AI Corrigibility
OwenChen · 2024-09-25T20:38:22.928Z · comments (0)

Recent comments

viliam on Of Birds and Bees

I think the rule is not necessarily "smarter units make a worse collective", but rather "it is more difficult to make a collective out of smarter units (but when it succeeds, it can be even better)". Humanity is unparalleled at eliminating larger predators.

Bees sacrifice their lives for their closest biological relatives. Birds have small families, so the cost of sacrificing their life is an important factor. Humans also have small families, but they can use prestige and money to incentivize heroic behavior.

So my proposed analogy would be that smarter populations can win, but they cannot achieve it by merely copying the behavior of stupider populations. They need a new solution that leverages their strengths.

mateusz-baginski on Shortform

Also, I'm curious what it is that you consider(ed) AI safety progress/innovation. Can you give a few representative examples?

unreal on A Path out of Insufficient Views

First paragraph: 3/10. The claim is that something was already more natural to begin with, but you need deliberate practice to unlock the thing that was already more natural. It's not that it 'comes more naturally' after you practice something. What 'felt' natural before was actually very unnatural and hindered, but we don't realize this until after practicing.

2nd, 3rd, 4th paragraph: 2/10. This mostly doesn't seem relevant to what I'm trying to offer.

...

It's interesting to watch various people try to repeat or respond to what I'm saying and totally miss the target each time.

It suggests an active blind spot or a refusal to try to look straight at the main claim I'm making. I have been saying it over and over again, so I don't think it's on my end. Although the thing I am trying to point at is notoriously hard to point at, so there's that.

...

Anyway the cruxy part is here, and so to pass my ITT you'd have to include this:

"It's not a meta-process. It's not metacognition. It's not intelligence. It's also not intuition or instinct. This wisdom doesn't get better with more intelligence or more intuition."

"I'm more guided by a wisdom that is not based in System 1 or System 2 or any "process" whatsoever."

"The "one weird trick" to getting the right answers is to discard all stuck, fixed points. Discard all priors and posteriors. Discard all aliefs and beliefs. Discard worldview after worldview. Discard perspective. Discard unity. Discard separation. Discard conceptuality. Discard map, discard territory. Discard past, present, and future. Discard a sense of you. Discard a sense of world. Discard dichotomy and trichotomy. Discard vague senses of wishy-washy flip floppiness. Discard something vs nothing. Discard one vs all. Discard symbols, discard signs, discard waves, discard particles.

All of these things are Ignorance. Discard Ignorance."

sarah-constantin-1 on Sarah Constantin's Shortform

links 9/30/24 https://roamresearch.com/#/app/srcpublic/page/09-30-2024

lorec on I’m confused about innate smell neuroanatomy

Thank you! Someone else noticed! For my part, I'll update this if I find anything.

stephen-fowler on You can, in fact, bamboozle an unaligned AI into sparing your life

I am concerned our disagreement here is primarily semantic or based on a simple misunderstanding of each other's position. I hope to better understand your objection.

"The p-zombie doesn't believe it's conscious, it only acts that way."

Either one of us is mistakenly using a non-traditional definition of p-zombie, or we have different definitions of "belief".

My understanding is that p-zombies are physically identical to regular humans. Their brains contain the same physical patterns that encode their model of the world. That seems, to me, a sufficient physical condition for having identical beliefs.

If your p-zombies are only "acting" like they're conscious, but do not believe it, then they are not physically identical to humans. The existence of p-zombies, as you have described them, wouldn't refute physicalism.

This resource indicates that the way you understand the term p-zombie may be mistaken: https://plato.stanford.edu/entries/zombies/

"but that's because p-zombies are impossible"

The main post that I responded to, specifically the section that I directly quoted, assumes it is possible for p-zombies to exist.

My comment begins "Assuming for the sake of argument that p-zombies could exist" but this is distinct from a claim that p-zombies actually exist.

"If they were possible, this wouldn't be the case, and we would have special access to the truth that p-zombies lack."

I do not find this convincing, because it asserts that my conclusion is incorrect without engaging with the arguments I made to reach that conclusion.

I look forward to continuing this discussion.

unreal on A Path out of Insufficient Views

Just respond genuinely. You already did.

unreal on A Path out of Insufficient Views

I don't know how else to phrase it, but I would like to not contradict interdependent origination, while still pointing toward what happens when all views are dropped and insight becomes possible.

gb on Alignment by default: the simulation hypothesis

Or it could be:

SimulatedAndBeingTestedForAchievingGoalsWithoutBeingNoticed

SimulatedAndBeingTestedForAbilityToTradeWithCreators

SimulatedAndBeingTestedForWillignessToSitQuietAndDoNothing

…

SimulatedAndBeingTestedForAnyXThatDoesNotLeadToDeathOfCreators

…

None of the things here or in your last reply seems particularly likely, so there’s no telling in principle which set outweighs the other. Hence my previous assertion that we should be approximately completely unsure of what happens.

eniteris on Of Birds and Bees

To nitpick the example: worker bees do not have offspring, so the best way for them to spread their genes is to protect the queen and thus the hive.

Birds can have offspring, so self-preservation instead of risky attacks is optimal for individuals of a flock (of genetically unrelated individuals).

It's not that the group is less intelligent, rather that the individuals of the group have different goals (self-preservation vs hive preservation, though the end goal of maximizing fitness is the same).

But genetic fitness breaks down as a metric when you add culture to the system, so application to humans is limited.