LessWrong 2.0 Reader


[question] Why Can’t Sub-AGI Solve AI Alignment? Or: Why Would Sub-AGI AI Not be Aligned?
MrThink (ViktorThink) · 2024-07-02T20:13:24.054Z · answers+comments (23)

Likelihood calculation with duobels
Martin Gerdes (martin-gerdes) · 2024-10-01T16:21:01.268Z · comments (0)

[question] is there a big dictionary somewhere with all your jargon and acronyms and whatnot?
KvmanThinking (avery-liu) · 2024-10-17T11:30:50.937Z · answers+comments (7)

[link] A Logical Proof for the Emergence and Substrate Independence of Sentience
rife (edgar-muniz) · 2024-10-24T21:08:09.398Z · comments (31)

Ways to think about alignment
Abhimanyu Pallavi Sudhir (abhimanyu-pallavi-sudhir) · 2024-10-27T01:40:50.762Z · comments (0)

[question] When do alignment researchers retire?
Jordan Taylor (Nadroj) · 2024-06-25T23:30:25.520Z · answers+comments (2)

[question] What are the primary drivers that caused selection pressure for intelligence in humans?
Towards_Keeperhood (Simon Skade) · 2024-11-07T09:40:20.275Z · answers+comments (15)

Contrapositive Natural Abstraction - Project Intro
Elliot Callender (javanotmocha) · 2024-06-24T18:37:21.761Z · comments (5)

Some Comments on Recent AI Safety Developments
testingthewaters · 2024-11-09T16:44:58.936Z · comments (0)

[link] Social interaction-inspired AI alignment
Chipmonk · 2024-06-24T08:10:08.719Z · comments (2)

Towards a Clever Hans Test: Unmasking Sentience Biases in Chatbot Interactions
glykokalyx · 2024-11-10T22:34:58.956Z · comments (0)

How does generalized accessibility compare to targeted accessibility?
ErioirE (erioire) · 2024-07-17T17:07:09.829Z · comments (0)

[question] What is the cost difference in processing input vs. output tokens with LLMs?
kotrfa · 2024-08-08T10:43:18.049Z · answers+comments (10)

Theories With Mentalistic Atoms Are As Validly Called Theories As Theories With Only Non-Mentalistic Atoms
Lorec · 2024-11-12T06:45:26.039Z · comments (4)

Hamiltonian Dynamics in AI: A Novel Approach to Optimizing Reasoning in Language Models
Javier Marin Valenzuela (javier-marin-valenzuela) · 2024-10-09T19:14:56.162Z · comments (0)

Help us seed AI Safety Brussels
gergogaspar (gergo-gaspar) · 2024-08-07T06:32:24.760Z · comments (2)

EAGxBerkeley 2024
Lauriander · 2024-07-15T18:38:16.980Z · comments (0)

Joint mandatory donation as a way to increase the number of donations
Crazy philosopher (commissar Yarrick) · 2024-07-07T10:56:57.222Z · comments (3)

Leverage points for a pause
Remmelt (remmelt-ellen) · 2024-08-28T09:21:17.593Z · comments (0)

Alignment from equivariance
hamishtodd1 · 2024-08-13T21:09:11.849Z · comments (0)

Interview with Bill O’Rourke - Russian Corruption, Putin, Applied Ethics, and More
JohnGreer · 2024-10-27T17:11:28.891Z · comments (0)

On Artificial Wisdom
Jordan Arel · 2024-07-12T00:20:33.241Z · comments (0)

Auckland New Zealand - ACX Meetups Everywhere Fall 2024
Mark Gilmour (mark-gilmour) · 2024-08-29T18:35:31.852Z · comments (0)

For Limited Superintelligences, Epistemic Exclusion is Harder than Robustness to Logical Exploitation
Lorec · 2024-09-15T20:49:06.370Z · comments (9)

Methodology: Contagious Beliefs
James Stephen Brown (james-brown) · 2024-10-19T03:58:17.966Z · comments (0)

On the Practical Applications of Interpretability
Nick Jiang (nick-jiang) · 2024-10-15T17:18:25.280Z · comments (0)

AI development is an act of social revolution
artemiocobb · 2024-07-03T18:00:17.947Z · comments (0)

Computational irreducibility challenges the simulation hypothesis
Clément L · 2024-08-11T16:14:29.655Z · comments (15)

San Francisco ACX Meetup “First Saturday”
Nate Sternberg (nate-sternberg) · 2024-09-29T03:13:34.615Z · comments (0)

Hamburg Germany - ACX Meetups Everywhere Fall 2024
Gunnar (gunnar ) · 2024-08-29T18:37:11.622Z · comments (0)

(draft) Cyborg software should be open (?)
AtillaYasar (atillayasar) · 2024-11-01T07:24:51.966Z · comments (5)

Bayes' Theorem: In Search of Gold (Lesson 1)
bayesyatina · 2024-06-28T08:39:16.638Z · comments (0)

Some desirable properties of automated wisdom
Marius Adrian Nicoară · 2024-07-13T06:05:34.386Z · comments (2)

Recursion in AI is scary. But let’s talk solutions.
Oleg Trott (oleg-trott) · 2024-07-16T20:34:58.580Z · comments (10)

Enabling New Applications with Today's Mechanistic Interpretability Toolkit
ananya_joshi · 2024-10-25T17:53:23.960Z · comments (0)

[link] Motivation Theory
Zero Contradictions · 2024-08-08T05:05:04.741Z · comments (0)

Memetics as an analogy and its implicit connotations
Rachel Shu (wearsshoes) · 2024-06-25T05:13:12.746Z · comments (0)

The AI alignment problem in socio-technical systems from a computational perspective: A Top-Down-Top view and outlook
zhaoweizhang (Zhaowei Zhang) · 2024-07-15T18:56:08.108Z · comments (0)

Playing Minecraft with a Superintelligence
Johannes C. Mayer (johannes-c-mayer) · 2024-08-17T22:47:42.767Z · comments (0)

Vilnius – ACX Meetups Everywhere Fall 2024
NoUsernameSelected · 2024-08-19T17:38:12.378Z · comments (1)

[link] Unnatural abstractions
Aprillion · 2024-08-10T22:31:42.949Z · comments (3)

AI Compute governance: Verifying AI chip location
Farhan · 2024-10-12T17:36:45.942Z · comments (0)

Quantum Immortality: A Perspective if AI Doomers are Probably Right
avturchin · 2024-11-07T16:06:08.106Z · comments (34)

[question] Seeking feedback on a critique of the paperclip maximizer thought experiment
bio neural (bio-neural) · 2024-07-15T18:39:30.545Z · answers+comments (9)

MIT FutureTech are hiring for a Technical Associate role
peterslattery · 2024-09-09T20:16:49.299Z · comments (0)

Near-death experiences
Declan Molony (declan-molony) · 2024-10-08T06:34:04.107Z · comments (1)

Endogenous Growth and Human Intelligence
Nicholas D. (nicholas-d) · 2024-09-18T14:05:54.567Z · comments (0)

[link] Podcast discussing Hanson's Cultural Drift Argument
vaishnav92 · 2024-10-20T17:58:41.416Z · comments (0)

The Potential Impossibility of Subjective Death
VictorLJZ · 2024-07-04T18:17:28.141Z · comments (34)

Keeping content out of LLM training datasets
Ben Millwood (ben-millwood) · 2024-07-18T10:27:27.827Z · comments (0)


Recent comments

satron on Sabotage Evaluations for Frontier Models

To modify my example to include an accountability mechanism that's also similar to real life: the King takes exactly the same vaccines as everyone else. So if he messes up the chemicals, he also dies.

I believe a similar accountability mechanism works in our real-world case. If CEOs build unsafe AI, they and everyone they value in this life die. This seems like a really good incentive for them not to build unsafe AI.

At the end of the day, voluntary commitments such as debating with critics are not as strong, in my opinion. Imagine that they agree with you and go to the debate. Without the incentive of "if I mess up, everyone dies", the CEOs could just go right back to doing what they were doing. As far as I know, voluntary debates have few (if any) actual legal mechanisms to hold CEOs accountable.

q-home on Q Home's Shortform

There's an alignment-related problem, the problem of defining real objects. Relevant topics: environmental goals; task identification problem; "look where I'm pointing, not at my finger"; Eliciting Latent Knowledge.

I think I realized how people go from caring about sensory data to caring about real objects. But I need help with figuring out how to capitalize on the idea.

So... how do humans do it?

Humans create very small models for predicting very small/basic aspects of sensory input (mini-models).
Humans use mini-models as puzzle pieces for building models for predicting ALL of sensory input.
As a result, humans get models in which it's easy to identify "real objects" corresponding to sensory input.

For example, imagine you're just looking at ducks swimming in a lake. You notice that ducks don't suddenly disappear from your vision (permanence), their movement is continuous (continuity) and they seem to move in a 3D space (3D space). All those patterns ("permanence", "continuity" and "3D space") are useful for predicting aspects of immediate sensory input. But all those patterns are also useful for developing deeper theories of reality, such as the atomic theory of matter, because you can imagine that atoms are small things which continuously move in 3D space, similar to ducks. (This image stops working as well when you get to Quantum Mechanics, but then aspects of QM feel less "real" and less relevant for defining objects.) As a result, it's easy to see how the deeper model relates to surface-level patterns.

In other words: reality contains "real objects" to the extent to which deep models of reality are similar to (models of) basic patterns in our sensory input.

jonas-hallgren on OpenAI Email Archives (from Musk v. Altman)

Do you have any thoughts on what this means in terms of action? To me, influencing such conversations seems potentially intractable, but maybe one could host forums and events for this if one has the right network?

I think it's a good point, and I'm wondering what it looks like in practice. I can see it for someone with the right contacts, so is the message for people who don't have them to build that network, or what are your thoughts there?

lukehmiles on Project Adequate: Seeking Cofounders/Funders

Wasted opportunity to guarantee this post keeps getting holywar comments for the next hundred years.

lukehmiles on Project Adequate: Seeking Cofounders/Funders

This is pretty inspiring to me. Thank you for sharing.

elityre on Reformative Hypocrisy, and Paying Close Enough Attention to Selectively Reward It.

I suspect it would still involve billions of $ of funding, partnerships like the one with Microsoft, and other for-profit pressures to be the sort of player it is today. So I don't know that Musk's plan was viable at all.

Note that all of this happened before the scaling hypothesis was really formulated, much less made obvious.

We now know, with the benefit of hindsight, that developing AI and its precursors is extremely compute intensive, which means capital intensive. There was some reason to guess this might be true at the time, but it wasn't a foregone conclusion: it was still an open question whether the key to AGI would be mostly some technical innovation that hadn't been developed yet.

elityre on Lao Mein's Shortform

Those people don't get substantial equity in most businesses in the world. They generally get paid a salary and benefits in exchange for their work, and that's about it.

zy on Shortform

Haven't looked too closely at this, but wanted to comment with my initial two thoughts:

Child consent is tricky.
Likely many are foreign children, who may or may not be included in the 75 million statistic.

It is good to think critically, but I think it would be beneficial to present more evidence before making the claim or drawing the conclusion.

lukehmiles on Shortform

The other day I was trying to think of information leaks that a competent conspiracy couldn't prevent, regarding this. I just thought of one small one: people will sometimes randomly die or have their homes raided. If the slavery is common, then sometimes the slaves will be discovered during these events. Even if the escapees wanted to silence the story out of shame, cops would probably gossip to the press.

So you can probably tally such events, crunch the numbers, and get a decent conspiracy-resistant estimate.

lukehmiles on Alexander Gietelink Oldenziel's Shortform

As a layman, I have not seen much unrealistic hype. I think the hype-level is just about right.