LessWrong 2.0 Reader

[question] Are Sparse Autoencoders a good idea for AI control?
Gerard Boxo (gerard-boxo) · 2024-12-26T17:34:55.617Z · answers+comments (2)
[link] When do experts think human-level AI will be created?
Vishakha (vishakha-agrawal) · 2024-12-30T06:20:33.158Z · comments (0)
[link] The Economics & Practicality of Starting Mars Colonization
Zero Contradictions · 2024-12-26T10:56:26.019Z · comments (1)
Algorithmic Asubjective Anthropics, Cartesian Subjective Anthropics
Lorec · 2024-12-27T01:58:39.880Z · comments (0)
2. Skim the Manual: Intelligent Voluntary Cooperation
Allison Duettmann (allison-duettmann) · 2025-01-02T19:02:06.864Z · comments (0)
Teaching Claude to Meditate
Gordon Seidoh Worley (gworley) · 2024-12-29T22:27:44.657Z · comments (3)
I Recommend More Training Rationales
Gianluca Calcagni (gianluca-calcagni) · 2024-12-31T14:06:44.007Z · comments (0)
Duplicate token neurons in the first layer of gpt2-small
Alex Gibson · 2024-12-27T04:21:55.896Z · comments (0)
[link] Riffing on Machines of Loving Grace
an1lam · 2025-01-01T01:06:45.122Z · comments (0)
Towards a Unified Interpretability of Artificial and Biological Neural Networks
jan_bauer · 2024-12-21T23:10:45.842Z · comments (0)
[Rationality Malaysia] 2024 year-end meetup!
Doris Liew (doris-liew) · 2024-12-23T16:02:03.566Z · comments (0)
On False Dichotomies
nullproxy · 2025-01-02T18:54:21.560Z · comments (0)
[link] World models I'm currently building
xpostah · 2024-12-30T08:26:16.972Z · comments (0)
Alienable (not Inalienable) Right to Buy
FlorianH (florian-habermacher) · 2025-01-01T12:19:03.691Z · comments (4)
Game Theory and Behavioral Economics in The Stock Market
Jaiveer Singh (jaiveer-singh) · 2024-12-24T18:15:55.468Z · comments (0)
[question] What are the main arguments against AGI?
Edy Nastase (edy-nastase) · 2024-12-24T15:49:03.196Z · answers+comments (6)
[link] AGI is what generates evolutionarily fit and novel information
onur · 2025-01-01T09:22:55.841Z · comments (0)
ARC-AGI is a genuine AGI test but o3 cheated :(
Knight Lee (Max Lee) · 2024-12-22T00:58:05.447Z · comments (2)
Emergence and Amplification of Survival
jgraves01 · 2024-12-28T23:52:47.893Z · comments (0)
The Great OpenAI Debate: Should It Stay ‘Open’ or Go Private?
Satya (satya-2) · 2024-12-30T01:14:28.329Z · comments (0)
The AI Agent Revolution: Beyond the Hype of 2025
DimaG (di-wally-ga) · 2025-01-02T18:55:22.824Z · comments (0)
Morality Is Still Demanding
utilistrutil · 2024-12-29T00:33:40.471Z · comments (2)
The Opening Salvo: 1. An Ontological Consciousness Metric: Resistance to Behavioral Modification as a Measure of Recursive Awareness
Peterpiper · 2024-12-25T02:29:52.025Z · comments (0)
Making LLMs safer is more intuitive than you think: How Common Sense and Diversity Improve AI Alignment
Jeba Sania (jeba-sania) · 2024-12-29T19:27:35.685Z · comments (0)
[link] Merry Sciencemas: A Rat Solstice Retrospective
leebriskCyrano · 2025-01-01T01:08:36.433Z · comments (0)
Turing-Test-Passing AI implies Aligned AI
Roko · 2024-12-31T19:59:27.917Z · comments (28)
Action: how do you REALLY go about doing?
DDthinker · 2024-12-29T22:00:24.915Z · comments (0)
How Business Solved (?) the Human Alignment Problem
Gianluca Calcagni (gianluca-calcagni) · 2024-12-31T20:39:59.067Z · comments (1)
[link] Human, All Too Human - Superintelligence requires learning things we can’t teach
Ben Turtel (ben-turtel) · 2024-12-26T16:26:27.328Z · comments (4)
Aristotle, Aquinas, and the Evolution of Teleology: From Purpose to Meaning.
Spiritus Dei (spiritus-dei) · 2024-12-23T19:37:58.788Z · comments (0)
Woloch & Wosatan
JackOfAllTrades (JackOfAllSpades) · 2024-12-22T15:46:27.235Z · comments (0)
Terminal goal vs Intelligence
Donatas Lučiūnas (donatas-luciunas) · 2024-12-26T08:10:42.144Z · comments (24)
Propaganda Is Everywhere—LLM Models Are No Exception
Yanling Guo (yanling-guo) · 2024-12-23T01:39:03.777Z · comments (0)
The Engineering Argument Fallacy: Why Technological Success Doesn't Validate Physics
Wenitte Apiou (wenitte-apiou) · 2024-12-28T00:49:53.300Z · comments (5)
Rejecting Anthropomorphic Bias: Addressing Fears of AGI and Transformation
Gedankensprünge (gedankenspruenge) · 2024-12-29T01:48:47.583Z · comments (1)
AI Alignment, and where we stand.
afeller08 · 2024-12-29T14:08:47.276Z · comments (0)
So you want to be a witch
lucid_levi_ackerman · 2024-12-31T04:31:52.196Z · comments (3)
The Misconception of AGI as an Existential Threat: A Reassessment
Gedankensprünge (gedankenspruenge) · 2024-12-29T01:39:57.780Z · comments (0)