LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Interpreting autonomous driving agents with attention based architecture
Manav Dahra (manav-dahra) · 2025-02-01T23:20:27.162Z · comments (0)
Locating and Editing Knowledge in LMs
Dhananjay Ashok (dhananjay-ashok) · 2025-01-24T22:53:40.559Z · comments (0)
[question] Why isn't AI containment the primary AI safety strategy?
OKlogic · 2025-02-05T03:54:58.171Z · answers+comments (3)
[link] Ideas for CoT Models: A Geometric Perspective on Latent Space Reasoning
Rohan Ganapavarapu (rohan-ganapavarapu) · 2025-01-24T19:01:47.339Z · comments (0)
[link] Request for proposals: improving capability evaluations
cb · 2025-02-07T18:51:34.926Z · comments (0)
Poll on AI opinions.
Niclas Kupper (niclas-kupper) · 2025-02-23T22:39:09.027Z · comments (1)
AI alignment for mental health supports
hiki_t · 2025-02-24T04:21:42.379Z · comments (1)
[link] Language Models and World Models, a Philosophy
kyjohnso · 2025-02-03T02:55:36.577Z · comments (0)
Dayton, Ohio, HPMOR 10 year Anniversary meetup
Lunawarrior · 2025-02-24T12:55:59.484Z · comments (0)
Part 1: Enhancing Inner Alignment in CLIP Vision Transformers: Mitigating Reification Bias with SAEs and Grad ECLIP
Gilber A. Corrales (mysticdeepai) · 2025-02-03T19:30:52.505Z · comments (0)
Introducing International AI Governance Alliance (IAIGA)
jamesnorris · 2025-02-05T16:02:29.226Z · comments (0)
Nationwide Action Workshop: Contact Congress about AI safety!
Felix De Simone (BobusChilc) · 2025-02-24T19:36:09.084Z · comments (0)
[question] Programming Language Early Funding?
J Thomas Moros (J_Thomas_Moros) · 2025-02-16T17:34:06.058Z · answers+comments (5)
Gettier cases, Rigid Designators, and Referential Opacity
Antigone (luke-st-clair) · 2025-01-28T18:46:10.180Z · comments (0)
Quantifying the Qualitative: Towards a Bayesian Approach to Personal Insight
Pruthvi Kumar (pruthvi-kumar) · 2025-02-15T19:50:42.550Z · comments (0)
Preference for uncertainty and impact overestimation bias in altruistic systems.
Luck (luck-1) · 2025-02-15T12:27:05.474Z · comments (0)
Positive Directions
G Wood (geoffrey-wood) · 2025-02-11T00:00:11.426Z · comments (0)
Static Place AI Makes AGI Redundant: Multiversal AI Alignment & Rational Utopia
ank · 2025-02-13T22:35:28.300Z · comments (2)
[link] Baumol effect vs Jevons paradox
Hzn · 2025-02-10T08:28:05.982Z · comments (0)
To know or not to know
arisAlexis (arisalexis) · 2025-01-27T13:17:33.672Z · comments (3)
Recursive Cognitive Refinement (RCR): A Self-Correcting Approach for LLM Hallucinations
mxTheo · 2025-02-22T21:32:50.832Z · comments (0)
An Alternate History of the Future, 2025-2040
Mr Beastly (mr-beastly) · 2025-02-24T05:53:25.521Z · comments (0)
the dumbest theory of everything
lostinwilliamsburg · 2025-02-13T07:57:38.842Z · comments (0)
The Newbie's Guide to Navigating AI Futures
keithjmenezes · 2025-02-19T20:37:06.272Z · comments (0)
the devil's ontology
lostinwilliamsburg · 2025-02-07T14:18:52.516Z · comments (14)
Places of Loving Grace [Story]
ank · 2025-02-18T23:49:18.580Z · comments (0)
[link] LLMs can teach themselves to better predict the future
Ben Turtel (ben-turtel) · 2025-02-13T01:01:12.175Z · comments (1)
[link] Humans are Just Self Aware Intelligent Biological Machines
asksathvik · 2025-02-21T01:03:59.950Z · comments (3)
[link] Sea Change
Charlie Sanders (charlie-sanders) · 2025-02-18T06:03:06.961Z · comments (2)
Are we the Wolves now? Human Eugenics under AI Control
Brit (james-spencer) · 2025-01-30T08:31:34.423Z · comments (1)
CyberEconomy. The Limits to Growth
Timur Sadekov (timur-sadekov) · 2025-02-16T21:02:34.040Z · comments (0)
[question] Implication of Uncomputable Problems
Nathan1123 · 2025-01-30T16:48:38.222Z · answers+comments (3)
[link] Biology, Ideology and Violence
Zero Contradictions · 2025-02-06T11:26:02.845Z · comments (5)
AI and Non-Existence.
Eleven · 2025-01-25T19:36:22.624Z · comments (9)
Stopping unaligned LLMs is easy!
Yair Halberstadt (yair-halberstadt) · 2025-02-03T15:38:27.083Z · comments (11)
How To Prevent a Dystopia
ank · 2025-01-29T14:16:09.862Z · comments (4)
Paranoia, Cognitive Biases, and Catastrophic Thought Patterns.
Spiritus Dei (spiritus-dei) · 2025-02-14T00:13:56.300Z · comments (1)
Chinese room AI to survive the inescapable end of compute governance
rotatingpaguro · 2025-02-02T02:42:03.627Z · comments (0)
[link] Several Arguments Against the Mathematical Universe Hypothesis
Vittu Perkele · 2025-02-19T22:13:59.425Z · comments (6)
Preserving Epistemic Novelty in AI: Experiments, Insights, and the Case for Decentralized Collective Intelligence
Andy E Williams (andy-e-williams) · 2025-02-08T10:25:27.891Z · comments (8)
Gettier Cases [repost]
Antigone (luke-st-clair) · 2025-02-03T18:12:22.253Z · comments (4)
[link] We Fell For It
Nicholas / Heather Kross (NicholasKross) · 2025-02-05T03:07:43.175Z · comments (9)
Zizian comparisons / connections in the open source & Linux communities
pocock · 2025-02-24T19:55:08.172Z · comments (0)
[link] Against Unlimited Genius for Baby-Killers
ggggg · 2025-02-19T20:33:27.188Z · comments (1)
A critique of Soares "4 background claims"
YanLyutnev (YanLutnev) · 2025-01-27T20:27:51.026Z · comments (0)
Political Idolatry
Arturo Macias (arturo-macias) · 2025-02-10T15:26:30.686Z · comments (7)
All pigeons are ugly!
Eris (anton-zheltoukhov) · 2025-01-28T15:18:25.507Z · comments (2)
The Goodness of Morning
YanLyutnev (YanLutnev) · 2025-01-27T23:25:38.273Z · comments (1)
Deploying the Observer will save humanity from existential threats
Aram Panasenco (panasenco) · 2025-02-05T10:39:00.789Z · comments (8)
The Fundamental Circularity Theorem: Why Some Mathematical Behaviours Are Inherently Unprovable
Alister Munday (alister-munday) · 2025-01-22T18:20:25.697Z · comments (2)
← previous page (newer posts) · next page (older posts) →