LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[question] hypnosis question
KvmanThinking (avery-liu) · 2025-02-06T02:41:53.314Z · answers+comments (5)
Scanless Whole Brain Emulation
Knight Lee (Max Lee) · 2025-01-27T10:00:08.036Z · comments (4)
Use computers as powerful as in 1985 or AI controls humans or ?
jrincayc (nerd_gatherer) · 2025-02-03T00:51:05.706Z · comments (0)
[question] Whose track record of AI predictions would you like to see evaluated?
Jonny Spicer (jonnyspicer) · 2025-01-29T12:05:30.311Z · answers+comments (3)
[link] Credit Suisse collapse obfuscated Parreaux, Thiébaud & Partners scandal
pocock · 2025-02-24T21:28:39.617Z · comments (0)
Safe Search is off: root causes of AI catastrophic risks
Jemal Young (ghostwheel) · 2025-01-31T18:22:43.947Z · comments (0)
Is it ethical to work in AI "content evaluation"?
anon_databoy123 (noob1234) · 2025-01-27T19:58:26.176Z · comments (2)
[link] New LLM Scaling Law
wrmedford · 2025-02-19T20:21:17.475Z · comments (0)
[question] How do biological or spiking neural networks learn?
Dom Polsinelli (dom-polsinelli) · 2025-01-31T16:03:38.425Z · answers+comments (1)
[question] Strong, Stable, Open: Choose Two - in search of an article
Eli_ · 2025-01-31T14:48:21.438Z · answers+comments (0)
[link] Modularity and assembly: AI safety via thinking smaller
D Wong (d-nell) · 2025-02-20T00:58:39.714Z · comments (0)
arch-anarchist reading list
Peter lawless · 2025-02-16T22:47:00.273Z · comments (1)
Arguing for the Truth? An Inference-Only Study into AI Debate
denisemester · 2025-02-11T03:04:58.852Z · comments (0)
Can someone, anyone, make superintelligence a more concrete concept?
Ori Nagel (ori-nagel) · 2025-02-04T02:18:51.718Z · comments (8)
AI acceleration, DeepSeek, moral philosophy
Josh H (joshua-haas) · 2025-02-02T00:08:11.593Z · comments (0)
[link] Probability of AI-Caused Disaster
Alvin Ånestrand (alvin-anestrand) · 2025-02-12T19:40:11.121Z · comments (2)
[link] The future of humanity is in management
jasoncrawford · 2025-01-30T22:14:46.765Z · comments (5)
Visualizing Interpretability
Darold Davis (darold) · 2025-02-03T19:36:38.938Z · comments (0)
Workshop: Interpretability in LLMs Using Geometric and Statistical Methods
Karthik Viswanathan (vkarthik095) · 2025-02-22T09:39:26.446Z · comments (0)
Making alignment a law of the universe
juggins · 2025-02-25T10:44:11.632Z · comments (0)
[link] Forecasting Uncontrolled Spread of AI
Alvin Ånestrand (alvin-anestrand) · 2025-02-22T13:05:57.171Z · comments (0)
Artificial Static Place Intelligence: Guaranteed Alignment
ank · 2025-02-15T11:08:50.226Z · comments (2)
ChatGPT: Exploring the Digital Wilderness, Findings and Prospects
Bill Benzon (bill-benzon) · 2025-02-02T09:54:26.008Z · comments (0)
Updating and Editing Factual Knowledge in Language Models
Dhananjay Ashok (dhananjay-ashok) · 2025-01-23T19:34:37.121Z · comments (2)
Intrinsic Dimension of Prompts in LLMs
Karthik Viswanathan (vkarthik095) · 2025-02-14T19:02:49.464Z · comments (0)
if you're not happy single, you won't be happy immortal
daijin · 2025-02-24T13:23:52.204Z · comments (1)
The many failure modes of consumer-grade LLMs
dereshev · 2025-01-26T19:01:09.891Z · comments (0)
Starting Thoughts on RLHF
Michael Flood (michael-flood) · 2025-01-23T22:16:49.793Z · comments (0)
The Outer Levels
Jerdle (daniel-amdurer) · 2025-02-03T14:30:29.230Z · comments (3)
Should Art Carry the Weight of Shaping our Values?
Krishna Maneesha Dendukuri (krishna_maneesha-d) · 2025-01-28T18:43:32.517Z · comments (0)
LW/ACX social meetup
Stefan (stefan-1) · 2025-02-10T21:12:39.092Z · comments (0)
[link] Language Models and World Models, a Philosophy
kyjohnso · 2025-02-03T02:55:36.577Z · comments (0)
Locating and Editing Knowledge in LMs
Dhananjay Ashok (dhananjay-ashok) · 2025-01-24T22:53:40.559Z · comments (0)
Part 1: Enhancing Inner Alignment in CLIP Vision Transformers: Mitigating Reification Bias with SAEs and Grad ECLIP
Gilber A. Corrales (mysticdeepai) · 2025-02-03T19:30:52.505Z · comments (0)
[question] Programming Language Early Funding?
J Thomas Moros (J_Thomas_Moros) · 2025-02-16T17:34:06.058Z · answers+comments (5)
Interpreting autonomous driving agents with attention based architecture
Manav Dahra (manav-dahra) · 2025-02-01T23:20:27.162Z · comments (0)
Exploring the coherence of features explanations in the GemmaScope
Mattia Proietti (mattia-proietti) · 2025-02-01T21:28:33.690Z · comments (0)
Biological humans collectively exert at most 400 gigabits/s of control over the world.
benwr · 2025-02-20T23:44:06.509Z · comments (1)
The Domain of Orthogonality
mgfcatherall · 2025-02-05T08:14:32.793Z · comments (0)
[question] Why do we have the NATO logo?
KvmanThinking (avery-liu) · 2025-02-19T22:59:41.755Z · answers+comments (4)
[question] Why isn't AI containment the primary AI safety strategy?
OKlogic · 2025-02-05T03:54:58.171Z · answers+comments (3)
[link] Ideas for CoT Models: A Geometric Perspective on Latent Space Reasoning
Rohan Ganapavarapu (rohan-ganapavarapu) · 2025-01-24T19:01:47.339Z · comments (0)
Nationwide Action Workshop: Contact Congress about AI safety!
Felix De Simone (BobusChilc) · 2025-02-24T19:36:09.084Z · comments (0)
Quantifying the Qualitative: Towards a Bayesian Approach to Personal Insight
Pruthvi Kumar (pruthvi-kumar) · 2025-02-15T19:50:42.550Z · comments (0)
Upcoming Neuroscience Workshop - Functionalizing Brain Data, Ground-Truthing, and the Role of Artificial Data in Advancing Neuroscience
Devin Ward (Carboncopies Foundation) · 2025-01-30T23:02:00.681Z · comments (0)
Demystifying the Pinocchio Paradox
Novak Zukowski (Zantarus) · 2025-02-25T06:16:57.219Z · comments (0)
Opinion Article Scoring System
ciaran · 2025-02-10T14:32:19.030Z · comments (0)
Dayton, Ohio, HPMOR 10 year Anniversary meetup
Lunawarrior · 2025-02-24T12:55:59.484Z · comments (0)
[link] The Capitalist Agent
henophilia · 2025-02-04T15:32:39.694Z · comments (10)
[link] Request for proposals: improving capability evaluations
cb · 2025-02-07T18:51:34.926Z · comments (0)
← previous page (newer posts) · next page (older posts) →