LessWrong 2.0 Reader

Inositol Non-Results
Elizabeth (pktechgirl) · 2023-11-29T21:40:03.242Z · comments (2)
Losing Metaphors: Zip and Paste
jefftk (jkaufman) · 2023-11-29T20:31:07.464Z · comments (6)
Preserving our heritage: Building a movement and a knowledge ark for current and future generations
rnk8 · 2023-11-29T19:20:55.974Z · comments (5)
AGI Alignment is Absurd
Youssef Mohamed (youssef-mohamed) · 2023-11-29T19:11:50.894Z · comments (4)
[link] The origins of the steam engine: An essay with interactive animated diagrams
jasoncrawford · 2023-11-29T18:30:36.315Z · comments (1)
ChatGPT 4 solved all the gotcha problems I posed that tripped ChatGPT 3.5
VipulNaik · 2023-11-29T18:11:53.252Z · comments (16)
“Clean” vs. “messy” goal-directedness (Section 2.2.3 of “Scheming AIs”)
Joe Carlsmith (joekc) · 2023-11-29T16:32:30.068Z · comments (1)
Lying Alignment Chart
Zack_M_Davis · 2023-11-29T16:15:28.102Z · comments (17)
Rethink Priorities: Seeking Expressions of Interest for Special Projects Next Year
kierangreig · 2023-11-29T13:59:46.727Z · comments (0)
[question] Thoughts on teletransportation with copies?
titotal (lombertini) · 2023-11-29T12:56:51.193Z · answers+comments (13)
Interpretability with Sparse Autoencoders (Colab exercises)
CallumMcDougall (TheMcDouglas) · 2023-11-29T12:56:21.608Z · comments (9)
The 101 Space You Will Always Have With You
Screwtape · 2023-11-29T04:56:40.240Z · comments (20)
Trust your intuition - Kahneman's book misses the forest for the trees
mnvr · 2023-11-29T04:37:19.660Z · comments (2)
Process Substitution Without Shell?
jefftk (jkaufman) · 2023-11-29T03:20:06.922Z · comments (18)
Deception Chess: Game #2
Zane · 2023-11-29T02:43:22.375Z · comments (17)
Black Box Biology
GeneSmith · 2023-11-29T02:27:29.794Z · comments (30)
[question] What would be the shelf life of nuclear weapon-secrecy if nuclear weapons had not immediately been used in combat?
Gram Stone · 2023-11-29T00:53:42.598Z · answers+comments (2)
[link] Scaling laws for dominant assurance contracts
jessicata (jessica.liu.taylor) · 2023-11-28T23:11:07.631Z · comments (5)
I’m confused about innate smell neuroanatomy
Steven Byrnes (steve2152) · 2023-11-28T20:49:13.042Z · comments (2)
How to Control an LLM's Behavior (why my P(DOOM) went down)
RogerDearnaley (roger-d-1) · 2023-11-28T19:56:49.679Z · comments (30)
[question] Is there a word for discrimination against A.I.?
Aaron Bohannon (aaron-bohannon-1) · 2023-11-28T19:03:42.161Z · answers+comments (4)
Update #2 to "Dominant Assurance Contract Platform": EnsureDone
moyamo · 2023-11-28T18:02:50.367Z · comments (2)
[link] Ethicophysics II: Politics is the Mind-Savior
MadHatter · 2023-11-28T16:27:19.233Z · comments (9)
[link] Neither EA nor e/acc is what we need to build the future
jasoncrawford · 2023-11-28T16:04:16.803Z · comments (22)
[link] Agentic Growth
Logan Kieller (logan-kieller) · 2023-11-28T15:45:54.055Z · comments (0)
[link] AISC project: How promising is automating alignment research? (literature review)
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2023-11-28T14:47:29.372Z · comments (1)
A day in the life of a mechanistic interpretability researcher
Bill Benzon (bill-benzon) · 2023-11-28T14:45:17.967Z · comments (3)
Two sources of beyond-episode goals (Section 2.2.2 of “Scheming AIs”)
Joe Carlsmith (joekc) · 2023-11-28T13:49:49.175Z · comments (1)
Self-Referential Probabilistic Logic Admits the Payor's Lemma
Yudhister Kumar (randomwalks) · 2023-11-28T10:27:29.029Z · comments (14)
[question] How can I use AI without increasing AI-risk?
Yoav Ravid · 2023-11-28T10:05:44.321Z · answers+comments (6)
A Reading From The Book Of Sequences
Screwtape · 2023-11-28T06:45:57.806Z · comments (0)
Anthropic Fall 2023 Debate Progress Update
Ansh Radhakrishnan (anshuman-radhakrishnan-1) · 2023-11-28T05:37:30.070Z · comments (9)
Apocalypse insurance, and the hardline libertarian take on AI risk
So8res · 2023-11-28T02:09:52.400Z · comments (38)
[link] My techno-optimism [By Vitalik Buterin]
habryka (habryka4) · 2023-11-27T23:53:35.859Z · comments (17)
[question] Could Germany have won World War I with high probability given the benefit of hindsight?
Roko · 2023-11-27T22:52:42.066Z · answers+comments (18)
[question] Could World War I have been prevented given the benefit of hindsight?
Roko · 2023-11-27T22:39:15.866Z · answers+comments (8)
AISC 2024 - Project Summaries
NickyP (Nicky) · 2023-11-27T22:32:23.555Z · comments (3)
"Epistemic range of motion" and LessWrong moderation
habryka (habryka4) · 2023-11-27T21:58:40.834Z · comments (3)
Apply to the Conceptual Boundaries Workshop for AI Safety
Chipmonk · 2023-11-27T21:04:59.037Z · comments (0)
[link] There is no IQ for AI
Gabriel Alfour (gabriel-alfour-1) · 2023-11-27T18:21:26.196Z · comments (10)
Two concepts of an “episode” (Section 2.2.1 of “Scheming AIs”)
Joe Carlsmith (joekc) · 2023-11-27T18:01:29.153Z · comments (1)
[link] [Linkpost] George Mack's Razors
trevor (TrevorWiesinger) · 2023-11-27T17:53:45.065Z · comments (8)
On possible cross-fertilization between AI and neuroscience [Creativity]
Bill Benzon (bill-benzon) · 2023-11-27T16:50:26.531Z · comments (22)
[link] Ethicophysics I
MadHatter · 2023-11-27T15:44:29.236Z · comments (16)
[link] Sentience Institute 2023 End of Year Summary
michael_dello · 2023-11-27T12:11:37.228Z · comments (0)
[question] A Question about Corrigibility (2015)
A.H. (AlfredHarwood) · 2023-11-27T12:05:51.659Z · answers+comments (2)
Appendices to the live agendas
technicalities · 2023-11-27T11:10:32.187Z · comments (4)
Shallow review of live agendas in alignment & safety
technicalities · 2023-11-27T11:10:27.464Z · comments (69)
[link] Napoleon stole the Roman Inquisition archives and investigated the Galileo case
Meow P (meow-p) · 2023-11-27T09:41:31.737Z · comments (0)
[link] Found Paper: "FDT in an evolutionary environment"
the gears to ascension (lahwran) · 2023-11-27T05:27:50.709Z · comments (47)