LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

How are Those AI Participants Doing Anyway?
mushroomsoup · 2025-01-24T22:37:47.999Z · comments (0)
[link] Tetherware #1: The case for humanlike AI
Jáchym Fibír · 2025-01-30T10:58:11.717Z · comments (0)
[question] Why not train reasoning models with RLHF?
CBiddulph (caleb-biddulph) · 2025-01-30T07:58:35.742Z · answers+comments (3)
Will AI Resilience protect Developing Nations?
ejk64 · 2025-01-21T15:31:32.378Z · comments (0)
Death vs. Suffering: The Endurist-Serenist Divide on Life’s Worst Fate
Alex_Steiner · 2025-01-27T03:59:40.279Z · comments (7)
[question] What are the chances that Superhuman Agents are already being tested on the internet?
artemium · 2025-01-20T11:09:33.835Z · answers+comments (1)
[link] A concise definition of what it means to win
testingthewaters · 2025-01-25T06:37:37.305Z · comments (0)
Disproving the "People-Pleasing" Hypothesis for AI Self-Reports of Experience
rife (edgar-muniz) · 2025-01-26T15:53:10.530Z · comments (18)
[Linkpost] Why AI Safety Camp struggles with fundraising (FBB #2)
gergogaspar (gergo-gaspar) · 2025-01-21T17:27:51.965Z · comments (0)
Detailed Ideal World Benchmark
Knight Lee (Max Lee) · 2025-01-30T02:31:39.852Z · comments (0)
Scanless Whole Brain Emulation
Knight Lee (Max Lee) · 2025-01-27T10:00:08.036Z · comments (4)
Allegory of the Tsunami
Evan Hu (evan-hu) · 2025-01-29T19:09:33.761Z · comments (0)
[link] Constitutions for ASI?
ukc10014 · 2025-01-28T16:32:39.307Z · comments (0)
Using an LLM for creative writing feels wrong to me
Declan Molony (declan-molony) · 2025-01-28T06:42:24.799Z · comments (13)
Untrusted monitoring insights from watching ChatGPT play coordination games
jwfiredragon · 2025-01-29T04:53:33.125Z · comments (0)
The many failure modes of consumer-grade LLMs
dereshev · 2025-01-26T19:01:09.891Z · comments (0)
Is it ethical to work in AI "content evaluation"?
anon_databoy123 (noob1234) · 2025-01-27T19:58:26.176Z · comments (2)
King Lear - A Reinterpretation
Kailuo Wang (kailuo-wang) · 2025-01-21T23:54:21.583Z · comments (1)
Should Art Carry the Weight of Shaping our Values?
Krishna Maneesha Dendukuri (krishna_maneesha-d) · 2025-01-28T18:43:32.517Z · comments (0)
Updating and Editing Factual Knowledge in Language Models
Dhananjay Ashok (dhananjay-ashok) · 2025-01-23T19:34:37.121Z · comments (2)
Starting Thoughts on RLHF
Michael Flood (michael-flood) · 2025-01-23T22:16:49.793Z · comments (0)
[question] Who's track record of AI predictions would you like to see evaluated?
Jonny Spicer (jonnyspicer) · 2025-01-29T12:05:30.311Z · answers+comments (1)
The Hidden Status Game in Hospital Slacking
EpistemicExplorer · 2025-01-20T18:35:54.086Z · comments (4)
[link] Predation as Payment for Criticism
Benquo · 2025-01-30T01:06:27.591Z · comments (2)
Absorbing Your Friends' Powers
Alice Blair (Diatom) · 2025-01-30T02:32:27.091Z · comments (0)
Superintelligent AI will make mistakes
juggins · 2025-01-30T15:12:50.561Z · comments (1)
[link] Hello World
Charlie Sanders (charlie-sanders) · 2025-01-30T15:33:57.427Z · comments (0)
[question] Implication of Uncomputable Problems
Nathan1123 · 2025-01-30T16:48:38.222Z · answers+comments (0)
Locating and Editing Knowledge in LMs
Dhananjay Ashok (dhananjay-ashok) · 2025-01-24T22:53:40.559Z · comments (0)
The Road to Evil Is Paved with Good Objectives: Framework to Classify and Fix Misalignments.
Shivam · 2025-01-30T02:44:47.907Z · comments (0)
Democratizing AI Governance: Balancing Expertise and Public Participation
Lucile Ter-Minassian (lucile-ter-minassian) · 2025-01-21T18:29:06.160Z · comments (0)
Gettier cases, Rigid Designators, and Referential Opacity
Antigone (luke-st-clair) · 2025-01-28T18:46:10.180Z · comments (0)
Using the probabilistic method to bound the performance of toy transformers
Alex Gibson · 2025-01-21T23:01:38.067Z · comments (0)
[question] Enhanced Clarity to Bridge the AI Labeling Gap?
Pathways (jimmy-1) · 2025-01-26T06:48:36.396Z · answers+comments (0)
Navigating Diversity: Understanding Human Behaviors Through Genetics, Neurodivergence, and Trauma
j_passeri · 2025-01-26T08:23:16.352Z · comments (0)
[link] Ideas for CoT Models: A Geometric Perspective on Latent Space Reasoning
Rohan Ganapavarapu (rohan-ganapavarapu) · 2025-01-24T19:01:47.339Z · comments (0)
To know or not to know
arisAlexis (arisalexis) · 2025-01-27T13:17:33.672Z · comments (3)
Are we the Wolves now? Human Eugenics under AI Control
Brit (james-spencer) · 2025-01-30T08:31:34.423Z · comments (0)
AI and Non-Existence.
Eleven · 2025-01-25T19:36:22.624Z · comments (9)
All pigeons are ugly!
Eris (anton-zheltoukhov) · 2025-01-28T15:18:25.507Z · comments (2)
The ‘anti woke’ are positioned to win but can they capitalize?
Hzn · 2025-01-21T09:52:50.673Z · comments (0)
Rational Utopia
ank · 2025-01-29T14:16:09.862Z · comments (2)
A critique of Soares "4 background claims"
YanLyutnev (YanLutnev) · 2025-01-27T20:27:51.026Z · comments (0)
The Goodness of Morning
YanLyutnev (YanLutnev) · 2025-01-27T23:25:38.273Z · comments (1)
It is (probably) time for a Buterlian Jihad
waterlubber · 2025-01-20T05:55:17.156Z · comments (13)
Hitler was not a monster
halgir · 2025-01-21T18:21:55.777Z · comments (5)
The Fundamental Circularity Theorem: Why Some Mathematical Behaviours Are Inherently Unprovable
Alister Munday (alister-munday) · 2025-01-22T18:20:25.697Z · comments (2)
The real political spectrum
Hzn · 2025-01-22T08:55:39.328Z · comments (0)
← previous page (newer posts) · next page (older posts) →