LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Algebraic Linguistics
abstractapplic · 2024-12-07T19:18:39.935Z · comments (27)
A Sober Look at Steering Vectors for LLMs
Joschka Braun (joschka-braun) · 2024-11-23T17:30:00.745Z · comments (0)
AXRP Episode 38.2 - Jesse Hoogland on Singular Learning Theory
DanielFilan · 2024-11-27T06:30:03.821Z · comments (0)
A Principled Cartoon Guide to NVC
plex (ete) · 2025-01-07T21:01:07.904Z · comments (5)
Monthly Roundup #26: January 2025
Zvi · 2025-01-20T15:30:08.680Z · comments (15)
Voluntary Salary Reduction
jefftk (jkaufman) · 2025-01-15T03:40:02.909Z · comments (2)
“Charity” as a conflationary alliance term
Jan_Kulveit · 2024-12-12T21:49:50.057Z · comments (2)
Alternative Cancer Care As Biohacking & Book Review: Surviving "Terminal" Cancer
DenizT · 2025-01-06T07:43:52.773Z · comments (6)
What's Behind the SynBio Bust?
sarahconstantin · 2025-01-30T22:30:06.916Z · comments (2)
XX by Rian Hughes: Pretentious Bullshit
Yair Halberstadt (yair-halberstadt) · 2025-01-08T13:02:52.438Z · comments (5)
Dress Up For Secular Solstice
Gordon H.S. (gordon-schaefer) · 2024-12-15T16:28:24.607Z · comments (13)
D&D.Sci Dungeonbuilding: the Dungeon Tournament Evaluation & Ruleset
aphyer · 2025-01-07T05:02:25.929Z · comments (8)
Learning Multi-Level Features with Matryoshka SAEs
Bart Bussmann (Stuckwork) · 2024-12-19T15:59:00.036Z · comments (4)
Compute and size limits on AI are the actual danger
Shmi (shminux) · 2024-11-23T21:29:37.433Z · comments (5)
If all trade is voluntary, then what is "exploitation?"
Darmani · 2024-12-27T11:21:30.036Z · comments (61)
[link] Moderately Skeptical of "Risks of Mirror Biology"
Davidmanheim · 2024-12-20T12:57:31.824Z · comments (3)
[link] Announcing the Q1 2025 Long-Term Future Fund grant round
Linch · 2024-12-20T02:20:22.448Z · comments (0)
[question] What is MIRI currently doing?
Roko · 2024-12-14T02:39:20.886Z · answers+comments (14)
Theory of Change for AI Safety Camp
Linda Linsefors · 2025-01-22T22:07:10.664Z · comments (3)
[Letter] Chinese Quickstart
lsusr · 2024-12-01T06:38:15.796Z · comments (0)
[link] You should delay engineering-heavy research in light of R&D automation
Daniel Paleka · 2025-01-07T02:11:11.501Z · comments (3)
The Monster in Our Heads
testingthewaters · 2025-01-19T23:58:11.251Z · comments (4)
[link] Anthropic CEO calls for RSI
Andrea_Miotti (AndreaM) · 2025-01-29T16:54:24.943Z · comments (10)
Eliciting bad contexts
Geoffrey Irving · 2025-01-24T10:39:39.358Z · comments (2)
[link] A progress policy agenda
jasoncrawford · 2024-12-19T18:42:37.327Z · comments (1)
People aren't properly calibrated on FrontierMath
cakubilo · 2024-12-23T19:35:44.467Z · comments (4)
Operator
Zvi · 2025-01-28T20:00:08.374Z · comments (1)
Two Weeks Without Sweets
jefftk (jkaufman) · 2024-12-31T03:30:02.003Z · comments (0)
1. Meet the Players: Value Diversity
Allison Duettmann (allison-duettmann) · 2025-01-02T19:00:52.696Z · comments (2)
[Cross-post] Every Bay Area "Walled Compound"
davekasten · 2025-01-23T15:05:08.629Z · comments (3)
Extending control evaluations to non-scheming threats
joshc (joshua-clymer) · 2025-01-12T01:42:54.614Z · comments (1)
[link] What I expected from this site: A LessWrong review
Nathan Young · 2024-12-20T11:27:39.683Z · comments (5)
Quantum without complication
Optimization Process · 2025-01-16T08:53:11.347Z · comments (2)
Per Tribalismum ad Astra
Martin Sustrik (sustrik) · 2025-01-19T06:50:07.763Z · comments (5)
Call for evaluators: Participate in the European AI Office workshop on general-purpose AI models and systemic risks
Tom DAVID (tom-david) · 2024-11-27T02:54:16.263Z · comments (0)
AI Safety Seed Funding Network - Join as a Donor or Investor
Alexandra Bos (AlexandraB) · 2024-12-16T19:30:43.812Z · comments (0)
Why Aligning an LLM is Hard, and How to Make it Easier
RogerDearnaley (roger-d-1) · 2025-01-23T06:44:04.048Z · comments (3)
Mini Go: Gateway Game
jefftk (jkaufman) · 2025-01-14T03:30:02.020Z · comments (1)
You can validly be seen and validated by a chatbot
Kaj_Sotala · 2024-12-20T12:00:03.015Z · comments (3)
Gratitudes: Rational Thanks Giving
Seth Herd · 2024-11-29T03:09:47.410Z · comments (2)
Acknowledging Background Information with P(Q|I)
JenniferRM · 2024-12-24T18:50:25.323Z · comments (8)
Aligning AI Safety Projects with a Republican Administration
Deric Cheng (deric-cheng) · 2024-11-21T22:12:27.502Z · comments (1)
Renormalization Redux: QFT Techniques for AI Interpretability
Lauren Greenspan (LaurenGreenspan) · 2025-01-18T03:54:28.652Z · comments (12)
NYC Congestion Pricing: Early Days
Zvi · 2025-01-14T14:00:07.445Z · comments (0)
[link] Our new video about goal misgeneralization, plus an apology
Writer · 2025-01-14T14:07:21.648Z · comments (0)
AI #101: The Shallow End
Zvi · 2025-01-30T14:50:08.269Z · comments (1)
[question] Why are there no interesting (1D, 2-state) quantum cellular automata?
Optimization Process · 2024-11-26T00:11:37.833Z · answers+comments (13)
Agents don't have to be aligned to help us achieve an indefinite pause.
Hastings (hastings-greer) · 2025-01-25T18:51:03.523Z · comments (0)
Disagreement on AGI Suggests It’s Near
tangerine · 2025-01-07T20:42:43.456Z · comments (15)
Compositionality and Ambiguity:  Latent Co-occurrence and Interpretable Subspaces
Matthew A. Clarke (Antigone) · 2024-12-20T15:16:51.857Z · comments (0)
← previous page (newer posts) · next page (older posts) →