LessWrong 2.0 Reader

← previous page (newer posts) · next page (older posts) →

Wobbly Table Theorem in Practice
Morpheus · 2023-09-28T14:33:16.898Z · comments (None)
Weighing Animal Worth
jefftk (jkaufman) · 2023-09-28T13:50:06.752Z · comments (6)
Coordination and well-scaling projects
Loppukilpailija (jarviniemi) · 2023-09-28T08:01:52.080Z · comments (None)
[link] ARC Evals: Responsible Scaling Policies
Zach Stein-Perlman · 2023-09-28T04:30:37.140Z · comments (9)
Petrov Day Retrospective, 2023 (re: the most important virtue of Petrov Day & unilaterally promoting it)
Ruby · 2023-09-28T02:48:58.994Z · comments (70)
[link] Jimmy Apples, source of the rumor that OpenAI has achieved AGI internally, is a credible insider.
Jorterder (utebaypi) · 2023-09-28T01:20:47.628Z · comments (2)
Investigating the rumors of OpenAI achieving AGI
Jorterder (utebaypi) · 2023-09-28T01:17:14.778Z · comments (1)
[link] Alibaba Group releases Qwen, 14B parameter LLM
nikola (nikolaisalreadytaken) · 2023-09-28T00:12:03.653Z · comments (1)
[link] Metaculus Launches 2023/2024 FluSight Challenge Supporting CDC, $5K in Prizes
ChristianWilliams · 2023-09-27T21:35:15.616Z · comments (None)
Projects I would like to see (possibly at AI Safety Camp)
Linda Linsefors · 2023-09-27T21:27:29.539Z · comments (4)
Towards Better Milestones for Monitoring AI Capabilities
snewman · 2023-09-27T21:18:30.966Z · comments (None)
[question] [link] Is Bjorn Lomborg roughly right about climate change policy?
yhoiseth · 2023-09-27T20:06:30.722Z · answers+comments (12)
Commonsense Good, Creative Good
jefftk (jkaufman) · 2023-09-27T19:50:07.486Z · comments (7)
Petrov Day [Spoiler Warning]
lsusr · 2023-09-27T19:20:04.657Z · comments (6)
[link] The Hidden Complexity of Wishes - The Animation
Writer · 2023-09-27T17:59:37.188Z · comments (None)
[link] MMLU’s Moral Scenarios Benchmark Doesn’t Measure What You Think it Measures
corey morris (corey-morris) · 2023-09-27T17:54:39.598Z · comments (1)
[question] What's your standard for good work performance?
Chi Nguyen · 2023-09-27T16:58:16.114Z · answers+comments (3)
The Role of Groups in the Progression of Human Understanding
Chris_Leong · 2023-09-27T15:09:45.445Z · comments (None)
[link] The Great Disembedding
rogersbacon · 2023-09-27T14:53:25.116Z · comments (4)
[question] how do short-timeliners reason about the differences between brain and AI?
JavierCC (javier-caeiro-canabal) · 2023-09-27T08:13:58.659Z · answers+comments (11)
[question] Is there a widely accepted metric for 'genuineness' in interpersonal communication?
M. Y. Zuo · 2023-09-27T05:30:46.716Z · answers+comments (3)
Bariatric surgery seems like a no-brainer for most morbidly obese people
lc · 2023-09-27T01:05:32.976Z · comments (11)
[link] Jacob on the Precipice
Richard_Ngo (ricraz) · 2023-09-26T21:16:39.590Z · comments (7)
Text Posts from the Kids Group: 2022
jefftk (jkaufman) · 2023-09-26T20:40:06.656Z · comments (2)
[link] GPT-4 for personal productivity: online distraction blocker
Sergii (sergey-kharagorgiev) · 2023-09-26T17:41:31.031Z · comments (11)
ARENA 2.0 - Impact Report
TheMcDouglas · 2023-09-26T17:13:19.952Z · comments (4)
Mechanistic Interpretability Reading group
1stuserhere (firstuser-here) · 2023-09-26T16:26:44.757Z · comments (None)
Announcing the CNN Interpretability Competition
scasper · 2023-09-26T16:21:50.276Z · comments (None)
Making AIs less likely to be spiteful
Nicolas Macé (NicolasMace) · 2023-09-26T14:12:06.202Z · comments (2)
[link] [Linkpost] Mark Zuckerberg confronted about Meta's Llama 2 AI's ability to give users detailed guidance on making anthrax - Business Insider
mic (michael-chen) · 2023-09-26T12:05:57.396Z · comments (7)
Enforcing Far-Future Contracts for Governments
FCCC · 2023-09-26T04:26:46.442Z · comments (18)
Carioca Petrov Day
Giskard (tiago-macedo) · 2023-09-26T00:30:36.906Z · comments (None)
[question] A few Alignment questions: utility optimizers, SLT, sharp left turn and identifiability
Igor Timofeev (igor-timofeev-1) · 2023-09-26T00:27:23.229Z · answers+comments (1)
Impact stories for model internals: an exercise for interpretability researchers
jenny · 2023-09-25T23:15:29.189Z · comments (3)
[link] Autonomic Sanity
Sable · 2023-09-25T22:37:07.262Z · comments (9)
[question] What is wrong with this "utility switch button problem" approach?
Donald Hobson (donald-hobson) · 2023-09-25T21:36:47.166Z · answers+comments (3)
You should just smile at strangers a lot
chaosmage · 2023-09-25T20:12:56.907Z · comments (10)
[link] The King and the Golem
Richard_Ngo (ricraz) · 2023-09-25T19:51:22.980Z · comments (11)
[link] Public Opinion on AI Safety: AIMS 2023 and 2021 Summary
Jacy Reese Anthis (Jacy Reese) · 2023-09-25T18:55:41.532Z · comments (2)
Welcome to Apply: The 2024 Vitalik Buterin Fellowships in AI Existential Safety by FLI!
Zhijing Jin · 2023-09-25T18:42:13.320Z · comments (2)
Evaluating hidden directions on the utility dataset: classification, steering and removal
Annah (annah) · 2023-09-25T17:19:13.988Z · comments (3)
Linkpost: A model of biases as arising from meta-beliefs
JuanGarcia · 2023-09-25T17:14:55.538Z · comments (None)
[question] What causes a decision theory to be used?
Dagon · 2023-09-25T16:33:36.161Z · answers+comments (2)
[link] Understanding strategic deception and deceptive alignment
Marius Hobbhahn (marius-hobbhahn) · 2023-09-25T16:27:47.357Z · comments (15)
[link] The Merits of Contrarianism & Why I hate Chatbots. [My Experience with the Ideological Turing Test @ a Less Wrong meetup]
Amina V. (aminah-vinson) · 2023-09-25T16:13:04.113Z · comments (1)
Inside Views, Impostor Syndrome, and the Great LARP
johnswentworth · 2023-09-25T16:08:17.040Z · comments (40)
“X distracts from Y” as a thinly-disguised fight over group status / politics
Steven Byrnes (steve2152) · 2023-09-25T15:18:18.644Z · comments (13)
[link] Amazon to invest up to $4 billion in Anthropic
Davis_Kingsley · 2023-09-25T14:55:35.983Z · comments (8)
Should Effective Altruists be Valuists instead of utilitarians?
spencerg · 2023-09-25T14:03:10.958Z · comments (3)
Feedly Breaks MathML
jefftk (jkaufman) · 2023-09-25T13:40:05.759Z · comments (3)