LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[question] What are some positive developments in AI safety in 2024?
Satron · 2024-11-15T10:32:39.541Z · answers+comments (5)

Mini PAPR Review
jefftk (jkaufman) · 2024-12-12T19:10:01.692Z · comments (0)

[link] When do experts think human-level AI will be created?
Vishakha (vishakha-agrawal) · 2024-12-30T06:20:33.158Z · comments (0)

Orange and Strawberry Truffles
jefftk (jkaufman) · 2025-01-05T01:50:01.587Z · comments (1)

[link] Predictions of Near-Term Societal Changes Due to Artificial Intelligence
Annapurna (jorge-velez) · 2024-12-29T14:53:57.176Z · comments (0)

A Ground-Level Perspective on Capacity Building in International Development
Sean Aubin (sean-aubin) · 2025-01-05T20:36:54.308Z · comments (1)

[question] Is "hidden complexity of wishes problem" solved?
Roman Malov · 2025-01-05T22:59:30.911Z · answers+comments (4)

Apply to be a TA for TARA
yanni kyriacos (yanni) · 2024-12-20T02:25:03.514Z · comments (0)

[question] Using hex to get murder advice from GPT-4o
Laurence Freeman (laurence-freeman) · 2024-11-13T18:30:23.475Z · answers+comments (5)

[link] Bridgewater x Metaculus Forecasting Contest Goes Global — Feb 3, $25k, Opportunities
ChristianWilliams · 2025-01-07T21:40:30.899Z · comments (0)

Exporting Facebook Comments, Again
jefftk (jkaufman) · 2024-11-30T12:40:07.339Z · comments (6)

Festival Stats 2024
jefftk (jkaufman) · 2024-11-12T02:00:04.831Z · comments (0)

[link] Exploring Cooperation: The Path to Utopia
Davidmanheim · 2024-12-25T18:31:55.565Z · comments (0)

Misfortune and Many Worlds
Jonah Wilberg (jrwilb@googlemail.com) · 2024-12-08T20:25:12.109Z · comments (4)

[link] Bird's eye view: An interactive representation to see large collection of text "from above".
Alexandre Variengien (alexandre-variengien) · 2024-12-21T00:15:02.239Z · comments (4)

[link] Updating on Bad Arguments
Guive (GAA) · 2024-12-21T01:19:15.686Z · comments (2)

[link] What is "wireheading"?
Vishakha (vishakha-agrawal) · 2024-12-17T07:49:50.957Z · comments (0)

[link] Chemical Turing Machines
Yudhister Kumar (randomwalks) · 2024-12-03T05:26:25.950Z · comments (2)

Americans are fat and sick—and it’s their fault…right?
Declan Molony (declan-molony) · 2024-11-19T06:41:36.648Z · comments (6)

[link] Roots of Progress is hiring an event manager
jasoncrawford · 2024-12-03T20:46:42.929Z · comments (0)

[link] A Public Choice Take on Effective Altruism
vaishnav92 · 2024-12-15T16:58:50.683Z · comments (4)

CCing Mailing Lists on External Communication
jefftk (jkaufman) · 2024-12-04T22:00:02.038Z · comments (0)

Arthropod (non) sentience
Arturo Macias (arturo-macias) · 2024-11-25T16:01:58.514Z · comments (8)

Force Sequential Output with SCP?
jefftk (jkaufman) · 2024-11-09T12:40:06.098Z · comments (4)

[link] Densing Law of LLMs
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-12-08T19:35:09.244Z · comments (2)

[link] Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-26T09:58:44.025Z · comments (0)

[link] Frontier AI systems have surpassed the self-replicating red line
aproteinengine · 2024-12-11T03:06:14.927Z · comments (4)

Contra Musician Gender II
jefftk (jkaufman) · 2024-11-13T03:30:09.510Z · comments (0)

[question] What's the best metric for measuring quality of life?
ChristianKl · 2024-12-27T14:29:30.813Z · answers+comments (4)

Value/Utility: A History
Lorec · 2024-11-19T23:01:39.167Z · comments (0)

[link] Markets Are Information - Beating the Sportsbooks at Their Own Game
JJXW · 2024-11-07T20:58:43.389Z · comments (1)

[link] Ideologies are slow and necessary, for now
Gabriel Alfour (gabriel-alfour-1) · 2024-12-23T01:57:47.153Z · comments (1)

Smart people should do biology
Haotian (haotian-huang) · 2024-12-05T19:11:20.671Z · comments (2)

Notes from Copenhagen Secular Solstice 2024
Søren Elverlin (soren-elverlin-1) · 2024-12-22T15:08:20.848Z · comments (0)

[link] AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to create a military tool for the PLA, a Google AI system discovered a zero-day cybersecurity vulnerability, and Complex Systems
Corin Katzke (corin-katzke) · 2024-11-19T16:36:40.501Z · comments (0)

Do Deep Neural Networks Have Brain-like Representations?: A Summary of Disagreements
Joseph Emerson (joseph-emerson) · 2024-11-18T00:07:15.155Z · comments (0)

Near- and medium-term AI Control Safety Cases
Martín Soto (martinsq) · 2024-12-23T17:37:48.860Z · comments (0)

I Have A New Paper Out Arguing Against The Asymmetry And For The Existence of Happy People Being Very Good
omnizoid · 2024-11-21T17:21:41.426Z · comments (3)

Refuting Searle’s wall, Putnam’s rock, and Johnson’s popcorn
Davidmanheim · 2024-12-09T08:24:26.594Z · comments (29)

Executive Director for AIS France - Expression of interest
gergogaspar (gergo-gaspar) · 2024-12-19T08:14:54.023Z · comments (0)

How to make evals for the AISI evals bounty
TheManxLoiner · 2024-12-03T10:44:45.700Z · comments (0)

0 Motivation Mapping through Information Theory
P. João (gabriel-brito) · 2024-12-16T23:17:17.254Z · comments (0)

[link] Anthropic teams up with Palantir and AWS to sell AI to defense customers
Matrice Jacobine · 2024-11-09T11:50:34.050Z · comments (0)

Is this a better way to do matchmaking?
Chipmonk · 2024-12-16T19:06:14.574Z · comments (4)

AXRP Episode 38.4 - Shakeel Hashim on AI Journalism
DanielFilan · 2025-01-05T00:20:05.096Z · comments (0)

[link] Corrigibility should be an AI's Only Goal
PeterMcCluskey · 2024-12-29T20:25:17.922Z · comments (1)

Not all biases are equal - a study of sycophancy and bias in fine-tuned LLMs
jakub_krys (kryjak) · 2024-11-11T23:11:15.233Z · comments (0)

[link] An Uncanny Moat
Adam Newgas (BorisTheBrave) · 2024-11-15T11:39:15.165Z · comments (0)

[link] Riffing on Machines of Loving Grace
an1lam · 2025-01-01T01:06:45.122Z · comments (0)

Proactive 'If-Then' Safety Cases
Nathan Helm-Burger (nathan-helm-burger) · 2024-11-18T21:16:37.237Z · comments (0)

← previous page (newer posts) · next page (older posts) →

^{^}

Whether they're more capable by dint of being bigger (GPT-4), or being trained on better data (Sonnet 3.5.1), or having a better training loop + architecture (DeepSeek V3), etc.

^{^}

My earlier comment about this mistakenly used $β_{1}$ and $β_{2}$ in place of $β_{t}$ and $β_{s}$ , which may have been confusing. I'll go fix that to be consistent with your notation.

LessWrong 2.0 Reader

Archive

Recent comments