LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[question] What are some positive developments in AI safety in 2024?
Satron · 2024-11-15T10:32:39.541Z · answers+comments (5)
Mini PAPR Review
jefftk (jkaufman) · 2024-12-12T19:10:01.692Z · comments (0)
[link] When do experts think human-level AI will be created?
Vishakha (vishakha-agrawal) · 2024-12-30T06:20:33.158Z · comments (0)
Orange and Strawberry Truffles
jefftk (jkaufman) · 2025-01-05T01:50:01.587Z · comments (1)
[link] Predictions of Near-Term Societal Changes Due to Artificial Intelligence
Annapurna (jorge-velez) · 2024-12-29T14:53:57.176Z · comments (0)
A Ground-Level Perspective on Capacity Building in International Development
Sean Aubin (sean-aubin) · 2025-01-05T20:36:54.308Z · comments (1)
[question] Is "hidden complexity of wishes problem" solved?
Roman Malov · 2025-01-05T22:59:30.911Z · answers+comments (4)
Apply to be a TA for TARA
yanni kyriacos (yanni) · 2024-12-20T02:25:03.514Z · comments (0)
[question] Using hex to get murder advice from GPT-4o
Laurence Freeman (laurence-freeman) · 2024-11-13T18:30:23.475Z · answers+comments (5)
[link] Bridgewater x Metaculus Forecasting Contest Goes Global — Feb 3, $25k, Opportunities
ChristianWilliams · 2025-01-07T21:40:30.899Z · comments (0)
Exporting Facebook Comments, Again
jefftk (jkaufman) · 2024-11-30T12:40:07.339Z · comments (6)
Festival Stats 2024
jefftk (jkaufman) · 2024-11-12T02:00:04.831Z · comments (0)
[link] Exploring Cooperation: The Path to Utopia
Davidmanheim · 2024-12-25T18:31:55.565Z · comments (0)
Misfortune and Many Worlds
Jonah Wilberg (jrwilb@googlemail.com) · 2024-12-08T20:25:12.109Z · comments (4)
[link] Bird's eye view: An interactive representation to see large collection of text "from above".
Alexandre Variengien (alexandre-variengien) · 2024-12-21T00:15:02.239Z · comments (4)
[link] Updating on Bad Arguments
Guive (GAA) · 2024-12-21T01:19:15.686Z · comments (2)
[link] What is "wireheading"?
Vishakha (vishakha-agrawal) · 2024-12-17T07:49:50.957Z · comments (0)
[link] Chemical Turing Machines
Yudhister Kumar (randomwalks) · 2024-12-03T05:26:25.950Z · comments (2)
Americans are fat and sick—and it’s their fault…right?
Declan Molony (declan-molony) · 2024-11-19T06:41:36.648Z · comments (6)
[link] Roots of Progress is hiring an event manager
jasoncrawford · 2024-12-03T20:46:42.929Z · comments (0)
[link] A Public Choice Take on Effective Altruism
vaishnav92 · 2024-12-15T16:58:50.683Z · comments (4)
CCing Mailing Lists on External Communication
jefftk (jkaufman) · 2024-12-04T22:00:02.038Z · comments (0)
Arthropod (non) sentience
Arturo Macias (arturo-macias) · 2024-11-25T16:01:58.514Z · comments (8)
Force Sequential Output with SCP?
jefftk (jkaufman) · 2024-11-09T12:40:06.098Z · comments (4)
[link] Densing Law of LLMs
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-12-08T19:35:09.244Z · comments (2)
[link] Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-11-26T09:58:44.025Z · comments (0)
[link] Frontier AI systems have surpassed the self-replicating red line
aproteinengine · 2024-12-11T03:06:14.927Z · comments (4)
Contra Musician Gender II
jefftk (jkaufman) · 2024-11-13T03:30:09.510Z · comments (0)
[question] What's the best metric for measuring quality of life?
ChristianKl · 2024-12-27T14:29:30.813Z · answers+comments (4)
Value/Utility: A History
Lorec · 2024-11-19T23:01:39.167Z · comments (0)
[link] Markets Are Information - Beating the Sportsbooks at Their Own Game
JJXW · 2024-11-07T20:58:43.389Z · comments (1)
[link] Ideologies are slow and necessary, for now
Gabriel Alfour (gabriel-alfour-1) · 2024-12-23T01:57:47.153Z · comments (1)
Smart people should do biology
Haotian (haotian-huang) · 2024-12-05T19:11:20.671Z · comments (2)
Notes from Copenhagen Secular Solstice 2024
Søren Elverlin (soren-elverlin-1) · 2024-12-22T15:08:20.848Z · comments (0)
[link] AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to create a military tool for the PLA, a Google AI system discovered a zero-day cybersecurity vulnerability, and Complex Systems
Corin Katzke (corin-katzke) · 2024-11-19T16:36:40.501Z · comments (0)
Do Deep Neural Networks Have Brain-like Representations?: A Summary of Disagreements
Joseph Emerson (joseph-emerson) · 2024-11-18T00:07:15.155Z · comments (0)
Near- and medium-term AI Control Safety Cases
Martín Soto (martinsq) · 2024-12-23T17:37:48.860Z · comments (0)
I Have A New Paper Out Arguing Against The Asymmetry And For The Existence of Happy People Being Very Good
omnizoid · 2024-11-21T17:21:41.426Z · comments (3)
Refuting Searle’s wall, Putnam’s rock, and Johnson’s popcorn
Davidmanheim · 2024-12-09T08:24:26.594Z · comments (29)
Executive Director for AIS France - Expression of interest
gergogaspar (gergo-gaspar) · 2024-12-19T08:14:54.023Z · comments (0)
How to make evals for the AISI evals bounty
TheManxLoiner · 2024-12-03T10:44:45.700Z · comments (0)
0 Motivation Mapping through Information Theory
P. João (gabriel-brito) · 2024-12-16T23:17:17.254Z · comments (0)
[link] Anthropic teams up with Palantir and AWS to sell AI to defense customers
Matrice Jacobine · 2024-11-09T11:50:34.050Z · comments (0)
Is this a better way to do matchmaking?
Chipmonk · 2024-12-16T19:06:14.574Z · comments (4)
AXRP Episode 38.4 - Shakeel Hashim on AI Journalism
DanielFilan · 2025-01-05T00:20:05.096Z · comments (0)
[link] Corrigibility should be an AI's Only Goal
PeterMcCluskey · 2024-12-29T20:25:17.922Z · comments (1)
Not all biases are equal - a study of sycophancy and bias in fine-tuned LLMs
jakub_krys (kryjak) · 2024-11-11T23:11:15.233Z · comments (0)
[link] An Uncanny Moat
Adam Newgas (BorisTheBrave) · 2024-11-15T11:39:15.165Z · comments (0)
[link] Riffing on Machines of Loving Grace
an1lam · 2025-01-01T01:06:45.122Z · comments (0)
Proactive 'If-Then' Safety Cases
Nathan Helm-Burger (nathan-helm-burger) · 2024-11-18T21:16:37.237Z · comments (0)
← previous page (newer posts) · next page (older posts) →