LessWrong 2.0 Reader

View: New · Old · Top

← previous page (newer posts) · next page (older posts) →

[link] RAND report finds no effect of current LLMs on viability of bioterrorism attacks
StellaAthena · 2024-01-25T19:17:30.493Z · comments (14)
[question] Bayesian Reflection Principles and Ignorance of the Future
crickets · 2024-01-25T19:00:16.463Z · answers+comments (3)
"Does your paradigm beget new, good, paradigms?"
Raemon · 2024-01-25T18:23:15.497Z · comments (5)
AI #48: The Talk of Davos
Zvi · 2024-01-25T16:20:26.625Z · comments (9)
Importing a Python File by Name
jefftk (jkaufman) · 2024-01-25T16:00:13.210Z · comments (7)
[link] [Repost] The Copenhagen Interpretation of Ethics
mesaoptimizer · 2024-01-25T15:20:08.162Z · comments (3)
Nash Bargaining between Subagents doesn't solve the Shutdown Problem
A.H. (AlfredHarwood) · 2024-01-25T10:47:11.877Z · comments (1)
Status-oriented spending
Adam Zerner (adamzerner) · 2024-01-25T06:46:47.029Z · comments (19)
Protecting agent boundaries
Chipmonk · 2024-01-25T04:13:50.993Z · comments (6)
[question] What subjects are unexpectedly high-utility?
FinalFormal2 · 2024-01-25T04:00:28.448Z · answers+comments (18)
[question] Is a random box of gas predictable after 20 seconds?
Thomas Kwa (thomas-kwa) · 2024-01-24T23:00:53.184Z · answers+comments (35)
[question] Will quantum randomness affect the 2028 election?
Thomas Kwa (thomas-kwa) · 2024-01-24T22:54:30.800Z · answers+comments (48)
[link] AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes
aogara (Aidan O'Gara) · 2024-01-24T19:38:33.461Z · comments (1)
Krueger Lab AI Safety Internship 2024
Joey Bream (joey-bream) · 2024-01-24T19:17:40.606Z · comments (0)
Agents that act for reasons: a thought experiment
Michele Campolo · 2024-01-24T16:47:19.469Z · comments (0)
[link] Impact Assessment of AI Safety Camp (Arb Research)
Samuel Holton (samuel-holton) · 2024-01-24T16:19:12.431Z · comments (0)
The case for ensuring that powerful AIs are controlled
ryan_greenblatt · 2024-01-24T16:11:51.354Z · comments (66)
LLMs can strategically deceive while doing gain-of-function research
Igor Ivanov (igor-ivanov) · 2024-01-24T15:45:08.795Z · comments (4)
Monthly Roundup #14: January 2024
Zvi · 2024-01-24T12:50:09.231Z · comments (22)
This might be the last AI Safety Camp
Remmelt (remmelt-ellen) · 2024-01-24T09:33:29.438Z · comments (33)
Global LessWrong/AC10 Meetup on VRChat
Tomás B. (Bjartur Tómas) · 2024-01-24T05:44:26.587Z · comments (2)
Humans aren't fleeb.
Charlie Steiner · 2024-01-24T05:31:46.929Z · comments (5)
A Paradigm Shift in Sustainability
Jose Miguel Cruz y Celis (jose-miguel-cruz-y-celis) · 2024-01-23T23:34:44.121Z · comments (0)
From Finite Factors to Bayes Nets
J Bostock (Jemist) · 2024-01-23T20:03:51.845Z · comments (7)
Institutional economics through the lens of scale-free regulative development, morphogenesis, and cognitive science
Roman Leventov · 2024-01-23T19:42:31.739Z · comments (0)
Making a Secular Solstice Songbook
jefftk (jkaufman) · 2024-01-23T19:40:05.055Z · comments (6)
[link] Simple Appreciations
Jonathan Moregård (JonathanMoregard) · 2024-01-23T16:23:52.001Z · comments (11)
[question] What environmental cues had you not seen them would have ended in disaster?
koratkar · 2024-01-23T14:59:33.545Z · answers+comments (1)
[link] Loneliness and suicide mitigation for students using GPT3-enabled chatbots (survey of Replika users in Nature)
Kaj_Sotala · 2024-01-23T14:05:40.986Z · comments (2)
[link] "Safety as a Scientific Pursuit" (2024)
technicalities · 2024-01-23T12:40:13.902Z · comments (3)
Brainstorming: Slow Takeoff
David Piepgrass (david-piepgrass) · 2024-01-23T06:58:08.107Z · comments (0)
Reframing Acausal Trolling as Acausal Patronage
StrivingForLegibility · 2024-01-23T03:04:53.706Z · comments (0)
Orthogonality or the "Human Worth Hypothesis"?
Jeffs · 2024-01-23T00:57:41.064Z · comments (31)
[link] the subreddit size threshold
bhauth · 2024-01-23T00:38:13.747Z · comments (3)
[link] Starting in mechanistic interpretability
Jakub Smékal (jakub-smekal) · 2024-01-22T23:40:56.871Z · comments (0)
We need a Science of Evals
Marius Hobbhahn (marius-hobbhahn) · 2024-01-22T20:30:39.493Z · comments (13)
[link] Announcing the SoS Research Collective for independent researchers (and academics thinking independently)
rogersbacon · 2024-01-22T20:13:04.731Z · comments (0)
[link] A Brief Assessment of OpenAI's Preparedness Framework & Some Suggestions for Improvement
simeon_c (WayZ) · 2024-01-22T20:08:57.250Z · comments (0)
D&D.Sci(-fi): Colonizing the SuperHyperSphere [Evaluation and Ruleset]
abstractapplic · 2024-01-22T19:20:05.001Z · comments (7)
' petertodd'’s last stand: The final days of open GPT-3 research
mwatkins · 2024-01-22T18:47:00.710Z · comments (16)
[link] InterLab – a toolkit for experiments with multi-agent interactions
Tomáš Gavenčiak (tomas-gavenciak) · 2024-01-22T18:23:35.661Z · comments (0)
San Fernando Valley Rationalist Meetup
Thomas Broadley (thomas-broadley) · 2024-01-22T16:49:59.235Z · comments (1)
Who Organizes Dances?
jefftk (jkaufman) · 2024-01-22T14:30:06.045Z · comments (0)
Values Darwinism
pchvykov · 2024-01-22T10:44:41.548Z · comments (13)
[question] The akrasia doom loop and executive function disorders: a question
TeaTieAndHat (Augustin Portier) · 2024-01-22T07:01:09.646Z · answers+comments (7)
[link] Predicting AGI by the Turing Test
Yuxi_Liu · 2024-01-22T04:22:38.526Z · comments (2)
Incorporating Justice Theory into Decision Theory
StrivingForLegibility · 2024-01-21T19:17:11.653Z · comments (20)
[link] Deliberate Dysentery: Q&A about Human Challenge Trials
Niko_McCarty (niko-2) · 2024-01-21T19:05:35.754Z · comments (1)
When Does Altruism Strengthen Altruism?
jefftk (jkaufman) · 2024-01-21T18:50:05.424Z · comments (2)
A Shutdown Problem Proposal
johnswentworth · 2024-01-21T18:12:48.664Z · comments (61)
← previous page (newer posts) · next page (older posts) →