LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

AGI with RL is Bad News for Safety
Nadav Brandes (nadav-brandes) · 2024-12-21T19:36:03.970Z · comments (20)

[link] Can o1-preview find major mistakes amongst 59 NeurIPS '24 MLSB papers?
Abhishaike Mahajan (abhishaike-mahajan) · 2024-12-18T14:21:03.661Z · comments (0)

subfunctional overlaps in attentional selection history implies momentum for decision-trajectories
Emrik (Emrik North) · 2024-12-22T14:12:49.027Z · comments (1)

[link] Being Present is Not a Skill
Chipmonk · 2024-12-18T01:11:04.715Z · comments (6)

[link] A primer on machine learning in cryo-electron microscopy (cryo-EM)
Abhishaike Mahajan (abhishaike-mahajan) · 2024-12-22T15:11:58.860Z · comments (0)

[link] We are in a New Paradigm of AI Progress - OpenAI's o3 model makes huge gains on the toughest AI benchmarks in the world
garrison · 2024-12-22T21:45:52.026Z · comments (3)

[link] Funding Case: AI Safety Camp 11
Remmelt (remmelt-ellen) · 2024-12-23T08:51:55.255Z · comments (0)

Doing Sport Reliably via Dancing
Johannes C. Mayer (johannes-c-mayer) · 2024-12-20T12:06:59.517Z · comments (0)

[link] o3 is not being released to the public. First they are only giving access to external safety testers. You can apply to get early access to do safety testing
KatWoods (ea247) · 2024-12-20T18:30:44.421Z · comments (0)

[link] Don't Associate AI Safety With Activism
Eneasz · 2024-12-18T08:01:50.357Z · comments (15)

[link] The Genesis Project
aproteinengine · 2024-12-19T21:26:51.344Z · comments (0)

I'm Writing a Book About Liberalism
Yoav Ravid · 2024-12-19T00:13:33.895Z · comments (6)

Stop Making Sense
JenniferRM · 2024-12-23T05:16:12.428Z · comments (0)

How I saved 1 human life (in expectation) without overthinking it
Christopher King (christopher-king) · 2024-12-22T20:53:13.492Z · comments (0)

Open Thread Winter 2024/2025
habryka (habryka4) · 2024-12-25T21:02:41.760Z · comments (0)

Robbin's Farm Sledding Route
jefftk (jkaufman) · 2024-12-21T22:10:01.175Z · comments (1)

Mid-Generation Self-Correction: A Simple Tool for Safer AI
MrThink (ViktorThink) · 2024-12-19T23:41:00.702Z · comments (0)

[link] AISN #45: Center for AI Safety 2024 Year in Review
Corin Katzke (corin-katzke) · 2024-12-19T18:15:56.416Z · comments (0)

Living with Rats in College
lsusr · 2024-12-25T10:44:13.085Z · comments (0)

Simple Steganographic Computation Eval - gpt-4o and gemini-exp-1206 can't solve it yet
Filip Sondej · 2024-12-19T15:47:05.512Z · comments (2)

No Internally-Crispy Mac and Cheese
jefftk (jkaufman) · 2024-12-20T03:20:01.798Z · comments (5)

How Much to Give is a Pragmatic Question
jefftk (jkaufman) · 2024-12-24T04:20:01.480Z · comments (1)

[link] My AI timelines
xpostah · 2024-12-22T21:06:41.722Z · comments (2)

Apply now to SPAR!
agucova · 2024-12-19T22:29:58.963Z · comments (0)

Apply to the 2025 PIBBSS Summer Research Fellowship
DusanDNesic · 2024-12-24T10:25:12.882Z · comments (0)

Last Line of Defense: Minimum Viable Shelters for Mirror Bacteria
Ulrik Horn (ulrik-horn) · 2024-12-21T08:28:14.860Z · comments (19)

Do you need a better map of your myriad of maps to the territory?
CstineSublime · 2024-12-24T02:00:30.426Z · comments (2)

Preliminary Thoughts on Flirting Theory
la .alis. (Diatom) · 2024-12-24T07:37:47.045Z · comments (4)

[link] Bird's eye view: An interactive representation to see large collection of text "from above".
Alexandre Variengien (alexandre-variengien) · 2024-12-21T00:15:02.239Z · comments (4)

Exploring the petertodd / Leilan duality in GPT-2 and GPT-J
mwatkins · 2024-12-23T13:17:53.755Z · comments (0)

[link] Ideologies are slow and necessary, for now
Gabriel Alfour (gabriel-alfour-1) · 2024-12-23T01:57:47.153Z · comments (1)

Notes from Copenhagen Secular Solstice 2024
Søren Elverlin (soren-elverlin-1) · 2024-12-22T15:08:20.848Z · comments (0)

Executive Director for AIS France - Expression of interest
gergogaspar (gergo-gaspar) · 2024-12-19T08:14:54.023Z · comments (0)

Panology
JenniferRM · 2024-12-23T21:40:14.540Z · comments (8)

Near- and medium-term AI Control Safety Cases
Martín Soto (martinsq) · 2024-12-23T17:37:48.860Z · comments (0)

[link] Updating on Bad Arguments
Guive (GAA) · 2024-12-21T01:19:15.686Z · comments (2)

What conclusions can be drawn from a single observation about wealth in tennis?
Trevor Cappallo (trevor-cappallo) · 2024-12-18T09:55:34.923Z · comments (3)

Better difference-making views
MichaelStJules · 2024-12-21T18:27:45.552Z · comments (0)

[link] Exploring Cooperation: The Path to Utopia
Davidmanheim · 2024-12-25T18:31:55.565Z · comments (0)

[question] Recommendations on communities that discuss AI applications in society
Annapurna (jorge-velez) · 2024-12-24T13:37:49.821Z · answers+comments (2)

A short critique of Omohundro's "Basic AI Drives"
Soumyadeep Bose (soumyadeep-bose) · 2024-12-19T19:19:52.864Z · comments (0)

Replaceable Axioms give more credence than irreplaceable axioms
Yoav Ravid · 2024-12-20T00:51:13.578Z · comments (2)

Reduce AI Self-Allegiance by saying "he" instead of "I"
Knight Lee (Max Lee) · 2024-12-23T09:32:29.947Z · comments (1)

Printable book of some rationalist creative writing (from Scott A. & Eliezer)
CounterBlunder · 2024-12-23T15:44:31.437Z · comments (0)

[link] Inescapably Value-Laden Experience—a Catchy Term I Made Up to Make Morality Rationalisable
James Stephen Brown (james-brown) · 2024-12-19T04:45:37.906Z · comments (0)

Using LLM Search to Augment (Mathematics) Research
kaleb (geomaturge) · 2024-12-19T18:59:34.391Z · comments (0)

[question] Why is neuron count of human brain relevant to AI timelines?
xpostah · 2024-12-24T05:15:58.839Z · answers+comments (6)

Vision of a positive Singularity
RussellThor · 2024-12-23T02:19:35.050Z · comments (0)

Apply to be a TA for TARA
yanni kyriacos (yanni) · 2024-12-20T02:25:03.514Z · comments (0)

[link] What is compute governance?
Vishakha (vishakha-agrawal) · 2024-12-23T06:32:25.588Z · comments (0)

← previous page (newer posts) · next page (older posts) →

^{^}

Admittedly it's possible that this is totally happening all over the place and people are just covering it up in order to have all of the glory/status for themselves. But I doubt it: there are enough remarkably selfless LLM enthusiasts that if this were happening, I'd expect it would've gone viral already.

LessWrong 2.0 Reader

Archive

Recent comments