LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] The limits of black-box evaluations: two hypotheticals
TFD · 2025-04-11T20:45:45.693Z · comments (0)

Join Us for the Memory Decoding Journal Club!
Devin Ward (Carboncopies Foundation) · 2025-04-04T17:13:11.884Z · comments (0)

Memory Decoding Journal Club: A collaboration of the Carboncopies Foundation and BPF Aspirational Neuroscience
Devin Ward (Carboncopies Foundation) · 2025-04-05T20:27:43.225Z · comments (0)

Why Bigger Models Generalize Better
PapersToAGI (nee-1) · 2025-04-11T19:54:51.423Z · comments (0)

LLM Alignment Experiment: Effect of roles and optimism on probabilities
sjay8 · 2025-04-02T17:44:47.207Z · comments (0)

Machines of Stolen Grace
Riley Tavassoli (riley-tavassoli) · 2025-03-27T18:15:23.736Z · comments (0)

On different discussion traditions
Eugene Shcherbinin (eugene-shcherbinin-1) · 2025-04-07T23:00:36.132Z · comments (0)

FlexChunk: Enabling 100M×100M Out-of-Core SpMV (~1.8 min, ~1.7 GB RAM) with Near-Linear Scaling
Daniil Strizhov (mila-dolontaeva) · 2025-04-06T05:27:06.271Z · comments (0)

Thoughts on Creating a Good Language
Towards_Keeperhood (Simon Skade) · 2025-04-06T15:57:57.688Z · comments (2)

An artistic illustration of Scalable Oversight - "A world apart, neither gods nor mortals"
Marius Adrian Nicoară · 2025-04-16T12:41:44.874Z · comments (0)

Text First, Evidence Later? Managing Quality and Trust in an Era of AI-Augmented Research
Thehumanproject.ai · 2025-04-10T18:52:58.934Z · comments (1)

Astral Codex Ten meetup in Kaliningrad (Russia)
YanLyutnev (YanLutnev) · 2025-04-01T14:07:53.296Z · comments (1)

[link] Understanding and overcoming AGI apathy
Dhruv Sumathi (dhruv-sumathi) · 2025-04-17T01:04:53.853Z · comments (0)

[question] How familiar is the Lesswrong community as a whole with the concept of Reward-modelling?
Oxidize · 2025-04-09T23:33:18.044Z · answers+comments (8)

A Platform for Falsifiable Conjectures and Public Refutation — Would This Be Useful?
PetrusNonius · 2025-04-08T21:09:23.819Z · comments (1)

[link] Doing Prioritization Better
arvomm (arvo-munoz) · 2025-04-16T18:46:41.797Z · comments (0)

How do SAE Circuits Fail? A Case Study Using a Starts-with-'E' Letter Detection Task
adsingh-64 · 2025-03-30T00:47:18.711Z · comments (0)

Sam Altman's sister claims Sam sexually abused her -- Part 5: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-14T01:00:07.084Z · comments (0)

Applications Open for Impact Accelerator Program for Experienced Professionals
Clark Wisenbaker (accounts-hip) · 2025-04-14T16:27:32.340Z · comments (0)

Sam Altman's sister claims Sam sexually abused her -- Part 4: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-13T23:41:55.411Z · comments (0)

[link] Digital Error Correction and Lock-In
alamerton · 2025-04-08T15:46:31.602Z · comments (0)

[question] How many times faster can the AGI advance the science than humans do?
StanislavKrym · 2025-03-28T15:16:52.320Z · answers+comments (0)

Religious Persistence: A Missing Primitive for Robust Alignment
lauriewired · 2025-04-14T22:03:45.868Z · comments (3)

Correcting Deceptive Alignment using a Deontological Approach
JeaniceK · 2025-04-14T22:07:57.860Z · comments (0)

[link] Nihilism Is Not Enough By Peter Thiel
shawkisukkar · 2025-04-15T00:13:01.375Z · comments (0)

A Fraction of Global Market Capitalization as the Best Currency
Greenless Mirror (mikhail-2) · 2025-03-31T13:30:03.970Z · comments (25)

Routine Novelty
BazingaBoy (martin-nenov) · 2025-03-31T15:47:05.217Z · comments (0)

Does the universe's recognition of measurement provide stronger evidence for being in a simulation than universal fine-tuning?
amelia (314159) · 2025-04-09T08:20:10.561Z · comments (0)

AI Needs Us? Information Theory and Humans as data
tomdekan (tomd@hey.com) · 2025-03-29T15:51:16.070Z · comments (6)

Do we want too much from a potentially godlike AGI?
StanislavKrym · 2025-04-11T23:33:06.710Z · comments (0)

Debunk the myth -Testing the generalized reasoning ability of LLM
Defender7762 · 2025-04-11T20:17:02.956Z · comments (6)

[link] Rethinking Friction: Equity and Motivation Across Domains
eltimbalino · 2025-04-08T03:58:02.839Z · comments (0)

On Downvotes, Cultural Fit, and Why I Won’t Be Posting Again
funnyfranco · 2025-03-31T19:26:27.090Z · comments (32)

Alignment through atomic agents
micseydel · 2025-03-27T18:43:14.569Z · comments (0)

[link] find_purpose.exe
heatdeathandtaxes · 2025-04-12T19:31:38.951Z · comments (0)

An argument for asexuality
filthy_hedonist (sid-kolichala) · 2025-03-27T18:08:48.624Z · comments (10)

[link] The Cynic Wasps in the Beehive
mempko · 2025-04-12T19:30:44.227Z · comments (0)

Would this solve the (outer) alignment problem, or at least help?
Wes R · 2025-04-06T18:49:14.145Z · comments (1)

A Solution to Sandbagging and other Self-Provable Misalignment: Constitutional AI Detectives
Knight Lee (Max Lee) · 2025-04-14T10:27:24.903Z · comments (2)

Will the AGIs be able to run the civilisation?
StanislavKrym · 2025-03-28T04:50:07.568Z · comments (2)

[question] Is the ethics of interaction with primitive peoples already solved?
StanislavKrym · 2025-04-11T14:56:21.306Z · answers+comments (0)

[link] Six reasons why objective morality is nonsense
Zero Contradictions · 2025-04-11T02:11:04.775Z · comments (6)

A New Challenge to all Bayesians!
milanrosko · 2025-04-02T02:38:35.562Z · comments (0)

Reframing AI Safety Through the Lens of Identity Maintenance Framework
Hiroshi Yamakawa (hiroshi-yamakawa) · 2025-04-01T06:16:45.228Z · comments (0)

How to defeat superintelligence, the Sta-Hi way
kilgoar (william-walshe) · 2025-04-09T13:58:59.541Z · comments (0)

Debunking the Hard Problem: Consciousness as Integrated Prediction
gmax (maxim-gurevich) · 2025-04-15T08:38:50.637Z · comments (8)

Ai Cone of Probabilties - what aren't we talking about?
Marzipan · 2025-04-05T05:51:27.859Z · comments (5)

Karel Čapek’s 'War with the Newts' 1936 review
Petr 'Margot' Andreev (petr-andreev) · 2025-04-04T23:12:39.572Z · comments (1)

Null Rationalism
kilgoar (william-walshe) · 2025-04-05T03:26:06.034Z · comments (0)

Insect Suffering Is The Biggest Issue: What To Do About It
omnizoid · 2025-04-01T12:51:08.115Z · comments (9)

← previous page (newer posts) · next page (older posts) →

^{^}

Personal anecdote: I won a wager with a school friend who got a job in an EV start-up after a decent career in IT and disagreed with me

LessWrong 2.0 Reader

Archive

Recent comments