LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] The limits of black-box evaluations: two hypotheticals
TFD · 2025-04-11T20:45:45.693Z · comments (0)
Join Us for the Memory Decoding Journal Club!
Devin Ward (Carboncopies Foundation) · 2025-04-04T17:13:11.884Z · comments (0)
Memory Decoding Journal Club: A collaboration of the Carboncopies Foundation and BPF Aspirational Neuroscience
Devin Ward (Carboncopies Foundation) · 2025-04-05T20:27:43.225Z · comments (0)
Why Bigger Models Generalize Better
PapersToAGI (nee-1) · 2025-04-11T19:54:51.423Z · comments (0)
LLM Alignment Experiment: Effect of roles and optimism on probabilities
sjay8 · 2025-04-02T17:44:47.207Z · comments (0)
Machines of Stolen Grace
Riley Tavassoli (riley-tavassoli) · 2025-03-27T18:15:23.736Z · comments (0)
On different discussion traditions
Eugene Shcherbinin (eugene-shcherbinin-1) · 2025-04-07T23:00:36.132Z · comments (0)
FlexChunk: Enabling 100M×100M Out-of-Core SpMV (~1.8 min, ~1.7 GB RAM) with Near-Linear Scaling
Daniil Strizhov (mila-dolontaeva) · 2025-04-06T05:27:06.271Z · comments (0)
Thoughts on Creating a Good Language
Towards_Keeperhood (Simon Skade) · 2025-04-06T15:57:57.688Z · comments (2)
An artistic illustration of Scalable Oversight - "A world apart, neither gods nor mortals"
Marius Adrian Nicoară · 2025-04-16T12:41:44.874Z · comments (0)
Text First, Evidence Later? Managing Quality and Trust in an Era of AI-Augmented Research
Thehumanproject.ai · 2025-04-10T18:52:58.934Z · comments (1)
Astral Codex Ten meetup in Kaliningrad (Russia)
YanLyutnev (YanLutnev) · 2025-04-01T14:07:53.296Z · comments (1)
[link] Understanding and overcoming AGI apathy
Dhruv Sumathi (dhruv-sumathi) · 2025-04-17T01:04:53.853Z · comments (0)
[question] How familiar is the Lesswrong community as a whole with the concept of Reward-modelling?
Oxidize · 2025-04-09T23:33:18.044Z · answers+comments (8)
A Platform for Falsifiable Conjectures and Public Refutation — Would This Be Useful?
PetrusNonius · 2025-04-08T21:09:23.819Z · comments (1)
[link] Doing Prioritization Better
arvomm (arvo-munoz) · 2025-04-16T18:46:41.797Z · comments (0)
How do SAE Circuits Fail? A Case Study Using a Starts-with-'E' Letter Detection Task
adsingh-64 · 2025-03-30T00:47:18.711Z · comments (0)
Sam Altman's sister claims Sam sexually abused her -- Part 5: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-14T01:00:07.084Z · comments (0)
Applications Open for Impact Accelerator Program for Experienced Professionals
Clark Wisenbaker (accounts-hip) · 2025-04-14T16:27:32.340Z · comments (0)
Sam Altman's sister claims Sam sexually abused her -- Part 4: Timeline, continued
pythagoras5015 (pl5015) · 2025-04-13T23:41:55.411Z · comments (0)
[link] Digital Error Correction and Lock-In
alamerton · 2025-04-08T15:46:31.602Z · comments (0)
[question] How many times faster can the AGI advance the science than humans do?
StanislavKrym · 2025-03-28T15:16:52.320Z · answers+comments (0)
Religious Persistence: A Missing Primitive for Robust Alignment
lauriewired · 2025-04-14T22:03:45.868Z · comments (3)
Correcting Deceptive Alignment using a Deontological Approach
JeaniceK · 2025-04-14T22:07:57.860Z · comments (0)
[link] Nihilism Is Not Enough By Peter Thiel
shawkisukkar · 2025-04-15T00:13:01.375Z · comments (0)
A Fraction of Global Market Capitalization as the Best Currency
Greenless Mirror (mikhail-2) · 2025-03-31T13:30:03.970Z · comments (25)
Routine Novelty
BazingaBoy (martin-nenov) · 2025-03-31T15:47:05.217Z · comments (0)
Does the universe's recognition of measurement provide stronger evidence for being in a simulation than universal fine-tuning?
amelia (314159) · 2025-04-09T08:20:10.561Z · comments (0)
AI Needs Us? Information Theory and Humans as data
tomdekan (tomd@hey.com) · 2025-03-29T15:51:16.070Z · comments (6)
Do we want too much from a potentially godlike AGI?
StanislavKrym · 2025-04-11T23:33:06.710Z · comments (0)
Debunk the myth -Testing the generalized reasoning ability of LLM
Defender7762 · 2025-04-11T20:17:02.956Z · comments (6)
[link] Rethinking Friction: Equity and Motivation Across Domains
eltimbalino · 2025-04-08T03:58:02.839Z · comments (0)
On Downvotes, Cultural Fit, and Why I Won’t Be Posting Again
funnyfranco · 2025-03-31T19:26:27.090Z · comments (32)
Alignment through atomic agents
micseydel · 2025-03-27T18:43:14.569Z · comments (0)
[link] find_purpose.exe
heatdeathandtaxes · 2025-04-12T19:31:38.951Z · comments (0)
An argument for asexuality
filthy_hedonist (sid-kolichala) · 2025-03-27T18:08:48.624Z · comments (10)
[link] The Cynic Wasps in the Beehive
mempko · 2025-04-12T19:30:44.227Z · comments (0)
Would this solve the (outer) alignment problem, or at least help?
Wes R · 2025-04-06T18:49:14.145Z · comments (1)
A Solution to Sandbagging and other Self-Provable Misalignment: Constitutional AI Detectives
Knight Lee (Max Lee) · 2025-04-14T10:27:24.903Z · comments (2)
Will the AGIs be able to run the civilisation?
StanislavKrym · 2025-03-28T04:50:07.568Z · comments (2)
[question] Is the ethics of interaction with primitive peoples already solved?
StanislavKrym · 2025-04-11T14:56:21.306Z · answers+comments (0)
[link] Six reasons why objective morality is nonsense
Zero Contradictions · 2025-04-11T02:11:04.775Z · comments (6)
A New Challenge to all Bayesians!
milanrosko · 2025-04-02T02:38:35.562Z · comments (0)
Reframing AI Safety Through the Lens of Identity Maintenance Framework
Hiroshi Yamakawa (hiroshi-yamakawa) · 2025-04-01T06:16:45.228Z · comments (0)
How to defeat superintelligence, the Sta-Hi way
kilgoar (william-walshe) · 2025-04-09T13:58:59.541Z · comments (0)
Debunking the Hard Problem: Consciousness as Integrated Prediction
gmax (maxim-gurevich) · 2025-04-15T08:38:50.637Z · comments (8)
Ai Cone of Probabilties - what aren't we talking about?
Marzipan · 2025-04-05T05:51:27.859Z · comments (5)
Karel Čapek’s 'War with the Newts' 1936 review
Petr 'Margot' Andreev (petr-andreev) · 2025-04-04T23:12:39.572Z · comments (1)
Null Rationalism
kilgoar (william-walshe) · 2025-04-05T03:26:06.034Z · comments (0)
Insect Suffering Is The Biggest Issue: What To Do About It
omnizoid · 2025-04-01T12:51:08.115Z · comments (9)
← previous page (newer posts) · next page (older posts) →