LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Using LLM Search to Augment (Mathematics) Research
kaleb (geomaturge) · 2024-12-19T18:59:34.391Z · comments (0)
[link] Inescapably Value-Laden Experience—a Catchy Term I Made Up to Make Morality Rationalisable
James Stephen Brown (james-brown) · 2024-12-19T04:45:37.906Z · comments (0)
Logic vs intuition <=> algorithm vs ML
pchvykov · 2025-01-04T09:06:51.822Z · comments (0)
Speedrunning Rationality: Day I
aproteinengine · 2025-01-04T14:28:49.220Z · comments (0)
[link] World Models I'm Currently Building
temporary · 2024-12-15T16:29:08.287Z · comments (1)
[link] How to Edit an Essay into a Solstice Speech?
Czynski (JacobKopczynski) · 2024-12-15T04:30:50.545Z · comments (1)
Why empiricists should believe in AI risk
Knight Lee (Max Lee) · 2024-12-11T03:51:17.979Z · comments (0)
[question] Has Anthropic checked if Claude fakes alignment for intended values too?
Maloew (maloew-valenar) · 2024-12-23T00:43:07.490Z · answers+comments (1)
Good Fortune and Many Worlds
Jonah Wilberg (jrwilb@googlemail.com) · 2024-12-27T13:21:43.142Z · comments (0)
Grokking revisited: reverse engineering grokking modulo addition in LSTM
Nikita Khomich (nikitoskh) · 2024-12-16T18:48:43.533Z · comments (0)
Activation Magnitudes Matter On Their Own: Insights from Language Model Distributional Analysis
Matt Levinson · 2025-01-10T06:53:02.228Z · comments (0)
Vision of a positive Singularity
RussellThor · 2024-12-23T02:19:35.050Z · comments (0)
Dishbrain and implications.
RussellThor · 2024-12-29T10:42:43.912Z · comments (0)
ARC-AGI is a genuine AGI test but o3 cheated :(
Knight Lee (Max Lee) · 2024-12-22T00:58:05.447Z · comments (6)
Linkpost: Look at the Water
J Bostock (Jemist) · 2024-12-30T19:49:04.107Z · comments (3)
Investing in Robust Safety Mechanisms is critical for reducing Systemic Risks
Tom DAVID (tom-david) · 2024-12-11T13:37:24.177Z · comments (3)
Thoughts on the In-Context Scheming AI Experiment
ExCeph · 2025-01-09T02:19:09.558Z · comments (0)
Is AI Alignment Enough?
Aram Panasenco (panasenco) · 2025-01-10T18:57:48.409Z · comments (2)
Some implications of radical empathy
MichaelStJules · 2025-01-07T16:10:16.755Z · comments (0)
[question] How do we quantify non-philanthropic contributions from Buffet and Soros?
Philosophistry (philip-dhingra) · 2024-12-20T22:50:32.260Z · answers+comments (0)
[link] Solving Newcomb's Paradox In Real Life
Alice Wanderland (alice-wanderland) · 2024-12-11T19:48:44.486Z · comments (0)
[question] How should I optimize my decision making model for 'ideas'?
CstineSublime · 2024-12-18T04:09:58.025Z · answers+comments (0)
[question] 2025 Alignment Predictions
anaguma · 2025-01-02T05:37:36.912Z · answers+comments (3)
ACI#9: What is Intelligence
Akira Pyinya · 2024-12-09T21:54:41.077Z · comments (0)
5. Uphold Voluntarism: Digital Defense
Allison Duettmann (allison-duettmann) · 2025-01-02T19:05:33.963Z · comments (0)
3. Improve Cooperation: Better Technologies
Allison Duettmann (allison-duettmann) · 2025-01-02T19:03:16.588Z · comments (2)
[question] Are Sparse Autoencoders a good idea for AI control?
Gerard Boxo (gerard-boxo) · 2024-12-26T17:34:55.617Z · answers+comments (2)
[question] How do you decide to phrase predictions you ask of others? (and how do you make your own?)
CstineSublime · 2025-01-10T02:44:26.737Z · answers+comments (0)
[link] Independent research article analyzing consistent self-reports of experience in ChatGPT and Claude
rife (edgar-muniz) · 2025-01-06T17:34:01.505Z · comments (9)
[link] What is Confidence—in Game Theory and Life?
James Stephen Brown (james-brown) · 2024-12-10T23:06:24.072Z · comments (0)
You are too dumb to understand insurance
Lorec · 2025-01-09T23:33:53.778Z · comments (7)
[link] The Golden Opportunity for American AI
Annapurna (jorge-velez) · 2025-01-04T10:26:05.430Z · comments (2)
Algorithmic Asubjective Anthropics, Cartesian Subjective Anthropics
Lorec · 2024-12-27T01:58:39.880Z · comments (0)
Launching Third Opinion: Anonymous Expert Consultation for AI Professionals
karl (oaisis) · 2024-12-19T19:06:15.355Z · comments (0)
Can we have Epiphanies and Eureka moments more frequently?
CstineSublime · 2025-01-08T02:20:26.897Z · comments (0)
Introducing Avatarism: A Rational Framework for Building actual Heaven
ratiba ro (ratiba-ro) · 2024-12-15T17:17:45.440Z · comments (2)
Reminder: AI Safety is Also a Behavioral Economics Problem
zoop · 2024-12-20T01:40:53.847Z · comments (0)
Keeping self-replicating nanobots in check
Knight Lee (Max Lee) · 2024-12-09T05:25:45.898Z · comments (4)
Towards a Unified Interpretability of Artificial and Biological Neural Networks
jan_bauer · 2024-12-21T23:10:45.842Z · comments (0)
The Type of Writing that Pushes Women Away
Dahlia (sdjfhkj-dkjfks) · 2025-01-08T18:54:52.070Z · comments (3)
How Your Physiology Affects the Mind's Projection Fallacy
YanLyutnev (YanLutnev) · 2024-12-14T21:10:23.240Z · comments (0)
The Technist Reformation: A Discussion with o1 About The Coming Economic Event Horizon
Yuli_Ban · 2024-12-11T02:34:22.329Z · comments (1)
I Recommend More Training Rationales
Gianluca Calcagni (gianluca-calcagni) · 2024-12-31T14:06:44.007Z · comments (0)
A Systematic Approach to AI Risk Analysis Through Cognitive Capabilities
Tom DAVID (tom-david) · 2025-01-09T00:18:04.608Z · comments (0)
Walking Sue
Matthew McRedmond (matthew-mcredmond) · 2024-12-18T13:19:41.575Z · comments (5)
The CARLIN Method: Teaching AI How to Be Genuinely Funny
Greg Robison (grobison) · 2024-12-09T21:51:05.504Z · comments (0)
Gothenburg LW / ACX meetup
Stefan (stefan-1) · 2025-01-08T21:39:18.309Z · comments (0)
[link] What are polysemantic neurons?
Vishakha (vishakha-agrawal) · 2025-01-08T07:35:42.758Z · comments (0)
Duplicate token neurons in the first layer of gpt2-small
Alex Gibson · 2024-12-27T04:21:55.896Z · comments (0)
[link] The Economics & Practicality of Starting Mars Colonization
Zero Contradictions · 2024-12-26T10:56:26.019Z · comments (1)
← previous page (newer posts) · next page (older posts) →