LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[question] Why Can’t Sub-AGI Solve AI Alignment? Or: Why Would Sub-AGI AI Not be Aligned?
MrThink (ViktorThink) · 2024-07-02T20:13:24.054Z · answers+comments (23)
Likelihood calculation with duobels
Martin Gerdes (martin-gerdes) · 2024-10-01T16:21:01.268Z · comments (0)
[question] is there a big dictionary somewhere with all your jargon and acronyms and whatnot?
KvmanThinking (avery-liu) · 2024-10-17T11:30:50.937Z · answers+comments (7)
[link] A Logical Proof for the Emergence and Substrate Independence of Sentience
rife (edgar-muniz) · 2024-10-24T21:08:09.398Z · comments (31)
Ways to think about alignment
Abhimanyu Pallavi Sudhir (abhimanyu-pallavi-sudhir) · 2024-10-27T01:40:50.762Z · comments (0)
[question] When do alignment researchers retire?
Jordan Taylor (Nadroj) · 2024-06-25T23:30:25.520Z · answers+comments (2)
[question] What are the primary drivers that caused selection pressure for intelligence in humans?
Towards_Keeperhood (Simon Skade) · 2024-11-07T09:40:20.275Z · answers+comments (15)
Contrapositive Natural Abstraction - Project Intro
Elliot Callender (javanotmocha) · 2024-06-24T18:37:21.761Z · comments (5)
Some Comments on Recent AI Safety Developments
testingthewaters · 2024-11-09T16:44:58.936Z · comments (0)
[link] Social interaction-inspired AI alignment
Chipmonk · 2024-06-24T08:10:08.719Z · comments (2)
Towards a Clever Hans Test: Unmasking Sentience Biases in Chatbot Interactions
glykokalyx · 2024-11-10T22:34:58.956Z · comments (0)
How does generalized accessibility compare to targeted accessibility?
ErioirE (erioire) · 2024-07-17T17:07:09.829Z · comments (0)
[question] What the cost difference in processing input vs. output tokens with LLMs?
kotrfa · 2024-08-08T10:43:18.049Z · answers+comments (10)
Theories With Mentalistic Atoms Are As Validly Called Theories As Theories With Only Non-Mentalistic Atoms
Lorec · 2024-11-12T06:45:26.039Z · comments (4)
Hamiltonian Dynamics in AI: A Novel Approach to Optimizing Reasoning in Language Models
Javier Marin Valenzuela (javier-marin-valenzuela) · 2024-10-09T19:14:56.162Z · comments (0)
Help us seed AI Safety Brussels
gergogaspar (gergo-gaspar) · 2024-08-07T06:32:24.760Z · comments (2)
EAGxBerkeley 2024
Lauriander · 2024-07-15T18:38:16.980Z · comments (0)
Joint mandatory donation as a way to increase the number of donations
Crazy philosopher (commissar Yarrick) · 2024-07-07T10:56:57.222Z · comments (3)
Leverage points for a pause
Remmelt (remmelt-ellen) · 2024-08-28T09:21:17.593Z · comments (0)
Alignment from equivariance
hamishtodd1 · 2024-08-13T21:09:11.849Z · comments (0)
Interview with Bill O’Rourke - Russian Corruption, Putin, Applied Ethics, and More
JohnGreer · 2024-10-27T17:11:28.891Z · comments (0)
On Artificial Wisdom
Jordan Arel · 2024-07-12T00:20:33.241Z · comments (0)
Auckland New Zealand - ACX Meetups Everywhere Fall 2024
Mark Gilmour (mark-gilmour) · 2024-08-29T18:35:31.852Z · comments (0)
For Limited Superintelligences, Epistemic Exclusion is Harder than Robustness to Logical Exploitation
Lorec · 2024-09-15T20:49:06.370Z · comments (9)
Methodology: Contagious Beliefs
James Stephen Brown (james-brown) · 2024-10-19T03:58:17.966Z · comments (0)
On the Practical Applications of Interpretability
Nick Jiang (nick-jiang) · 2024-10-15T17:18:25.280Z · comments (0)
AI development is an act of social revolution
artemiocobb · 2024-07-03T18:00:17.947Z · comments (0)
Computational irreducibility challenges the simulation hypothesis
Clément L · 2024-08-11T16:14:29.655Z · comments (15)
San Francisco ACX Meetup “First Saturday”
Nate Sternberg (nate-sternberg) · 2024-09-29T03:13:34.615Z · comments (0)
Hamburg Germany - ACX Meetups Everywhere Fall 2024
Gunnar (gunnar ) · 2024-08-29T18:37:11.622Z · comments (0)
(draft) Cyborg software should be open (?)
AtillaYasar (atillayasar) · 2024-11-01T07:24:51.966Z · comments (5)
Bayes' Theorem: In Search of Gold (Lesson 1)
bayesyatina · 2024-06-28T08:39:16.638Z · comments (0)
Some desirable properties of automated wisdom
Marius Adrian Nicoară · 2024-07-13T06:05:34.386Z · comments (2)
Recursion in AI is scary. But let’s talk solutions.
Oleg Trott (oleg-trott) · 2024-07-16T20:34:58.580Z · comments (10)
Enabling New Applications with Today's Mechanistic Interpretability Toolkit
ananya_joshi · 2024-10-25T17:53:23.960Z · comments (0)
[link] Motivation Theory
Zero Contradictions · 2024-08-08T05:05:04.741Z · comments (0)
Memetics as an analogy and its implicit connotations
Rachel Shu (wearsshoes) · 2024-06-25T05:13:12.746Z · comments (0)
The AI alignment problem in socio-technical systems from a computational perspective: A Top-Down-Top view and outlook
zhaoweizhang (Zhaowei Zhang) · 2024-07-15T18:56:08.108Z · comments (0)
Playing Minecraft with a Superintelligence
Johannes C. Mayer (johannes-c-mayer) · 2024-08-17T22:47:42.767Z · comments (0)
Vilnius – ACX Meetups Everywhere Fall 2024
NoUsernameSelected · 2024-08-19T17:38:12.378Z · comments (1)
[link] Unnatural abstractions
Aprillion · 2024-08-10T22:31:42.949Z · comments (3)
AI Compute governance: Verifying AI chip location
Farhan · 2024-10-12T17:36:45.942Z · comments (0)
Quantum Immortality: A Perspective if AI Doomers are Probably Right
avturchin · 2024-11-07T16:06:08.106Z · comments (34)
[question] Seeking feedback on a critique of the paperclip maximizer thought experiment
bio neural (bio-neural) · 2024-07-15T18:39:30.545Z · answers+comments (9)
MIT FutureTech are hiring for a Technical Associate role
peterslattery · 2024-09-09T20:16:49.299Z · comments (0)
Near-death experiences
Declan Molony (declan-molony) · 2024-10-08T06:34:04.107Z · comments (1)
Endogenous Growth and Human Intelligence
Nicholas D. (nicholas-d) · 2024-09-18T14:05:54.567Z · comments (0)
[link] Podcast discussing Hanson's Cultural Drift Argument
vaishnav92 · 2024-10-20T17:58:41.416Z · comments (0)
The Potential Impossibility of Subjective Death
VictorLJZ · 2024-07-04T18:17:28.141Z · comments (34)
Keeping content out of LLM training datasets
Ben Millwood (ben-millwood) · 2024-07-18T10:27:27.827Z · comments (0)
← previous page (newer posts) · next page (older posts) →