LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] My Model of Epistemology
adamShimi · 2024-08-31T17:01:45.472Z · comments (0)

Monthly Roundup #22: September 2024
Zvi · 2024-09-17T12:20:08.297Z · comments (10)

[link] On Fables and Nuanced Charts
Niko_McCarty (niko-2) · 2024-09-08T17:09:07.503Z · comments (2)

Open Problems in AIXI Agent Foundations
Cole Wyeth (Amyr) · 2024-09-12T15:38:59.007Z · comments (2)

Book Review: On the Edge: The Gamblers
Zvi · 2024-09-24T11:50:06.065Z · comments (1)

Eye contact is effortless when you’re no longer emotionally blocked on it
Chipmonk · 2024-09-27T21:47:01.970Z · comments (24)

LASR Labs Spring 2025 applications are open!
Erin Robertson · 2024-10-04T13:44:20.524Z · comments (0)

Representation Tuning
Christopher Ackerman (christopher-ackerman) · 2024-06-27T17:44:33.338Z · comments (9)

A sketch of acausal trade in practice
Richard_Ngo (ricraz) · 2024-02-04T00:32:54.622Z · comments (4)

Open consultancy: Letting untrusted AIs choose what answer to argue for
Fabien Roger (Fabien) · 2024-03-12T20:38:03.785Z · comments (5)

[Valence series] 4. Valence & Social Status (deprecated)
Steven Byrnes (steve2152) · 2023-12-15T14:24:41.040Z · comments (19)

Predictive model agents are sort of corrigible
Raymond D · 2024-01-05T14:05:03.037Z · comments (6)

Proposal for improving the global online discourse through personalised comment ordering on all websites
Roman Leventov · 2023-12-06T18:51:37.645Z · comments (21)

Secondary Risk Markets
Vaniver · 2023-12-11T21:52:46.836Z · comments (4)

List of strategies for mitigating deceptive alignment
joshc (joshua-clymer) · 2023-12-02T05:56:50.867Z · comments (2)

What Helped Me - Kale, Blood, CPAP, X-tiamine, Methylphenidate
Johannes C. Mayer (johannes-c-mayer) · 2024-01-03T13:22:11.700Z · comments (12)

'Theories of Values' and 'Theories of Agents': confusions, musings and desiderata
Mateusz Bagiński (mateusz-baginski) · 2023-11-15T16:00:48.926Z · comments (8)

Open Thread – Winter 2023/2024
habryka (habryka4) · 2023-12-04T22:59:49.957Z · comments (160)

[link] AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks
aogara (Aidan O'Gara) · 2023-10-31T19:34:54.837Z · comments (1)

Empirical vs. Mathematical Joints of Nature
Elizabeth (pktechgirl) · 2024-06-26T01:55:22.858Z · comments (1)

How I select alignment research projects
Ethan Perez (ethan-perez) · 2024-04-10T04:33:08.092Z · comments (4)

Humans aren't fleeb.
Charlie Steiner · 2024-01-24T05:31:46.929Z · comments (5)

Forecasting AI (Overview)
jsteinhardt · 2023-11-16T19:00:04.218Z · comments (0)

AXRP Episode 33 - RLHF Problems with Scott Emmons
DanielFilan · 2024-06-12T03:30:05.747Z · comments (0)

Linear encoding of character-level information in GPT-J token embeddings
mwatkins · 2023-11-10T22:19:14.654Z · comments (4)

Monthly Roundup #12: November 2023
Zvi · 2023-11-14T15:20:06.926Z · comments (5)

[link] Why Yudkowsky is wrong about "covalently bonded equivalents of biology"
titotal (lombertini) · 2023-12-06T14:09:15.402Z · comments (40)

[link] Suffering Is Not Pain
jbkjr · 2024-06-18T18:04:43.407Z · comments (45)

CHAI internship applications are open (due Nov 13)
Erik Jenner (ejenner) · 2023-10-26T00:53:49.640Z · comments (0)

Adam Smith Meets AI Doomers
James_Miller · 2024-01-31T15:53:03.070Z · comments (10)

Reflective consistency, randomized decisions, and the dangers of unrealistic thought experiments
Radford Neal · 2023-12-07T03:33:16.149Z · comments (25)

D&D.Sci (Easy Mode): On The Construction Of Impossible Structures
abstractapplic · 2024-05-17T00:25:42.950Z · comments (12)

Copyright Confrontation #1
Zvi · 2024-01-03T15:50:04.850Z · comments (7)

Unpicking Extinction
ukc10014 · 2023-12-09T09:15:41.291Z · comments (10)

[link] math terminology as convolution
bhauth · 2023-10-30T01:05:11.823Z · comments (1)

Wireheading and misalignment by composition on NetHack
pierlucadoro · 2023-10-27T17:43:41.727Z · comments (4)

[link] Inferring the model dimension of API-protected LLMs
Ege Erdil (ege-erdil) · 2024-03-18T06:19:25.974Z · comments (3)

Intransitive Trust
Screwtape · 2024-05-27T16:55:29.294Z · comments (15)

[question] If I have some money, whom should I donate it to in order to reduce expected P(doom) the most?
KvmanThinking (avery-liu) · 2024-10-03T11:31:19.974Z · answers+comments (34)

[link] My Apartment Art Commission Process
jenn (pixx) · 2024-08-26T18:36:44.363Z · comments (4)

Augmenting Statistical Models with Natural Language Parameters
jsteinhardt · 2024-09-20T18:30:10.816Z · comments (0)

Video and transcript of presentation on Otherness and control in the age of AGI
Joe Carlsmith (joekc) · 2024-10-08T22:30:38.054Z · comments (1)

ARENA4.0 Capstone: Hyperparameter tuning for MELBO + replication on Llama-3.2-1b-Instruct
25Hour (aaron-kaufman) · 2024-10-05T11:30:11.953Z · comments (2)

(Maybe) A Bag of Heuristics is All There Is & A Bag of Heuristics is All You Need
Sodium · 2024-10-03T19:11:58.032Z · comments (17)

How to develop a photographic memory 1/3
PhilosophicalSoul (LiamLaw) · 2023-12-28T13:26:36.669Z · comments (6)

[link] GPT2, Five Years On
Joel Burget (joel-burget) · 2024-06-05T17:44:17.552Z · comments (0)

[link] Robin Hanson & Liron Shapira Debate AI X-Risk
Liron · 2024-07-08T21:45:40.609Z · comments (4)

[link] The last era of human mistakes
owencb · 2024-07-24T09:58:42.116Z · comments (2)

[link] AI governance needs a theory of victory
Corin Katzke (corin-katzke) · 2024-06-21T16:15:46.560Z · comments (6)

What I Learned (Conclusion To "The Sense Of Physical Necessity")
LoganStrohl (BrienneYudkowsky) · 2024-03-20T21:24:37.464Z · comments (0)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

sharmake-farah on Big tech transitions are slow (with implications for AI)

IMO, a lot of basic cruxes for differing views on the impact of AI in the 21st century ultimately depend on the question "Can AI be a substitute for the majority of economically relevant tasks a human does, and then become a substitute for any new industry?"

If the answer is yes, a lot of the more radical worldviews become on the table. If the answer is no, then I'd probably agree with a lot of the more moderate views on AI impacts.

Indeed, I'd argue AI as substitute for basically all human tasks that are relevant to the economy should replace the AGI notion often flown around, since it's more clear and provides less opportunities for motte and balieys and other bad arguments often thrown around.

notfnofn on A Logical Proof for the Emergence and Substrate Independence of Sentience

note that in the setting of the second paragraph I wrote, every "firing pattern" will eventually emerge. You may have misunderstood my comment as taking the basic premise of your post as true and quibbling about the details, but I am skeptical about even the fundamental idea

viliam on cryonics is a pascal's mugging?

Words "Pascal's mugging" are used when the change of success is extremely, almost infinitely small (but the action is defended by a supposed practically infinite reward). Founding a startup is not Pascal's mugging, even if most startups fail. Buying a lottery ticket is not Pascal's mugging, even if only one ticket in a thousand or in a million wins.

So basically, calling cryonics a Pascal's mugging means that you believe that the changes are much smaller than one in a million. Why would you believe that?

I think we had a survey here a few years ago, and the people who believe in cryonics give it like 15% chance on average; they just think it's worth it. As an analogy, imagine dying from some incurable disease, and then someone offers you a pill that has a 15% chance to save you. You probably would want to spend a non-zero amount of money on such pill.

Yes, cryonics requires some sci-fi levels of technology. However, the world around us already is a sci-fi world from the perspective of someone who lived 100 years ago, and a completely fairy tale from the perspective of someone who lived 200 or 500 years ago. It doesn't seem implausible to assume the same about the future in 100 or more years.

Reviving frozen people does not seem to contradict the science as we know it. It seems mostly like a technological problem, and the future will probably contain better technology.

On the other hand, some things such as the speed of light or the second law of thermodynamics seem like hard limits that even the future people will not be able to overcome. Which suggests that the decomposed bodies in the graves will probably not be revived, even using a sci-fi technology. (Maybe there is a chance that something like time travel will be invented, but that chance seems much smaller that the chance of fixing some frozen cells.)

df-fd on Is the Power Grid Sustainable?

I was under the impression that the biggest cost of grid electricity is stability, that is most of the time the price charged on consumer is much [i.e. about 2x] higher than the average cost on the grid market, but occasionally the grid market price would go up astronomically [ say 1000x] for brief periods of time [say hours], and the household consumer would be insulated from that. I thought that something similar happened in Texas when a cold snap happened?

if you are confident that your battery can hold you over those crunch period I assume you can just import grid energy at grid market price cheaper than the solar can provide [currently you can get paid 0.03/kwh for using electricty at peak solar here is Sydney]. I mean your solar, no matter how cheap, can not beat being given money. or so was the result last I did the math in Australia.

Actually I don't have the number now but the calculation I did suggested that running solar but using the grid as a battery is more cost effective than running your own battery, but my result may not generalise.

I am suprise you can get gas so cheap where you are, in Sydney the cost of electricity is similar to you 0.33/kwh but gas is 0.17/kwh. Have you check if you are receiving some subsidies for it?

viliam on What is malevolence? On the nature, measurement, and distribution of dark traits

In theory, the reward for doing good should be prestige. (Which in turn may translate to more tangible rewards.) But that mostly works in small groups and doesn't scale well.

Some aspect of this seems like a coordination problem. Whatever is your personal definition of "good", you would probably approve of a system that gives good people some kind of prestige, at least among other good people.

For example, people may disagree about whether veganism is good or bad, but from a perspective of a vegan, it would be nice if vegans could have some magical "vegan mark" that would be unfalsifiable and immediately visible to other vegans. That way, you could promote your values not just by practicing and preaching your values, but also by rewarding other people who practice the same values. (For example, if you sell some products, you could give discounts to vegans. If many people start doing that, veganism may become more popular. Perhaps some people would criticize that as doing things for the wrong reasons, but the animals probably wouldn't mind.) Similarly, effective altruists would approve of rewarding effective altruists, open source developers would approve of rewarding open source developers, etc.

These things exist to some degree (e.g. the open source developers can put a link to their projects in a profile), but often the existing solutions don't scale well. If you only have dozen effective altruists, they know each other by name, but if you get thousands, this stops working.

One problem here is the association of "good" with "unselfish" and "non-judgmental", which suggests that good people rewarding other good people is somehow... bad? In my opinion, we need to rethink that, because from the perspective of incentives and reinforcement, that is utterly stupid. The reason for these memes is that the past attempts to reward good often led to... people optimizing to be seen as good, rather than actually being good. That is a serious problem that I don't know how to solve; I just have a strong feeling that going to the opposite extreme is not the right answer.

kabir-kumar on The Rocket Alignment Problem

personally, I found how Beth just kept saying 'not really' and not saying the actual physics very very annoying.

mondsemmel on Big tech transitions are slow (with implications for AI)

Worldwide sentiment is pretty against immigration nowadays. Not that it will happen, but imagine if anti-immigration sentiment could be marshalled into a worldwide ban on AI development and deployment. That would be a strange, strange timeline.

kabir-kumar on Are we dropping the ball on Recommendation AIs?

Yup, I think research that studies the effect of recommendation algorithms on the brain, from various social media platforms and compares them to the effects of narcotics, would be extremely useful.
I think we're really really lacking in decent legislation for recommendation algorithms atm - at the absolute bare minimum, platforms which use very addictive algorithms should have some kind of warning label informing users of the possibility of addiction - similarly to cigarettes - so that parents know clearly what might happen to their children.
This is going to be even more important as things like character.ai grow.

sil-ver on A Logical Proof for the Emergence and Substrate Independence of Sentience

Nice; I think we're on the same page now. And fwiw, I agree (except that I think you need just a little more than just "fire at the same time"). But yes, if the artificial neurons affect the electromagnetic field in the same way -- so not only fire at the same time, but with precisely the same strength, and also have the same level of charge when they're not firing -- then this should preserve both communication via synaptic connections and gap junctions, as well as any potential non-local ephaptic coupling or brain wave shenanigans, and therefore, the change to the overall behavior of the brain will be so minimal that it shouldn't affect its consciousness. (And note that concerns the brain's entire behavior, i.e., the algorithm it's running, not just its input/output map.)

If you want to work more on this topic, I would highly recommend trying to write a proof for why simulations of humans on digital computers must also be conscious -- which, as I said in the other thread, I think is harder than the proof you've given here. Like, try to figure out exactly what assumptions you do and do not require -- both assumptions about how consciousness works and how the brain works -- and try to be as formal/exact as possible. I predict that actually trying to do this will lead to genuine insights at unexpected places. No one has ever attempted this on LW (or at least there were no attempts that are any good),^[1] so this would be a genuinely novel post.

I'm claiming this based on having read every post with the consciousness tag -- so I guess it's possible that someone has written something like this and didn't tag it, and I've just never seen it. ↩︎

jkaufman on Is the Power Grid Sustainable?

If the cost of power generation were the main contributor to the overall cost of the system then I think you'd be right: economies of scale and the ability to generate in cheap places and sell in expensive places would do a lot to keep people on the grid. But looking at my bill (footnote [1]) the non-generation costs are high enough that if current trends continue that should flip; see my response to cata, above [LW(p) · GW(p)].