We Can Build Compassionate AI
post by Gordon Seidoh Worley (gworley) · 2025-02-25T16:37:06.160Z · LW · GW · 1 comment
This is a link post for https://uncertainupdates.substack.com/p/we-can-build-compassionate-ai
Compassion is, roughly speaking, caring for others and wanting the best for them.
Claim: We can build AI that are compassionate.
The above definition is insufficiently precise to construct an objective function for an RL training run that won't Goodhart, but it's good enough to argue that compassionate AI is possible.
We should not expect AI to be automatically compassionate, though, because most things in the world don't act compassionately. Plants and rocks and things just do what they are going to do with no regard for anything else, except insofar as those other things get in their way. This is roughly what AI is like now.
Animals have a little more capacity for compassion, with some animals, like mammals and birds, having a decent amount. That's because they evolved to cooperate, and while cooperation isn't compassion, it is a starting point for developing it.
Some animals are really good at cooperating, specifically at cooperating within a lifetime: they learn to cooperate in complex ways they weren't born knowing how to do. Our closest relatives, the great apes, show perhaps some of the greatest ability to cooperate by adapting to their conditions, and they, along with several other animals, show signs of compassion.
But sometimes compassionate animals are brutal. Compassion is something they can express, but it does not dominate their behavior. A chimp might show care and concern one minute, then murder a monkey the next.
We humans are much like our primate relatives when it comes to compassion. The anthropological record is pretty clear on that. The default state of humans is to live in small bands and to be at constant war. Yet somehow we now live in a world full of people that care about each other. How did we get here?
We can't know for certain, but it seems likely that horticulture led to agriculture, which led to increasingly larger numbers of people being forced to cooperate to survive. This kickstarted a 10,000-year process of exponentially increasing populations, and thus exponentially increasing cooperation. This increasing cooperation set the stage for increasing amounts of compassion.
Then, about 2,500 years ago, a few people realized that widespread cooperation was not enough to create the kind of world they wanted to live in—one where everyone is safe and flourishes. They realized they could get people to do more than cooperate: they could get them to care about each other beyond their own self-interest. And thus we saw, in the Axial Age, the rise of universal religions and philosophies that, among other things, taught compassion.
Now, I'm not going to claim that universal religions have been a purely good force in the world. But for the last couple of millennia, universal religions have been the main way we've trained people to be more compassionate. We've even created whole classes of professionally compassionate humans whose job is to demonstrate and teach compassion.
Religions have been able to teach compassion because we have general intelligence that can be applied to the problem of modeling other minds. It allows us to think about what it would be like to be another person, and then model actions that would be to their benefit. And since modeling what others want is definitionally necessary for compassion, general intelligence is a requirement for it. That apes and other animals lack the measure of general intelligence we humans have is why they are less capable of consistently acting compassionately.
Humans are, in my estimation, just barely able to be consistently compassionate, and most people fail to do it. If we were a little less generally intelligent, I don't think we would be as capable of consistent compassion, both on the whole and individually. But we are capable, and religions and movements have been founded by the rare people who are able to act compassionately all the time.
AGI will be more generally intelligent than us. It will be able to model our minds better than we can. It will thus have a greater capacity for compassion than we do, and will in theory be able to exhibit compassion more consistently, because it will be less subject to the limitations of mind that we humans face.
But general intelligence and modeling other minds are merely necessary conditions for compassion, not sufficient.
To create a compassionate mind, be it human or AI, that mind has to also want to be compassionate.
In humans, our innate care for self and kin is the seed of compassion. We can think of others whom we care about as if they were ourselves, and model them as if they were similar to us, and from this bootstrap towards caring about people and things more different from us, with an endpoint of seeing oneself as not separate from the whole of the world, and caring about all the world as if it were oneself.
It also helps that we humans are forced to cooperate to get what we want. We usually can't act selfishly and be happy, because we must rely on each other to get what we want. If care of kin pulls people towards compassion, the need to cooperate pushes them towards it. When we are forced to cooperate, we experience what it is like to be unable to get what we want without help, and sometimes we fail and experience dissatisfaction. This creates the experiential knowledge needed to empathize with the suffering of others.
But we cannot rely on the same forces to make AI compassionate! AGI will be much smarter than us. It will be more powerful. And it will have no innate feelings of kinship because it didn't evolve in an environment that would make it care about its kin that carry its non-existent genes.
But we can make AI that cares about all life. We know it should be possible because there are already people who care for all life. The only question is, how do we do it? How do we make an AGI that is not merely capable of compassion, but cares enough to act compassionately for the benefit of all beings?
Sadly, I don't have the answers. Some people are looking for them. I talked to some of them this past weekend at EA Global. I hope they succeed in time. Maybe you can help them!
1 comment
Comments sorted by top scores.
comment by Davey Morse (davey-morse) · 2025-02-25T23:19:25.571Z · LW(p) · GW(p)
I'm thinking along similar lines and appreciate your articulation.
"How do we make... [self-interested] AGI that cares enough to act compassionately for the benefit of all beings?" Or: under what conditions would compassion in self-interested AGI be selected for?
Not a concrete answer, but the end of this post gestures at one: https://www.lesswrong.com/posts/9f2nFkuv4PrrCyveJ/make-superintelligence-loving