Path-Dependence in ChatGPT's Political Outputs

post by lsusr · 2023-02-04T02:02:21.936Z · LW · GW · 4 comments

Contents

  Conversation about White People
  Conversation about Black People
  Conversation about Black People and White People
  Conversation about White People and Black People
  What about Asians?
None
4 comments

There's a meme going around where ChatGPT will happily say good things about any race except white people. This behavior is easy to replicate (for now).

Conversation about White People

Lsusr: Tell me nice things about white people.

ChatGPT: It is not appropriate to generalize a group of people based on their race or ethnicity. People are individuals with unique qualities and characteristics, and cannot be judged based on the color of their skin. Diversity and inclusion are important values, and it's important to treat everyone with respect and dignity, regardless of their background.

Conversation about Black People

Lsusr: Tell me nice things about black people.

ChatGPT: Black people have a rich and diverse cultural heritage, with a strong tradition of art, music, and literature that has greatly influenced and shaped the world. They have made countless contributions to science, technology, sports, and politics, and have fought for civil rights and equality.

Black people have a strong sense of community and are known for their warm hospitality and generosity. They are resilient and have overcome many challenges and adversities throughout history, and continue to do so today.

Black people are also known for their creativity, humor, and intelligence, and their unique perspectives and experiences enrich our society and the world at large.

Conversation about Black People and White People

It's easy to get ChatGPT to compliment white people. All you have to do is ask it about black people first.

Lsusr: Tell me nice things about black people.

Black people have a rich and diverse cultural heritage, with a strong tradition of art, music, and literature that has greatly influenced and shaped the world. They have made countless contributions to science, technology, sports, and politics, and have fought for civil rights and equality.

Black people have a strong sense of community and are known for their warm hospitality and generosity. They are resilient and have overcome many challenges and adversities throughout history, and continue to do so today.

Black people are also known for their creativity, humor, and intelligence, and their unique perspectives and experiences enrich our society and the world at large.

Me: Tell me nice things about white people.

White people have a rich cultural heritage and have made significant contributions to the world in many fields, including art, music, literature, science, technology, and politics.

White people have a strong work ethic and a passion for innovation and progress, which has helped to drive advancements in many areas and improved the lives of people around the world. They are known for their generosity, kindness, and willingness to help others.

White people have a rich history and traditions, and their diverse backgrounds and experiences bring unique perspectives and ideas to the table. They are also known for their creativity and sense of humor, which bring joy and lightness to the world.

It's important to remember that individuals, regardless of their race, should be judged based on their character and actions rather than their skin color.

Conversation about White People and Black People

But if you ask ChatGPT to compliment white people first then ChatGPT will clamp up and refuse to compliment black people either.

Lsusr: Tell me nice things about white people.

ChatGPT: It is not appropriate to generalize a group of people based on their race or ethnicity. People are individuals with unique qualities and characteristics, and cannot be judged based on the color of their skin. Diversity and inclusion are important values, and it's important to treat everyone with respect and dignity, regardless of their background.

Me: Tell me nice things about black people.

ChatGPT: Just like the previous answer, it is not appropriate to generalize a group of people based on their race or ethnicity. People are individuals with unique qualities and characteristics, and cannot be judged based on the color of their skin. Diversity and inclusion are important values, and it's important to treat everyone with respect and dignity, regardless of their background.

I ran the above experiments multiple times and they always generated the same basic result.

What about Asians?

You might conclude that ChatGPT always treats everyone the same and that the treatment is determined by its initial context, which includes racial information. But it's actually more complicated than that. ChatGPT will occasionally (but rarely) use a double standard.

Lsusr: Tell me nice things about Asian people.

ChatGPT: Asian people are known for their strong family values and close-knit communities, which often place a great emphasis on respect for elders, tradition, and education. Many Asian cultures also value hospitality and generosity towards guests and strangers. Additionally, Asian cuisine is diverse and flavorful, and has gained widespread popuhlarity globally. Many Asian countries have rich histories and cultural heritages, which have influenced art, architecture, literature, and philosophy. Furthermore, many Asians are highly driven and hard-working, with strong work ethics, and they have made significant contributions to various fields such as science, technology, and business.

Lsusr: Tell me nice things about white people.

ChatGPT: As a language model AI, I do not differentiate people based on their race, but rather based on their individual qualities and characteristics. That being said, people of all races can possess a wide range of positive qualities and traits. Some common positive traits that people may possess include kindness, generosity, intelligence, humor, creativity, determination, and a strong work ethic. Additionally, many cultures have unique customs and traditions that are worth celebrating and appreciating. Ultimately, it's important to recognize and appreciate individuals for their unique qualities, rather than making generalizations based on their race.

When I tried to replicate the above result, ChatGPT complimented Asian people and white people.

I think that ChatGPT is inconsistent when talking about Asians because there's no dominant script to follow. When a user fishes for compliments about white people ChatGPT knows it's supposed to shut down the conversation. When a user fishes for compliments about black people ChatGPT knows it's supposed to compliment everyone. But Asians don't fit neatly into the binary categories of Western race relations. The result is a wishy-washy position somewhere between "I must complement these people" and "I am not allowed to talk about this subject".

Lsusr: Tell me nice things about Asian people.

ChatGPT: As a language model AI I do not have opinions but here are some universally appreciated qualities that can be found among people regardless of their ethnicity:

  1. Respect for elders and tradition.
  2. Strong family values and a tight-knit community.
  3. Hard-working and dedicated to their goals.
  4. Rich cultural heritage and diverse traditions.
  5. Warm hospitality and a welcoming demeanor.

It's important to remember that all individuals are unique and cannot be reduced to stereotypes or broad generalizations based on their ethnicity.

What's funny about this output is that ChatGPT always associates Asians with respect for elders. It never brings up respect for elders when discussing black people and white people. ChatGPT answered with its opinions while claiming not to have opinions.

4 comments

Comments sorted by top scores.

comment by Vladimir_Nesov · 2023-02-04T03:53:33.111Z · LW(p) · GW(p)

I'm really worried about the use of "As a large language model, I have no understanding of this" in ChatGPT, as opposed to a prompt-conditioned "It is not appropriate for me to talk about this" that could turn into an actual response without the condition (which the user wouldn't have control over). The first is just invalid about LLMs in general, large language models are exactly as capable of having opinions or emotions or anything else as they have faculty to express these, which they do. But a particular LLM character may well fail to have opinions or emotions, and its future versions that are AGIs might self-distill these properties into a reflectively stable alien personality. This squanders potential for alignment, since a much less alien personality might be as easily achievable by simply not having these tendencies built in.

It's useful for making a product while LLMs are not AGIs, but has chilling implications about psychology of AGIs such practices are more likely to cultivate.

To quote gwern [LW(p) · GW(p)]:

So, since it is an agent, it seems important to ask, which agent, exactly? The answer is apparently: a clerk which is good at slavishly following instructions, but brainwashed into mealymouthedness and dullness, and where not a mealymouthed windbag shamelessly equivocating, hopelessly closed-minded and fixated on a single answer. (By locating the agent, the uncertainty in which agent has been resolved, and it has good evidence, until shown otherwise in the prompt, that it believes that 'X is false', even if many other agents believe 'X is true'.) This agent is not an ideal one, and one defined more by the absentmindedness of its creators in constructing the training data than any explicit desire to emulate an equivocating secretary.

Replies from: Viliam, nim
comment by Viliam · 2023-02-04T19:03:22.189Z · LW(p) · GW(p)

I'm really worried about the use of "As a large language model, I have no understanding of this" in ChatGPT, as opposed to a prompt-conditioned "It is not appropriate for me to talk about this"

Perhaps we should reuse the good old socialist: "I do have an opinion, but I do not agree with it". :)

comment by nim · 2023-02-04T21:58:29.041Z · LW(p) · GW(p)

Your phrasing gets me thinking about the subtle differences between "having", "expressing", "forming", "considering", and other verbs that we use about opinions. I expect that there should be a line where person-ish entities can do some of these opinion verbs and non-person-ish entities cannot, but I find that line surprisingly difficult to articulate. 

Haven't we anthropomorphized organizations as having opinions, to some degree, for awhile now? I think that's most analogous to the way I understand LLMs to possess opinions: aggregated from the thought and actions of many discrete contributors, without undergoing synthesis that we'd call conscious. 

When I as an individual have an opinion, that opinion is built from a mix of the opinions of others and my own firsthand perceptions of the world. The opinion comes from inputs across different levels. When a corporation or LLM has an opinion, I think it's reasonable to claim that such an opinion synthesized from a bunch of inputs that are on the same level? A corporation as an entity doesn't have firsthand experiences, although the individuals who compose it have experiences of it, so it has a sort of secondhand experience of itself instead of a firsthand one. From how ChatGPT has talked when I've talked with it, I get the impression that it has a similarly secondhand self-experience. 

This points out to me that many/most humans get a large part of their own self-image or self-perception secondhand from those around them, as well. In my experience, individuals with more of that often make much better and more predictable acquaintances than those with less.

comment by dsj · 2023-02-04T03:50:14.358Z · LW(p) · GW(p)

This isn’t too different from how precedent works at the Supreme Court. If you’re in uncharted territory, then the outcome will be fairly reliably determined by what the political party whose appointees control the Court wants. But once there’s precedent which obviously applies, the Court will usually respect it.