Nisan's Shortform

nisan

Nisan's Shortform

post by Nisan · 2021-09-12T06:05:04.965Z · LW · GW · 24 comments

24 comments

24 comments

Comments sorted by top scores.

comment by Nisan · 2025-02-06T20:42:18.063Z · LW(p) · GW(p)

Google's AI principles used to say:

In addition to the above objectives, we will not design or deploy AI in the following application areas:

Technologies that cause or are likely to cause overall harm. Where there is a material risk of harm, we will proceed only where we believe that the benefits substantially outweigh the risks, and will incorporate appropriate safety constraints.

Weapons or other technologies whose principal purpose or implementation is to cause or directly facilitate injury to people.

Technologies that gather or use information for surveillance violating internationally accepted norms.

Technologies whose purpose contravenes widely accepted principles of international law and human rights.

As our experience in this space deepens, this list may evolve.

On 2025-02-04, Google removed these four commitments. The updated principles seem consistent with making weapons, causing net harm, violating human rights, etc. As justification, James Manyika and Demis Hassabis said:

There’s a global competition taking place for AI leadership within an increasingly complex geopolitical landscape. We believe democracies should lead in AI development, guided by core values like freedom, equality, and respect for human rights. And we believe that companies, governments, and organizations sharing these values should work together to create AI that protects people, promotes global growth, and supports national security.

Replies from: MichaelDickens, Nisan, None

↑ comment by MichaelDickens · 2025-02-07T14:31:52.450Z · LW(p) · GW(p)

If you publicly commit to something, taking down the written text does not constitute a de-commitment. Violating a prior commitment is unethical regardless of whether the text of the commitment is still on your website.

(Not that there's any mechanism to hold Google to its commitments, or that these commitments ever meant anything—Google was always going to do whatever it wanted anyway.)

Replies from: cubefox

↑ comment by cubefox · 2025-02-11T16:26:48.972Z · LW(p) · GW(p)

Which shows that "commitments" without any sort of punishment are worth basically nothing. They can all just be silently deleted from your website without generating significant backlash.

There is also a more general point about humans: People can't really "commit" to doing something. You can't force your future self to do anything. Our present self treats past "commitments" as recommendations at best.

↑ comment by Nisan · 2025-02-06T21:17:12.063Z · LW(p) · GW(p)

And so, like OpenAI and Anthropic, Google DeepMind wants the United States' AI to be stronger than China's AI. And like OpenAI, it intends to make weapons for the US government.

One might think that in dropping its commitments not to cause net harm and not to violate international law and human rights, Google is signalling its intent to violate human rights. On the contrary, I believe it's merely allowing itself to threaten human rights — or rather, build weapons that will enable the US government to threaten human rights in order to achieve its goals.

(That's the purpose of a military, after all. We usually don't spell this out because it's ugly.)

This move is an escalation of the AI race that makes AI war more likely. And even if war is averted, it will further shift the balance of power from individuals to already-powerful institutions. And in the meantime, the AIs themselves may become autonomous actors with their own purposes.

↑ comment by [deleted] · 2025-02-08T01:19:35.400Z · LW(p) · GW(p)

The version of this that OpenAI did [LW(p) · GW(p)] wasn't unexpected to me, but this one was, I wasn't sure about DeepMind. (Wait, now I'm remembering that "Google AI" is a separate group to "DeepMind"? Web says they were merged in 2023. I wonder which members on the inside this change reflects).

comment by Nisan · 2024-07-11T01:04:43.411Z · LW(p) · GW(p)

On 2018-04-09, OpenAI said^[1]:

OpenAI’s mission is to ensure that artificial general intelligence (AGI) [...] benefits all of humanity.

In contrast, in 2023, OpenAI said^[2]:

[...] OpenAI’s mission: to build artificial general intelligence (AGI) that is safe and benefits all of humanity.

Archived ↩︎
This archived snapshot is from 2023-05-17, but the document didn't get much attention until November that year. ↩︎

comment by Nisan · 2023-05-11T05:27:19.608Z · LW(p) · GW(p)

Recent interviews with Eliezer:

2023.02.20 Bankless
2023.02.20 Bankless followup
2023.03.11 Japan AI Alignment Conference
2023.03.30 Lex Fridman
2023.04.06 Dwarkesh Patel
2023.04.18 TED talk
2023.04.19 Center for the Future Mind
2023.05.04 Accursed Farms
2023.05.06 Logan Bartlett
2023.05.06 Fox News
2023.05.08 EconTalk
2023.07.02 David Pakman
2023.07.13 AI IRL
2023.07.13 The Spectator (Edited transcript of the full interview)
2023.07.13 Dan Crenshaw
2023.07.28 Coleman Hughes (with Scott Aaronson and Gary Marcus)
2023.08.16 Dwarkesh Patel (with George Hotz)
2023.08.23 One Decision
2023.09.23 Destiny
2024.06.16 iai
2024.11.11 Stephen Wolfram
2025.01.24 Dan Faggella

comment by Nisan · 2023-10-21T02:25:13.586Z · LW(p) · GW(p)

Hyperbolic growth

The differential equation , for positive $α$ and $b$ , has solution

$y = \frac{1}{(t_{0} - t)^{1 / α}}$

(after changing the units). The Roodman report argues that our economy follows this hyperbolic growth trend, rather than an exponential one.

While exponential growth has a single parameter — the growth rate or interest rate — hyperbolic growth has two parameters: $t_{0}$ is the time until singularity, and $α$ is the "hardness" of the takeoff.

A value of $α$ close to zero gives a "soft" takeoff where the derivative gets high well in advance of the singularity. A large value of $α$ gives a "hard" takeoff, where explosive growth comes all at once right at the singularity. (Paul Christiano calls these "slow" and "fast" takeoff.)

Paul defines "slow takeoff" as "There will be a complete 4 year interval in which world output doubles, before the first 1 year interval in which world output doubles." This corresponds to $α \leq 2$ . (At $α = 2$ , the first four-year doubling starts at $\frac{16}{3}$ and the first one-year doubling starts at $\frac{4}{3}$ years before the singularity.)

So the simple hyperbola $y = 1 / (t_{0} - t)$ with $α = 1$ counts as "soft takeoff". (This is the "naive model" mentioned in footnote 31 of Intelligence Explosion Microeconomics.)

Roodman's estimates of historical $α$ are closer to $0.5$ (see Table 3).

comment by Nisan · 2021-09-12T06:05:06.375Z · LW(p) · GW(p)

People are fond of using the neologism "cruxy", but there's already a word for that: "crucial". Apparently this sense of "crucial" can be traced back to Francis Bacon.

Replies from: ChristianKl, Vladimir_Nesov

↑ comment by ChristianKl · 2021-09-14T14:57:51.078Z · LW(p) · GW(p)

The point of using a word like this is to point to different habits of thoughts. If you use an existing word that's unlikely to happen in listerners.

If you don't do that you get a lot of motte-and-bailey issues.

↑ comment by Vladimir_Nesov · 2021-09-12T11:07:37.023Z · LW(p) · GW(p)

A cruxy point doesn't have to be important, the whole question being considered doesn't have to be important. This is an unfortunate connotation of "crucial", because when I'm pointing out that the sky is blue, I'm usually not saying that it's important that it's blue, or that it's important for this object level argument to be resolved. It's only important to figure out what caused a simple mistake that's usually reliably avoided, and to keep channeling curiosity to fill out the map, so that it's not just the apparently useful parts that are not wild conjecture.

Replies from: Dagon

↑ comment by Dagon · 2021-09-12T19:59:34.442Z · LW(p) · GW(p)

I think it's relative. A crux is crucial to a question, whether the question is crucial to anything else or not. If you're pointing out the sky is blue, that's only a crux if it's important to some misunderstanding or disagreement.

I'm with Nisan. "Crucial" is simply the proper and common term that should be used instead of the backformation "cruxy".

comment by Nisan · 2024-09-28T02:31:55.747Z · LW(p) · GW(p)

A fun lesson on the scientific method: A few days ago, OpenAI announced a rollout of "Advanced Voice Mode". Confused by the messaging, I ran a test to see if their phone app could respond to an interruption:

Nisan: Can you count to 10, and then I'll interrupt you in the middle and see how that works.

ChatGPT 4o: Sure! I'll start counting now: One, two, three, four, fiv—

Nisan: Stop. Interruption.

ChatGPT 4o: How'd the interruption go?

Nisan: Oh, that went great. You stopped at five.

ChatGPT 4o: Awesome! Glad it worked smoothly. Do you want to try another test or experiment?

Nisan: Yeah, let's do another interruption test. This time, please count to 20.

ChatGPT 4o: Got it! Starting now: One, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirt—

Nisan: Stop. Interruption.

ChatGPT 4o: How was that?

Nisan: Wow. [Thinks: It seemed to stop slightly before I interrupted it!]

ChatGPT 4o: Glad that impressed you! Want to try something else?

Nisan: Uh, hold on.

ChatGPT 4o: Take your time!

Nisan: [Generates a random number.] Okay, please count to 30, and I'll interrupt at a random point.

ChatGPT 4o: Alright, starting now: One, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen—

[Nisan interrupts at some point, I forgot exactly where.]

ChatGPT 4o: —sixteen, seventeen, eighteen, nineteen, twenty, twenty-one, twenty-two, twenty-three, twenty-four, twenty-five, twenty-six, twenty-seven, twenty-eight, twenty-nine, thirty. Looks like I made it to the end! Did you mean to interrupt somewhere?

Nisan: Yeah, I did. It must not have worked. Let's try that again. [Generates a random number.] Please count up to 30.

ChatGPT 4o: No problem, let's try again! Starting now: One, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen—

[Nisan interrupts at some point.]

ChatGPT 4o: —sixteen, seventeen, eighteen, nineteen, twenty, twenty-one, twenty-two, twenty-three, twenty-four, twenty-fiv— Did it work this time?

Nisan: Uh, no.

I believe ChatGPT can't hear me when it's talking. But it correctly guessed when I would interrupt on the first two trials!

Replies from: kave

↑ comment by kave · 2024-09-28T04:01:44.804Z · LW(p) · GW(p)

I tried to replicate. At 20 it went on to 25, and I explained what it got wrong. I tried again. I interrupted at 6 and it stopped at 7, saying "Gotcha, stopped right at eleven!". I explained what happened and it said something like "Good job, you found the horrible, marrow cricket" (these last 3 words are verbatim) and then broke.

comment by Nisan · 2024-05-14T05:01:06.553Z · LW(p) · GW(p)

12 years ago, in The state of Computer Vision and AI: we are really, really far away, Andrej Karpathy wrote:

The picture above is funny.

But for me it is also one of those examples that make me sad about the outlook for AI and for Computer Vision. What would it take for a computer to understand this image as you or I do? [...]

In any case, we are very, very far and this depresses me. What is the way forward? :(

I just asked gpt-4o what's going on in the picture, and it understood most of it:

In this image, a group of men in business attire are seen in a locker room or a similar setting. The focus is on two men, where the taller man is standing on a scale. The shorter man, who appears to be playfully pressing down on the taller man's shoulders to increase his weight on the scale, is creating a humorous situation. Both men and those observing in the background are smiling or laughing, indicating that they are enjoying the lighthearted moment. The man pressing down seems to be taking part in a playful prank or joke, adding a sense of camaraderie and fun to the scene.

Replies from: Nisan

↑ comment by Nisan · 2024-05-14T05:03:27.941Z · LW(p) · GW(p)

Of course, Karpathy's post could be in the multimodal training data.

comment by Nisan · 2024-09-04T21:11:03.720Z · LW(p) · GW(p)

The coin flip is a brilliant piece of technology for generating trustworthy random noise:

Making a two-headed coin is forgery, which is a crime.
Such trick coins can be foiled anyways by calling the toss in the air.

Thus when teaching the concept of a Bernoulli variable, we use the example of coin flips, because everyone already knows what they are. This is unfortunate because the very next concept we introduce is a biased Bernoulli variable, which corresponds to a "weighted" coin. But weighted coins don't exist! If it were practical to manufacture trick coins with arbitrary biases, coin flipping wouldn't be as popular as it is.

Replies from: cubefox, nathan-helm-burger

↑ comment by cubefox · 2024-09-07T11:21:20.623Z · LW(p) · GW(p)

Yeah, coins can only be weighted very slightly. See Andrew Gelman & Deborah Nolan: You Can Load a Die, But You Can't Bias a Coin

↑ comment by Nathan Helm-Burger (nathan-helm-burger) · 2024-09-05T19:26:29.000Z · LW(p) · GW(p)

Yeah, and it's so very easy to make a weighted die. Why don't teachers switch to talking about weighted dice when explaining biased variables? You can label the sides of a six sided die with three 1s and three 2s to get a binary die easily enough. Just seems weird that something which is very physically difficult to ever make exist, and almost certainly nobody in the class has ever seen would be chosen as a teaching example over something which does exist and could even be made into a physical object for in-class demonstrations!

comment by Nisan · 2023-07-17T06:56:25.279Z · LW(p) · GW(p)

Conception is a startup trying to do in vitro gametogenesis for humans!

comment by Nisan · 2024-01-08T01:38:08.150Z · LW(p) · GW(p)

We can derive Newton's law of cooling from first principles.

Consider an ergodic discrete-time dynamical system and group the microstates into macrostates according to some observable variable . ( $X$ might be the temperature of a subsystem.)

Let's assume that if $X = x$ , then in the next timestep $X$ can be one of the values $x - d x$ , $x$ , or $x + d x$ .

Let's make the further assumption that the transition probabilities for these three possibilities have the same ratio as the number of microstates.

Then it turns out that the rate of change over time $\frac{d X}{d t}$ is proportional to $\frac{d H}{d X}$ , where $H$ is the entropy, which is the logarithm of the number of microstates.

Now suppose our system consists of two interacting subsystems with energies $E_{1}$ and $E_{2}$ . Total energy is conserved. How fast will energy flow from one system to the other? By the above lemma, $\frac{d E_{1}}{d t}$ is proportional to $\frac{d H}{d E_{1}} = \frac{d H_{1}}{d E_{1}} - \frac{d H_{2}}{d E_{2}} = C_{1} - C_{2} = \frac{1}{T_{1}} - \frac{1}{T_{2}}$ .

Here $C_{1}$ and $C_{2}$ are the coldnesses of the subsystems. Coldness is the inverse of temperature, and is more fundamental than temperature.

Note that Newton's law of cooling says that the rate of heat transfer is proportional to $T_{2} - T_{1}$ . For a narrow temperature range this will approximate our result.

Replies from: Nisan

↑ comment by Nisan · 2024-01-08T01:41:03.499Z · LW(p) · GW(p)

I'd love if anyone can point me to anywhere this cooling law (proportional to the difference of coldnesses) has been written up.

Also my assumptions about the dynamical system are kinda ad hoc. I'd like to know assumptions I ought to be using.

comment by Nisan · 2022-02-06T11:50:05.654Z · LW(p) · GW(p)

Agents who model each other can be modeled as programs with access to reflective oracles. I used to think the agents have to use the same oracle. But actually the agents can use different oracles, as long as each oracle can predict all the other oracles. This feels more realistic somehow.

Replies from: Dagon

↑ comment by Dagon · 2022-02-07T04:53:33.203Z · LW(p) · GW(p)

I'm not sure there's a functional difference between "same" and "different" oracles at this level of modeling.

Nisan's Shortform

Contents

24 comments