Posts

Roman Malov's Shortform 2024-12-19T21:14:54.805Z
Visual demonstration of Optimizer's curse 2024-11-30T19:34:07.700Z

Comments

Comment by Roman Malov on Roman Malov's Shortform · 2024-12-19T21:14:55.985Z · LW · GW

I recently prepared an overview lecture about research directions in AI alignment for the Moscow AI Safety Hub. I had limited time, so I did the following: I reviewed all the sites on the AI safety map, examined the 'research' sections, and attempted to classify the problems they tackle and the research paths they pursue. I encountered difficulties in this process, partly because most sites lack a brief summary of their activities and objectives (Conjecture is one of the counterexamples). I believe that the field of AI safety would greatly benefit from improved communication, and providing a brief summary of a research direction seems like low-hanging fruit.

Comment by Roman Malov on Visual demonstration of Optimizer's curse · 2024-12-13T22:03:38.002Z · LW · GW

So,  is a random variable in the sense that it is drawn from a distribution of functions, and the expected value of those functions at each point  is equal to . Am I understanding you correctly?
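
In case it helps make this concrete, here is a minimal sketch of the setup I have in mind (my own illustration, with made-up numbers, not the post's notation): each option gets an unbiased noisy estimate, yet the estimate of the option chosen by maximizing is systematically too high.

```python
import numpy as np

rng = np.random.default_rng(0)

true_values = np.linspace(0.0, 1.0, 10)   # true value of each option
n_trials = 10_000

selected_true = np.empty(n_trials)
selected_estimate = np.empty(n_trials)

for t in range(n_trials):
    # Unbiased noisy estimates: E[estimate] equals the true value for every option.
    estimates = true_values + rng.normal(0.0, 0.5, size=true_values.shape)
    best = np.argmax(estimates)           # optimize over the estimates
    selected_true[t] = true_values[best]
    selected_estimate[t] = estimates[best]

# The estimate of the chosen option is biased upward even though each
# individual estimate is unbiased: that is the optimizer's curse.
print("mean estimate of chosen option:", selected_estimate.mean())
print("mean true value of chosen option:", selected_true.mean())
```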
 

Comment by Roman Malov on Deep Deceptiveness · 2024-11-04T22:37:49.703Z · LW · GW

I read it as part of the Agent Foundations course, and I found this post really effective and clarifying. It got me thinking: can this generalize to other failure modes? For example, if programmers notice that an AI spends too many resources on self-preservation and then train against that behavior, the failure mode would still arise, because self-preservation is an instrumental goal: it is a fact about the world and about the ways goals can be achieved in it.

Comment by Roman Malov on Hell is wasted on the evil · 2024-10-18T21:46:19.687Z · LW · GW

I'm not a native speaker; can someone please explain the meaning of "Hell is wasted on the evil" in simpler terms?

Comment by Roman Malov on [deleted post] 2024-09-01T21:03:31.184Z

Thank you, that seems to be the clarification I needed. It also reminded me of a good video that touches on the subject.

Comment by Roman Malov on [deleted post] 2024-09-01T19:46:23.824Z

Thanks for your answer, I will read the linked post.

I said in the text that I was going to try to convey the "process" in the comments, and I'll try to do that now.

all sophisticated-enough minds

I think the recursive buck is passed to the word "enough". You need some stratification of minds by sophistication, and a cutoff for when they reach an acceptable level of sophistication.

Comment by Roman Malov on [deleted post] 2024-09-01T19:28:10.428Z

So in a universe with only bosons (so the Pauli principle doesn't apply), everything is the Same?

When I imagine a room full of photons, I see a lot of things that can be Different: for example, the photons' coordinates, wavelengths, polarizations, and their number.

Or are you saying that the Pauli principle is sufficient, but not necessary?

Comment by Roman Malov on [deleted post] 2024-09-01T17:44:44.919Z

If you read further, you can see how this is also passing the recursive buck. 

You: "There are no clear separation between objects, I only use this to increase my utility function"

Me: "How are you deciding on where to stop dividing reality?"

You: "Well, I calculate my marginal utility from creating an additional concept and then Compare it to zer... ah, yeah, there is the recursive buck. It even capitalized as I said it."

So yeah, while this is a desirable point to stop at, the method still relies on your ability to Differentiate between the usefulness of two models, and as far as I can tell, in the end, we can only feel it.

Comment by Roman Malov on Chapter 91: Roles, Pt 2 · 2024-08-26T02:55:17.603Z · LW · GW

Sebz n gval fcbg ba gur raq bs Uneel'f jnaq, n phovp zvyyvzrgre bs napube, fgergpurq bhg n guva yvar bs Genafsvtherq fcvqre-fvyx.

sebz gur puncgre 114

Comment by Roman Malov on Chapter 90: Roles, Pt 1 · 2024-08-25T18:00:34.433Z · LW · GW

Or if I'd - if I'd only gone with - if, that night -

I'm guessing he is talking about the night he lost his potential phoenix.

Comment by Roman Malov on Chapter 89: Time Pressure, Pt 2 · 2024-08-25T17:45:44.343Z · LW · GW

I think that's an intentional choice by the author: what Harry saw was too terrible to acknowledge. Or maybe it's just to create more suspense.

Comment by Roman Malov on Chapter 27: Empathy · 2024-08-06T01:07:29.014Z · LW · GW

Snape told him that he wanted to check if Harry resembled his father, and the test consisted of stopping bullies, so that might be the reason for Harry's guess.

Comment by Roman Malov on Ilya Sutskever created a new AGI startup · 2024-06-19T20:36:45.814Z · LW · GW

safety always remains ahead

When was it ever ahead? I mean, to be sure that safety is ahead, you first need to make advances in safety comparable to those in capabilities. And to do that, you shouldn't be advancing capabilities.

Comment by Roman Malov on [Aspiration-based designs] Outlook: dealing with complexity · 2024-05-02T20:37:34.741Z · LW · GW

maybe you meant pairwise linearly independent (by looking at the graph)

Comment by Roman Malov on [Aspiration-based designs] Outlook: dealing with complexity · 2024-05-02T20:33:22.700Z · LW · GW

Pick  many linearly independent linear combinations  
Aren't there at most  linearly independent linear combinations of ?
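
For what it's worth, here is a quick numerical check of the general fact I have in mind (illustrative only; the dimensions are made up, not taken from the post): any collection of linear combinations of d vectors has rank at most d, so at most d of them can be linearly independent.

```python
import numpy as np

rng = np.random.default_rng(0)

d = 4                                # number of base vectors
base = rng.normal(size=(d, 10))      # d vectors in a 10-dimensional space

# Take many more than d random linear combinations of the base vectors.
coeffs = rng.normal(size=(25, d))
combinations = coeffs @ base

# Their rank never exceeds d, so at most d of them are linearly independent.
print(np.linalg.matrix_rank(combinations))  # prints 4, i.e. at most d
```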

Comment by Roman Malov on My thoughts on the Beff Jezos - Connor Leahy debate · 2024-02-03T21:19:04.986Z · LW · GW

The current population size that Mars can support is 0, so even one person would be overpopulation. To complete the analogy: we are currently sending the entire population to Mars, and someone says, "But what about oxygen? We don't know if there's any on Mars; maybe we should work on spacesuits?" and another replies, "Nah, we'll figure it out when we get there."