Has anyone actually tried to convince Terry Tao or other top mathematicians to work on alignment?

post by P. · 2022-06-08T22:26:55.710Z · LW · GW · 27 comments

This is a question post.


This has been discussed several times in the past, see:

But I’m not aware of anyone who has actually even tried to do something like this.

Of special interest is this comment [LW(p) · GW(p)] by Eliezer about Tao:

We'd absolutely pay him if he showed up and said he wanted to work on the problem.  Every time I've asked about trying anything like this, all the advisors claim that you cannot pay people at the Terry Tao level to work on problems that don't interest them.  We have already extensively verified that it doesn't particularly work for eg university professors.

So if anyone has contacted him or people like him (instead of regular college professors), I’d like to know how that went.

Otherwise, especially for people who aren’t merely pessimistic but measure success probability in log-odds [LW · GW], sending that email is a low-cost action that we should definitely try.
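
To spell out the log-odds framing: the log-odds of a probability $p$ is $\log_2\frac{p}{1-p}$, so an action that merely doubles a tiny success probability, say from 0.001% to 0.002%, still gains about one full bit of log-odds even though the absolute change is negligible. Under that measure, cheap long-shot actions like a single email are clearly worth trying.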

So you (whoever is reading this) have until June 23rd to convince me that I shouldn’t send this to his @math.ucla.edu address:

Edit: I’ve been informed that someone with much better chances of success will be trying to contact him soon, so the priority now is to convince Demis Hassabis (see further below) and to find other similarly talented people.

Title: Have you considered working on AI alignment?

Body:

It is not once but twice that I have heard leaders of AI research orgs say they want you to work on AI alignment. Demis Hassabis said on a podcast that when we near AGI (i.e. when we no longer have time) he would want to assemble a team with you on it but, I quote, “I didn’t quite tell him the full plan of that”. And Eliezer Yudkowsky of MIRI (contact@intelligence.org) said in an online comment [LW(p) · GW(p)] “We'd absolutely pay him if he showed up and said he wanted to work on the problem.  Every time I've asked about trying anything like this, all the advisors claim that you cannot pay people at the Terry Tao level to work on problems that don't interest them.”, so he didn’t even send you an email. I know that to you it isn't the most interesting problem to think about[1] but it really, actually is a very very important and urgent completely open problem. It isn’t simply a theoretical concern, if Demis’ predictions of 10 to 20 years to AGI are anywhere near correct, it will deeply affect you and your family (and everyone else).

If you are ever interested you can start by reading the pages linked in EA Cambridge’s AGI Safety Fundamentals course or the Alignment Forum [AF · GW].

Best of luck,

P.

You can do any of:

What you probably shouldn’t do is send your own email without telling the rest of us. His attention is a limited resource, and bothering him with many different emails might reduce his sympathy for the cause.

And other than him, how many people do you think have a comparable chance of solving the problem or making significant progress? And how do we identify them? By the number of citations? Prizes won? I would like to have a list like that along with conditions under which each alignment org would hire each person. The probability of convincing Tao might be low, but with, say, 100 people like him the chances of finding someone might be decent.

I’m pretty sure that most of them haven’t heard about alignment, or have and just discarded it as something not worth thinking about. I don’t think this means that they couldn’t do great alignment work if they tried; maybe getting them to seriously think about the problem at all is the hard part, and after that their genius simply generalises to this new area.

Relatedly, does anyone here know why Demis Hassabis isn’t assembling his dream team [LW(p) · GW(p)] right now? The same as above applies, but until June 23rd (moved up from the 1st of July):

Title: Are you sure you should wait before creating your dream alignment team?

Body:

On S2, Ep9 of the DeepMind podcast you said that when we get close to AGI we should pause pushing the performance of AI systems to guarantee they are safe. What do you think the timeline would be like in that scenario? When we get close, while DeepMind and maybe some other teams might pause development, everyone else will keep working as fast as possible to get to the finish line, and all else equal, whoever devotes fewer resources to non-capabilities work will get there first. Creating AGI is already a formidable task, but we at least have formalisms like AIXI that can serve as a guide by telling us how we could achieve general intelligence given unlimited computing power. For alignment we have no such thing: existing IRL algorithms couldn’t learn human values even with direct IO access and unlimited computing power, and then there is the inner alignment problem. If we don’t start working on the theoretical basis of alignment now, we won’t have aligned systems when the time comes.

This should be obvious to him, but just in case.
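
As background (not part of the draft email above): the AIXI formalism it mentions is Hutter’s definition of an optimal but uncomputable reinforcement-learning agent. A standard statement of its action rule is

$$a_k \;=\; \arg\max_{a_k}\sum_{o_k r_k}\cdots\max_{a_m}\sum_{o_m r_m}\bigl[r_k+\cdots+r_m\bigr]\sum_{q\,:\,U(q,a_1\ldots a_m)=o_1 r_1\ldots o_m r_m}2^{-\ell(q)},$$

where the $a_i$, $o_i$, $r_i$ are actions, observations and rewards, $U$ is a universal Turing machine, $\ell(q)$ is the length of program $q$, and $m$ is the horizon. The contrast being drawn is that nothing comparably canonical exists for “learn and then optimize human values”, even if computational limits are ignored entirely.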

  1. ^

    This linked here [EA · GW] before, but I was told it was a bad idea.

Answers

answer by TekhneMakre · 2022-06-09T00:05:15.038Z · LW(p) · GW(p)


Thank you for posting about this here so that you can get feedback, and so that other people can know how much people are doing this sort of thing (and by the same token it could be good for people who've already done this sort of thing to say so).

I have a bit of a sinking feeling reading your draft; I'll try to say concrete things about it, but I don't think I'll capture all of what's behind the feeling. I think part of the feeling is about, this just won't work.

Part of it is like, the email seems to come from a mindset that doesn't give weight to curiosity and serious investigation (what Tao does with his time).

I know that to you it isn't the most interesting problem to think about[1] but it really, actually is a very very important and urgent completely open problem. It isn’t simply a theoretical concern, if Demis’ predictions of 10 to 20 years to AGI are anywhere near correct, it will deeply affect you and your family (and everyone else).

I think there's a sort of violence or pushiness here that's anti-helpful. It doesn't acknowledge that Tao doesn't have good reason to trust your judgements about what's "very very important and urgent", and people who go around telling other people what things are "very very important and urgent" in contexts without trust in judgement are often trying to coerce people into doing things they don't want to do. It doesn't acknowledge that people aren't utility maximizing machines, but instead have momentum and joy and specialization and context. (Not to say that Tao doesn't deserve to be informed about the consequences of events happening in the world and his possible effect on those consequences, and not to say that stating beliefs is bad, and not to say that Tao might just be curious and learn about AI risk if it's shown to him in the right way.)  

Another thing is the sources recommendation. The links should be to technical arguments about problems in AI alignment and technical descriptions of the overall problem, the sort of thing X-risk and AI-risk thinkers say to each other, not to  material prepared with introductoriness in mind.

It is not once but twice that I have heard leaders of AI research orgs say they want you to work on AI alignment.

This is kind of weird and pushy. On the face of it, it looks like either you're confused and think that big high status people to you are also big high status people to Tao and therefore should be able to give him orders about what to work on, and that Tao is even the sort of entity that takes orders; or at least, it looks like you yourself are trying to take orders from big high status people, propagating perceived urgency from them to whoever else, without regard to individual agents' local/private information about what's good for them to do. Like, it looks like you got scared, flailed and grasped for whatever the high status people said they think might be cool, and then wanted to push that. (I'm being blunt here, but to be clear, if something like this is happening, that's very empathizable-with; I don't think "you're bad" or anything like that, and doing stuff that seems like it would have good consequences is generally good.)

If you are ever interested you can start by reading

This is sort of absurd: 1. if Tao were interested, he could likely have lots of conversations with competent AI alignment thinkers, which would be a much better use of his time, and 2. frankly, it seems like you're posturing as someone giving orders to Tao.

comment by JeffreyK · 2022-06-27T01:48:51.304Z · LW(p) · GW(p)

I agree with TekhneMakre... it comes across like an average-looking, unconfident person asking out a gorgeous celeb. Probably a friend approaching him is best, but an email can't hurt. I would get a few people together to work on it... my approach would be to truly represent who we are, a motivated group of people with the desire to write this email to him, by saying something like, "There's a great forum of AI-interested and concerned folks that we are a part of, many of us on the younger side, and we fear for the future of humanity from misaligned AI, and we look to people like you, Dr. Tao, as the kind of gifted person we hope could become involved early in helping guide AI in the right directions that would keep us all safe. We are younger and up-and-coming, so we don't know how to appeal to what interests you, so we're just laying it out there so you can know there are thousands of us, and we're hoping to create a conversation with you and your high-level peers to drive some energy in this direction and maybe your direct involvement. Thanks."

comment by P. · 2022-06-09T09:56:42.659Z · LW(p) · GW(p)

I was trying to rely on Tao’s trust in Demis's judgement, since he is an AI researcher. Mentioning Eliezer is mainly so he has someone to contact if he wants to get hired.

I wanted his thinking to be “this competent entity has spent some of his computational resources verifying that it is important to solve this problem, and now that I’m reminded of that I should also throw mine at it”.

Is he truly mostly interested in what he considers to be mentally stimulating? Not in improving the world, or in social nonsense, or guaranteeing that his family is completely safe from all threats?

Then was including this link [EA · GW] a bad idea? It gives examples of areas a mathematician might find interesting. And if not that, then what should I say? I’ve got nothing better. Do you know any technical introduction to alignment that he might like?

And about getting him to talk to other people, if anyone volunteers just DM me your contact information so that I can include it in the email (or reply directly if you don’t care about it being public). I mean, what else could I do?

Replies from: adrian-arellano-davin, TekhneMakre
comment by mukashi (adrian-arellano-davin) · 2022-06-09T14:59:51.982Z · LW(p) · GW(p)

If you plan to rewrite that letter with a less pushy tone (I agree 100% with the comment from TekhneMakre), I think it might be useful to reframe the problem a bit. Imagine that a random guy is writing to you instead, and he is telling you to work on deflecting possible meteorites heading for Earth. What sort of email would compel you to reply?

Replies from: P.
comment by P. · 2022-06-09T16:10:54.362Z · LW(p) · GW(p)

I’ll rewrite it but I can’t just model other people after me. If I were writing it for someone like myself it would be a concise explanation of the main argument to make me want to spend time thinking about it followed by a more detailed explanation or links to further reading. As long as it isn’t mean I don’t think I would care if it’s giving me orders, begging for help or giving me information without asking for anything at all. But he at least already knows that unaligned AIs are a problem, I can only remind him of that, link to reading material or say that other people also think he should work on it.

But now the priority of that is lower, see the edit to the post. Do you think that the email to Demis Hassabis has similar problems or that it should stay like it is now?

comment by TekhneMakre · 2022-06-10T02:12:48.382Z · LW(p) · GW(p)

Does the stuff about pushiness make sense to you? What do you think of it? I think as is, the letter, if Tao reads it, would be mildly harmful, for the reasons described by other commenters. 

Replies from: P.
comment by P. · 2022-06-10T15:46:42.581Z · LW(p) · GW(p)

I think I get it, but even if I didn’t now I know that’s how it sounds, and I think I know how to improve it. That will be for other mathematicians though (at least Maxim Kontsevich), see the edit to the post. Does the tone in the email to Demis seem like the right one to you?

Replies from: TekhneMakre
comment by TekhneMakre · 2022-06-10T18:09:18.008Z · LW(p) · GW(p)

In terms of tone it seems considerably less bad. I definitely like it more than the other one because it seems to make arguments rather than give social cues. It might be improved by adding links giving technical descriptions about the terms you use (e.g. inner alignment (Hubinger's paper), IRL (maybe a Russell paper on CIRL)). I still don't think it would work, simply because I would guess Hassabis gets a lot of email from randos who are confused and the email doesn't seem to distinguish you from that (this may be totally unfair to you, and I'm not saying it's correct or not, it's just what I expect to happen). I also feel nervous about talking about arms races like that, enforcing a narrative where they're not only real but the default (this is an awkward thing to think because it sounds like I'm trying to manage Hassabis's social environment deceptively, and usually I would think that worrying about "reinforcing narratives" isn't a main thing to worry about and instead one should just say what one thinks, but, still my instincts say to worry about that here, which might be incorrect). 

answer by Adam Zerner (adamzerner) · 2022-06-09T16:40:10.415Z · LW(p) · GW(p)

I think that to a non-trivial extent, we have a limited supply of such efforts. The more times Terry has been contacted about this, the less likely he is to respond positively.

And to some extent I suppose this is true about reaching out to such figures more generally. Ie. maybe word gets out that we've been doing such outreach and by the time we contact John Doe, even if it's our first time contacting John Doe, we may have exhausted our supply of John Doe's patience.

So then, I don't think such an action is as low cost as it may seem. It costs more than the time it takes to write the email.

What makes more sense to me is to try to traverse through social networks and reach him that way. Figure out which nodes are close to him who he listens to. Note that they might be bloggers like Scott Alexander or someone like Dan Luu. From there think about which of those nodes make sense to pursue. Maybe one, maybe multiple. Then backtrack and think about how we can utilize our current connections to reach those nodes.

I also think it'd be worth brainstorming more creative solutions with a bunch of yoda timers [? · GW]. I'll try one right now.

  • Billboard ad.
  • Well put together video.
  • Figure out what sort of people have the skillset for this and pay them as consultants.
  • Be like Hermione and read books to educate ourselves first. Especially because it signals competence. Signals of incompetence could make someone like Terry be a lot more likely to reject us.
  • Research more directly whether stuff like this has been done before. Talk to people who have done it to see what advice they have.
  • Put a bounty on it. $50k to get a response from him.
  • Attach money to it. $50k to Terry to sit down and discuss this.
  • Start with (way) less prestigious people. Presumably they're easier to reach and perhaps to convince. Then with a bunch of them working on it, higher prestige people would start to notice and be more easily convinced.

Some of these ideas seem pretty solid. My sense is that the best path forward is:

  1. More brainstorming from the community.
  2. Get in touch with various organizations (MIRI, CEA...) to see where they are at with this stuff.
  3. Education. Figure out what academic fields/topics are relevant. Learn about them. Probably do some sort of write-up. Nothing too crazy, but I think the low hanging fruits should be addressed.
  4. Decide on a path forward. I suspect that the initiatives should come from an organization like MIRI or CEA, because I assume people like Terry would be more likely to respond to representatives of decently prestigious organizations.

This is a little rambly. Sorry. I'll end here.

comment by Not Relevant (not-relevant) · 2022-06-10T02:35:44.478Z · LW(p) · GW(p)

These are all good ideas, but I also think it’s important not to Chesterton’s Fence too hard. A lot of passionate people avoid doing alignment stuff because they assume it’s already been considered and decided against, even though the field doesn’t have that many people and much of its cultural capital is new.

Be serious, and deliberate, and make sure you’re giving it the best shot if this is the only shot we have, but most importantly, actually do it. There are not many other people trying.

Replies from: adamzerner
comment by Adam Zerner (adamzerner) · 2022-06-10T03:59:42.315Z · LW(p) · GW(p)

Thanks for saying that. I think I needed to hear it.

comment by JeffreyK · 2022-06-27T01:53:00.934Z · LW(p) · GW(p)

These are a lot of good ideas. As I commented above, I think a good approach is to truly represent that we are a bunch of younger people who fear for the future... this would appeal to a lot of folks at his level: knowing the kids are scared and need his help.

answer by [deleted] · 2022-06-09T07:09:02.552Z · LW(p) · GW(p)

Hi, long-time lurker but first-time poster with a background in math here. I personally agree that it would be a good idea if we were to at least try to get some extremely talented mathematicians to think about alignment. Even if they decide not to, it might still be interesting to see what kinds of objections they have to working on it (e.g. is it because they think it's futile and doomed to failure, because they think AGI is never going to happen, because they think alignment will not be an issue, because they feel they have nothing to contribute, or because it's not technically interesting enough?).

However, I would also like to second TekhneMakre's concerns about the format and content of the email. If you sample some comments on posts on Terry Tao's blog, you will find that there are a number of commenters who would probably best be described as cranks who indefatigably try to convince Terry that their theories are worth paying attention to, that Terry is currently not wisely spending his time, etc. He (sensibly) ignores these comments, and has probably learned for the sake of sanity not to engage with anyone who seems to fit this bill. I am concerned that the email outlined in your post will set off the same response and thus be ignored. AI safety is still a rather fringe idea amongst academics, at least partly because it is speculative and lacking concreteness. It took me years as an academic-adjacent person to be even somewhat convinced that it could be a problem (I still am not totally convinced, but I am convinced it is at least worth looking into). I do not think an email appealing to emotion and anecdotes is likely to convince someone from that background encountering this problem.

I have three alternative suggestions; I'm not sure how good they are, so take them each with a grain of salt:

Firstly, note that Scott Aaronson has said here https://scottaaronson.blog/?p=6288#comment-1928043 that he would provisionally be willing to think about alignment for a year. This seems like it would have several advantages (1) He has already signaled interest, provisionally, so it would be easier to convince him that it might be worth working on, (2) He is already acquainted with many of the arguments for taking AGI seriously, so could start working on the problem more immediately, (3) He is well acquainted with the rationalist community and so would not be put off by rationalist norms or affiliated ideas such as EA (which I believe accounts for the skepticism of at least some academics), (4) Scott's area of work is CS theory, which seems like it would be more relevant to alignment than Tao's fields of interest. 

Secondly, there are some academics who take AI safety arguments seriously. Jacob Steinhardt comes to mind, but I'm sure there are a decent number of others, especially given recent progress on AI. If these academics were to contact other top academics asking them to consider working on AI safety, the request would come across as much more credible. They would also know how to frame the problem in such a way to pique the interest of top mathematicians/computer scientists. 

Thirdly, note that there are many academics who are open to working on big policy problems that do not directly concern their primary research interests. Terry Tao, I believe, is one of them, as evidenced by https://newsroom.ucla.edu/dept/faculty/professor-terence-tao-named-to-president-bidens-presidents-council-of-advisors-on-science-and-technology . I'm not sure to what extent this is an easier problem or a desirable course of action, but if you could convince some people in politics that this problem is worth taking seriously, it is possible that the government might directly ask these scientists to think about it.  

This last point is not a suggestion, but I would like to add one note. Eliezer claims that he was told that you cannot pay top mathematicians to work on problems. I believe this is somewhat false. There are many examples of very talented professors and PhD students leaving academia to work at hedge funds. One example is Abhinav Kumar, who a few years ago was one of the coauthors on a paper solving the long-open problem of optimal sphere packings in 24 dimensions. He left an Associate Professorship at MIT to work at Renaissance Technologies (a hedge fund). Not exactly in the same vein, but Huawei has recruited 4 Fields medalists to work with them (e.g. see https://www.ihes.fr/en/laurent-lafforgue-en/ for one example), although I'm not certain whether they are working on applied problems. I cannot say whether money is a motivating factor in any given one of these cases, but there are more examples like this, and I think it is fair to say that at least some substantial fraction of all such people involved might have been motivated at least partly by money.

comment by [deleted] · 2022-06-18T00:37:45.128Z · LW(p) · GW(p)

Seems that I wasn't the only person to notice Scott's comment on his blog :) He's just announced that he'll be working on alignment at OpenAI for a year: https://scottaaronson.blog/?p=6484

comment by Greg C (greg-colbourn) · 2022-06-09T10:06:10.063Z · LW(p) · GW(p)

Oh wow, didn't realise how recent the Huawei recruitment of Fields medalists was! This from today. Maybe we need to convince Huawei to care about AGI Alignment :)

comment by P. · 2022-06-09T11:15:32.370Z · LW(p) · GW(p)

Then do you think I should contact Jacob Steinhardt to ask him what I should write to interest Tao and avoid seeming like a crank?

There isn’t much I can do about SA other than telling him to work on the problem in his free time.

Unless something extraordinary happens I’m definitely not contacting anyone in politics. Politicians being interested in AGI is a nightmarish scenario and those news about Huawei don’t help my paranoia about the issue.

Replies from: None
comment by [deleted] · 2022-06-09T17:16:43.254Z · LW(p) · GW(p)

I personally think the probability of success would be maximized if we were to first contact high-status members of the rationalist community, get them on board with this plan, and ask them to contact Scott Aaronson as well as contact professors who would be willing to contact other professors. 

The link to Scott Aaronson's blog says he provisionally would be willing to take a leave of absence from his job to work on alignment full-time for a year for $500k. I believe EA has enough funds that they could fund that if they deemed it to be worthwhile. I think the chance of success would be greatest if we contacted Eliezer and/or whoever is in charge of funds, asked them to make Scott a formal offer, and sent Scott an email with the offer and an invitation to talk to somebody (maybe Paul Christiano, his former student) working on alignment to see what kinds of things they think are worth working on. 

I think even with the perfect email from most members of this community, the chances that e.g. Terry Tao reads it, takes it seriously, and works on alignment are not very good, due to lack of easily verifiable credibility of the sender. Institutional affiliation at least partly remedies this, and so I think it would be preferable if an email came from another professor who directly tried to convince them.

I think cold-emailing Jacob Steinhardt/Robin Hanson/etc. asking them to email other academics would have a better chance of succeeding given that the former indeed participate on this forum. However, even here, I think people are inclined to pay more attention to the views of those closer to them. My impression is that Eliezer and other high-ranked members of the rationalist community have closer connections to these alignment-interested professors (and know many more such professors) and could more successfully convince them to reach out to their colleagues about AI safety. 

I don't mean to suggest that these less-direct ways are necessarily better. If for instance Eliezer is not willing to talk to Jacob about this, then it might be better to contact Jacob than to do nothing. If you are not able to reach Jacob by any method, it might be better to contact Tao directly than to do nothing. I guess I only wish to say that you might want to attempt these more established channels before reaching out personally.

I also think many academics may be averse to contacting their colleagues about AI safety as it may come with a risk to their academic reputation. So I think it is worth keeping in mind that the chance of succeeding at this may not be very high.

Finally, thank you again for the original post-- I think it is important.

answer by DirectedEvolution (AllAmericanBreakfast) · 2022-06-09T06:11:13.360Z · LW(p) · GW(p)

My concern is less your email, and more the precedent. Having the rationality community model and encourage obviously undesired forms of contact with high-prestige figures seems like it could lead to intrusions of privacy. One person sending an email is ignorable. If emails, phone calls, unsolicited office visits, etc. start piling up under the banner of “AI risk,” it could feel quite invasive to those on the receiving end. My concern in particular is that people doing as you’re doing may not have the capacity to coordinate their actions. We may not even know whether or how much “randomly emailing Terry Tao about X risk” is going on.

comment by P. · 2022-06-09T09:17:27.088Z · LW(p) · GW(p)

That’s part of the point of the post, to coordinate so that fewer emails are sent. I asked if anyone tried something similar and asked people not to send their own emails without telling the rest of us.

answer by interstice · 2022-06-10T02:31:13.866Z · LW(p) · GW(p)

I think Maxim Kontsevich might be a better candidate for an elite mathematician to try to recruit. Check out this 2014 panel [LW · GW] with him, Tao and some other eminent mathematicians -- he alone said that he thought HLAI(in math) is plausible in our lifetimes, but also that working on it might be immoral(!) He also mentioned an AI forecast by Kolmogorov that I had never heard of before, so it seems he has some pre-existing interest in the area.

answer by Joseph Van Name · 2024-01-12T17:01:26.213Z · LW(p) · GW(p)

Um. If you want to convince a mathematician like Terry Tao to be interested in AI alignment, you will need to present yourself as a reasonably competent mathematician or related expert and actually formulate an AI problem in such a way so that someone like Terry Tao would be interested in it. If you yourself are not interested in the problem, then Terry Tao will not be interested in it either.

Terry Tao is interested in random matrix theory (he wrote the book on it), and random matrix theory is somewhat related to my approach to AI interpretability and alignment. If you are going to send these problems to a mathematician, please inform me about this before you do so.

My approach to alignment: Given matrices , define a superoperator  by setting 

, and define . Define the -spectral radius of  as . Here,  is the usual spectral radius.

Define . Here,  is either the field of reals, field of complex numbers, or division ring of quaternions.

Given matrices , define

. The value  is always a real number in the interval  that is a measure of how jointly similar the tuples  are. The motivation behind  is that  is always a real number in  (well except when the denominator is zero) that measures how well  can be approximated by -matrices. Informally,  measures how random  are where a lower value of  indicates a lower degree of randomness.

A better theoretical understanding of  would be great. If  and  is locally maximized, then we say that  is an LSRDR of . Said differently,  is an LSRDR of  if the similarity  is maximized.

Here, the notion of an LSRDR is a machine learning notion that seems to be much more interpretable and much less subject to noise than many other machine learning notions. But a solid mathematical theory behind LSRDRs would help us understand not just what LSRDRs do, but the mathematical theory would help us understand why they do it.
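
A minimal numerical sketch, offered under assumptions rather than as a quote of the definitions above (assuming the superoperator meant here is the completely positive map $\Phi(X)=A_1XA_1^*+\dots+A_rXA_r^*$ and the $L_2$-spectral radius of $(A_1,\dots,A_r)$ is $\rho(\Phi)^{1/2}$):

```python
import numpy as np

def l2_spectral_radius(matrices):
    # Assumed definition: rho_2(A_1, ..., A_r) = rho(Phi)^(1/2), where
    # Phi(X) = sum_k A_k X A_k^* is the completely positive superoperator.
    # With column-major vectorization, vec(A X A^*) = (conj(A) kron A) vec(X),
    # so Phi is represented by the n^2 x n^2 matrix sum_k kron(conj(A_k), A_k),
    # which has the same spectrum as Phi.
    n = matrices[0].shape[0]
    phi = np.zeros((n * n, n * n), dtype=complex)
    for a in matrices:
        a = np.asarray(a, dtype=complex)
        phi += np.kron(np.conj(a), a)
    return np.sqrt(np.max(np.abs(np.linalg.eigvals(phi))))

# Example: the L_2-spectral radius of a tuple of random 4x4 matrices.
rng = np.random.default_rng(0)
tuple_of_matrices = [rng.standard_normal((4, 4)) for _ in range(3)]
print(l2_spectral_radius(tuple_of_matrices))
```

An actual LSRDR would then be found by numerically maximizing the similarity between such a tuple and a tuple of smaller $d \times d$ matrices; that optimization is not attempted here.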

Problems in random matrix theory concerning LSRDRs:

  1. Suppose that  are random matrices (according to some distribution). Then what are some bounds for .
  2. Suppose that  are random matrices and  are non-random matrices. What can we say about the spectrum of ? My computer experiments indicate that this spectrum satisfies the circular law, and the radius of the disc for this circular law is proportional to , but a proof of this circular law would be nice.
  3. Tensors can be naturally associated with collections of matrices. Suppose now that  are the matrices associated with a random tensor. Then what are some bounds for .

P.S. By massively downvoting my posts where I talk about mathematics that is clearly applicable to AI interpretability and alignment, the people on this site are simply demonstrating that they need to do a lot of soul searching before they annoy people like Terry Tao with their lack of mathematical expertise.

P.P.S. Instead of trying to get a high profile mathematician like Terry Tao to be interested in problems, it may be better to search for a specific mathematician who is an expert in a specific area related to AI alignment since it may be easier to contact a lower profile mathematician, and a lower profile mathematician may have more specific things to say and contribute. You are lucky that Terry Tao is interested in random matrix theory, but this does not mean that Terry Tao is interested in anything in the intersection between alignment and random matrix theory. It may be better to search harder for mathematicians who are interested in your specific problems.

P.P.P.S. To get more mathematicians interested in alignment, it may be a good idea to develop AI systems that behave much more mathematically. Neural networks currently do not behave very mathematically since they look like the things that engineers would come up with instead of mathematicians. 

P.P.P.P.S. I have developed the notion of an LSRDR for cryptocurrency research because I am using this to evaluate the cryptographic security of cryptographic functions.

answer by trevor (Trevor1) · 2022-06-08T22:55:52.517Z · LW(p) · GW(p)

I have heard about the idea where you commit to a $100m reward for any ML researcher or mathematician who solves alignment, and simultaneously pay 100 top ML researchers and mathematicians $1m each over the course of a year to do nothing but pursue a solution to alignment (pursuing the bounty in the process). Even if all 100 of them fail, you have still selected the best 100 out of every mathematician who applied for those positions, so a large proportion of them might pursue the problem on their own afterwards in pursuit of the ongoing $100 million bounty. One way or another, many of these influential people will be convinced that the problem is significant and tell their friends, or even contract their friends as consultants to help with the problem.

There's plenty of trust issues, going both ways, but I'm not a grantmaker or lawyer and I think some smart, experienced people could probably figure out how to mitigate most of them.

answer by Charbel-Raphaël (Raphaël S) · 2022-06-08T22:49:21.511Z · LW(p) · GW(p)

I really want this to happen.

And why stop at Terry Tao? We could also email other top mathematicians and physicists.

answer by mukashi · 2022-06-08T22:35:21.898Z · LW(p) · GW(p)

I would put that in a Google Doc; it will be easier to suggest changes, etc.

27 comments


comment by Rob Bensinger (RobbBB) · 2022-06-11T07:01:47.372Z · LW(p) · GW(p)

I've seen a lot of discussion of Terence Tao, but not much of 'the overall set of young people who have done impressive technical work but are dramatically less busy and famous'. I also see a lot of discussion of people who are already socially adjacent to the community or have had multiple conversations about AI risk. I'd expect more value from poking people who are less visible, more available, and who haven't spent as much time thinking or talking about AI risk already.

comment by Shmi (shminux) · 2022-06-09T04:14:33.640Z · LW(p) · GW(p)

Note that people of Tao's level and prominence likely receive extreme quantities of unsolicited email, which goes unread by the intended recipient; possibly not even the subject line gets read. Put yourself in the place of Tao's assistant: why would they let something like what you intend to send through?

Replies from: greg-colbourn
comment by Greg C (greg-colbourn) · 2022-06-09T08:55:16.172Z · LW(p) · GW(p)

Yes, I think the email needs to come from someone with a lot of clout (e.g. a top academic, or a charismatic billionaire; or even a high-ranking government official) if we actually want him to read it and take it seriously.

Replies from: greg-colbourn, greg-colbourn
comment by Greg C (greg-colbourn) · 2022-06-09T08:58:46.032Z · LW(p) · GW(p)

Maybe reaching Demis Hassabis first is the way to go though, given that he's already thinking about it, and has already mentioned it to Tao (according to the podcast). Does anyone have links to Demis? Would be good to know more about his "Avengers assemble" plan! The main thing is that the assembly needs to happen asap, at least for an initial meeting and "priming of the pump" as it were. 

Replies from: P.
comment by P. · 2022-06-09T11:29:11.360Z · LW(p) · GW(p)

Do you mean website links about his plan? I found nothing.

I’m still not changing the deadlines but I’ve received information that made me want to change the order.

Replies from: greg-colbourn
comment by Greg C (greg-colbourn) · 2022-06-09T11:33:03.567Z · LW(p) · GW(p)

No, I mean personal links to him, to talk to him (or for that matter, even an email address or any way of contacting him...).

comment by Greg C (greg-colbourn) · 2022-06-09T09:49:37.481Z · LW(p) · GW(p)

Should also say - good that you are thinking about it P., and thanks for a couple of the links which I hadn't seen before.

comment by Chris_Leong · 2022-06-09T03:08:36.113Z · LW(p) · GW(p)

I'd suggest reaching out to CEA's community health team before sending the email.

Replies from: Raemon, P., adamzerner, P.
comment by Raemon · 2022-06-09T05:57:08.072Z · LW(p) · GW(p)

Why CEA in particular, and why their community health team in particular? I think it's good to get feedback before sending this email (I wouldn't send the emails in their current form), but I don't think of those groups as particularly relevant for how to contact Terry Tao or Demis.

Replies from: Chris_Leong
comment by Chris_Leong · 2022-06-09T06:02:33.602Z · LW(p) · GW(p)

My understanding was that they were the team to talk to within EA if you're thinking about doing outreach to high-net-worth individuals, politicians, projects related to children, or projects involving famous people, like this one.

comment by P. · 2022-06-09T12:45:01.729Z · LW(p) · GW(p)

Ok, I sent them an email.

comment by Adam Zerner (adamzerner) · 2022-06-09T05:04:25.213Z · LW(p) · GW(p)

What does CEA stand for?

Replies from: Chris_Leong
comment by Chris_Leong · 2022-06-09T05:59:33.751Z · LW(p) · GW(p)

Center for Effective Altruism.

comment by P. · 2022-06-09T10:14:44.026Z · LW(p) · GW(p)

I might try that, but “community health” is not really what I’m optimising for. Maybe the name is misleading?

Replies from: Chris_Leong
comment by Chris_Leong · 2022-06-09T11:38:47.380Z · LW(p) · GW(p)

I don't think it's that surprising.  Tasks in an organisation will naturally go to the team that has the most relevant experience.

comment by Adam Zerner (adamzerner) · 2022-06-10T23:56:58.998Z · LW(p) · GW(p)

It just occurred to me that Terry Tao is probably one of the less promising people to pursue here. He's more of a popular, public-figure type. Which, I assume, means that he is contacted by more people, and thus is less receptive to things like this. Whereas there are surely other people who are similarly smart but way less popular amongst the general public. And with a quick email to, say, a PhD student in math, it'd probably be pretty easy to find out who these people are.

comment by juliawise · 2022-06-11T12:18:58.910Z · LW(p) · GW(p)

An EA contacted me who knows Kontsevich and is considering reaching out to him. If you want to coordinate with that person, let me know and I can put you in touch.

Replies from: P.
comment by P. · 2022-06-11T17:29:51.343Z · LW(p) · GW(p)

Please do! You can DM me their contact info, tell them about my accounts: either this one or my EA Forum one [EA · GW], or ask me for my email address.

comment by wunan · 2022-06-18T16:03:48.064Z · LW(p) · GW(p)

I'm not sure about the timing of when the edits in your post were made, but if you want feedback about your planned contact with Demis Hassabis I think you should make a new post about it -- most people who would comment on it may have missed it because they only saw the original unedited post about Tao, which had already received feedback.

I also think that, for the same reason that you chose to let someone else contact Tao instead of you, it may be better to let someone else contact Hassabis (or find someone else to contact him).

Replies from: P.
comment by P. · 2022-06-18T21:07:31.667Z · LW(p) · GW(p)

The email to Demis has been there since the beginning, I even received feedback on it [LW(p) · GW(p)]. I think I will send it next week, but will also try to get to him through some DeepMind employee if that doesn’t work.

Replies from: ricraz
comment by Richard_Ngo (ricraz) · 2022-06-20T02:17:55.240Z · LW(p) · GW(p)

Please don't send these types of emails, I expect that they're actively counterproductive for high-profile recipients.

If you want to outreach, there are clear channels which should be used for coordinating it. For example, you could contact DeepMind's alignment team, and ask them if there's anything which would be useful for them.

comment by papetoast · 2023-09-29T08:14:43.814Z · LW(p) · GW(p)

Any updates? I skimmed through the comments and answers, but it seems that we just know that someone planned to contact Terence Tao, and no results have been reported back.

comment by Adrià Garriga-alonso (rhaps0dy) · 2022-06-11T06:29:25.442Z · LW(p) · GW(p)

Hey P. Assuming Demis Hassabis reads your email and takes it seriously, why won’t his reaction be “I already have my alignment team, Shane Legg took care of that” ?

DeepMind has had an alignment team for a long time.

Replies from: P.
comment by P. · 2022-06-11T17:17:41.897Z · LW(p) · GW(p)

Well, if he has, unbeknownst to me, already hired the “Terence Taos of the world” like he said on the podcast, that would be great, and I would move on to other tasks. But if he only has a regular alignment team, I don’t think either of us considers that to be enough. I’m just trying to convince him that it’s urgent and we can’t leave it for later.

comment by ClipMonger · 2022-12-02T09:22:21.268Z · LW(p) · GW(p)

Is this still feasible now?

Replies from: adrian-arellano-davin
comment by mukashi (adrian-arellano-davin) · 2022-12-02T12:05:59.272Z · LW(p) · GW(p)

Why? What happened?

Replies from: gjm
comment by gjm · 2022-12-02T12:45:16.211Z · LW(p) · GW(p)

I assume CM means because of the FTX collapse, which means there is no longer such a big pile of money sloshing around the AI alignment community.