Posts
Comments
m - often used together with n to denote the height and width of a matrix
At first I disbelieved. I thought A > B. Then I wrote code myself and checked, and got that B > A. I believed this result. Then I thought about it and realized why my reason for A > B was wrong. But I still didn't understand (and now I don't understand either) why the described random process is not equivalent to randomly choosing 2, 4, or 6 every roll. I thought some more and now I have some doubts. My first doubt is whether there exists some kind of standard way of describing random processes and conditioning on them, and whether the problem as stated by notfnofn. Perhaps the problem is just underspecified? Anyway, this is very interesting.
If you think you might be in a solipsist simulation, you might try to add some chaotic randomness to your decisions. For example, go outside under some trees and wait till any kind of tree leaf or seed or anything hits your left half of the face, choose one course of action. If it hits the other half of your face, choose another course of action. If you do this multiple times in your life, each of your decisions will depend on the state of the whole earth and on all your previous decisions, since weather is chaotic. And thus the simulators will be unable to get good predictions about you using a solipsist simulation. A potential counterargument is that they analyze your thinking and hardcode this binary random choice, i.e. hardcode the memory of the seed hitting your left side. But then there would need to be an intelligent process analyzing your thinking to try and isolate the randomness. But then you could make the dependence of your strategy on randomness even more complicated.
Nice. I have a suggestion how to improve the article. Put a clearly stated theorem somewhere in the middle, in its own block, like in academic math articles.
Why do you hate earworms? To me, they are mildly pleasant. The only moments when I wish I didn’t have an earworm happening at that moment is when I’m trying to remember another tune and the earworm for musicianship purposes and the earworm prevents me from being able to do that.
Instead of inspecting all programs in the UP, just inspect all programs with length less than n. As n becomes larger and larger, this covers more and more of the total probability mass in the up and the total probability mass covered this way approaches 1. What to do about the non-halting programs? Well, just run all the programs for m steps, I guess. I think this is the approximation of UP that is implied.
Well, now I'm wondering - is neural network training chaotic?
This is awesome, I would love more posts like this. Out of curiosity, how many hours have you and your colleague spent on this research.
In my personal experience, exposure therapy did help me with the fear of such "extreme" risks.
In the very beginning of the post, I read: "Quick psychology experiment". Then, I read: "Right now, if I offered you a bet ...". Because of this, I thought about a potential real life situation, not a platonic ideal situation, that the author is offering me this bet. I declined both bets. Not because they are bad bets in an abstract world, but because I don't trust the author in the first bet and I trust them even less in the second bet.
If you rejected the first bet and accepted the second bet, just that is enough to rule you out from having any utility function consistent with your decisions.
Under this interpretation, no it doesn't.
Could you, the author, please modify the thought experiment to indicate that it is assumed that I completely trust the one who is proposing the bet to me? And, maybe discuss other caveats too. Or just say that it's Omega who's offering me the bet.
So you say humans don't reason about the space and objects around them by keeping 3d representations. You think that instead the human brain collects a bunch of heuristics what the response should be to a 2d projection of 3d space, given different angles - an incomprehhensible mishmash of neurons like in an artificial neural network that doesn't have any CNN layers for identifying the digit by image, and just memorizes all rules for all types of pictures with all types of angle like a fully connected layer.
I guess I was not clear enough. In your original post, you wrote "On one hand, there are countably many definitions ..." and "On the other hand, Cantor's diagonal argument applies here, too. ...". So, you talked about two statements - "On one hand, (1)", "On the other hand, (2)". I would expect that when someone says "One one hand, ..., but on the other hand, ...", what they say in those ellipses should contradict each other. So, in my previous comment, I just wanted to point out that (2) does not contradict (1) because countable infinity + 1 is still countable infinity.
take all the iterations you need, even infinitely many of them
Could you clarify how I would construct that?
For example, what is the "next cardinality" after countable?
I didn't say "the next cardinality". I said "a higher cardinality".
Ok, so let's say you've been able to find a countably infinite amount of real numbers and you now call them "definable". You apply the Cantor's argument to generate one more number that's not in this set (and you go from the language to the meta language when doing this). Countably infinite + 1 is still only countably infinite. How would you go to a higher cardinality of "definable" objects? I don't see an easy way.
To check if A causes B, you can check what happens when you intervene and modify A, and also what happens when you intervene and modify B. That's not always possible though. You can consult "Causality: Models, Reasoning, and Inference" by Pearl for more details.
They commit to not using your data to train their models without explicit permission.
I've just registered on their website because of this article. During registration, I was told that conversations marked by their automated system that overlooks if you are following their terms of use are regularly overlooked by humans and used to train their models.
When learning to sing, humming is used to extend your range higher. Not sure if it's used to extend it lower.
Replied in PM.
I would like to make a recommendation to Johannes that he should try to write and post content in a way that invokes less feelings of cringe in people. I know it does invoke that because I personally feel cringe.
Still, I think that there isn’t much objectively bad about this post. I’m not saying the post is very good or convincing. I think its style is super weird but that should be considered to be okay in this community. These thoughts remind me of something Scott Alexander once wrote - that sometimes he hears someone say true but low status things - and his automatic thoughts are about how the person must be stupid to say something like that, and he has to consciously remind himself that what was said is actually true.
Also, all these thoughts about this social reality sadden me a little - why oh why is AI safety such a status-concerned and “serious business” area nowadays?
I've been learning to play diatonic harmonica for the last 2 years. This is my first instrument and I can confirm that learning an instrument (and music theory) is a lot of fun and it has also taught me some new things about how to learn things in general.
I hum all the time anyway.
Unless I don’t recognize the sounds. It’s like asking me to beatbox the last 5 seconds of the gurgling of a nearby river. How the fudge would I do that?
Wait, are there people who can do that?
I think that's pretty easy :)
Please go, study math fundamentals properly, and then come back. What you wrote doesn't make much sense.
I think this last edit is bad.
Is there any "native" textbook that is pragmatic and explains how to use bayesian in practice (perhaps in some narrow domain)?
Did the model randomly stumble upon this strategy? Or was there an idea pitched by the language model, something like "hey, what if we try to hallucinate and maybe we can hack the game that way"?
Are you able to play sounds using other programs (e.g. open a YouTube video in the background) while getting great latency in reaper or in something similar to reaper?
I've been thinking of buying an M1 MacBook because everyone says that Apple's sound system is great and works out of the box correctly with low latency and no problems, unlike Windows+Wasapi, Windows+ASIO, and Linux. I want to use it for music stuff without an external audio interface. How true is this and would you recommend it?
You says Vast.AI is the "most reliable provider". In my experience, it's an unreliable mess with sometimes buggy not properly working servers and non-existent support service. I will also say the same about runpod.io. On the other hand, lambdalabs had been very reliable in my experience and has a much better UX. The main problem with LambdaLabs is that nowadays it happens pretty often that it has no available servers.
This sounds similar to whether a contemporary machine learning model can break a cryptographic cipher, a hash function, or something like that.
Can you formulate the theorem statement in a precise and self-sufficient way that is usually used in textbooks and papers so that a reader can understand it just by reading it and looking up the used definitions?
I have a kinda-unrelated question. Does Bill Gates write gatesnotes completely himself just because he wants? Or is this a marketing/pr thing and is written by other people? If it's the former, then I want to read it. If it's the latter, I don't.
Do you mean "What do you want me to do" in the tone of voice that means "There's nothing to do here, bugger off"? Or do you mean "What do you want me to do?" in the tone of voice that means "I'm ready to help with this. What should I do to remedy the problem?"?
I have recently read The Little Typer by Friedman and Christiansen. I suspect that this book can serve as an introduction similarly to this (planned, so far) sequence of posts. However, the book is not concise at all.
Are those instructions for making a Molotov cocktail and for hotwiring a car real? They look like something someone who's only seen it done in movies would do. Same question for methamphetamine, except that recipe looks more plausible.
Thanks for writing this update! I think your English skills have improved a lot.
I've just read your previous two posts. I, too, will be interested to read another post of yours.
I am (was) an X% researcher, where X<Y. I wish I had given up on AI safety earlier. I suspect it would've been better for me if AI safety resources explicitly said things like "if you're less than Y, don't even try", although I'm not sure if I would've believed them. Now, I'm glad that I'm not trying to do AI safety anymore and instead I just work at a well paying relaxed job doing practical machine learning. So, I think pushing too many EAs into AI safety will lead to those EAs suffering much more, which happened to me, so I don't want that to happen and I don't want the AI Alignment community to stop saying "You should stay if and only if you're better than Y".
Actually, I wish there were more selfish-oriented resources for AI Alignment. Like, with normal universities and jobs, people analyze how to get into them, have a fulfilling career, earn good money, not burn out, etc. As a result, people can read this and properly analyze if it makes sense for them to try to get into jobs or universities for their own food. But with a career in AI safety, this is not the case. All the resources look out not only for the reader, but also for the whole EA project. I think this can easily burn people.
I still take these zinc lozenges when I suspect that I might fall with a common cold. I feel like they help me somewhat. Maybe my colds have been shorter since I've started taking Zinc but I'm not sure. I haven't been tracking any data explicitly. I guess I'm gonna be taking Zinc for common cold as long as I don't get further evidence about it not working.
Perhaps you can just use the international phonetic alphabet?
I don't know how to square that with the idea that one shouldn't ignore their crying kids. I have no idea how kids' crying at night works. Is it possible that a parent should just suck it up and come and comfort the baby every time they cry? Maybe you can comfort her since she's crying but not give her the reward of soothing her until she falls asleep? Is it possible that she cries at night because she's doesn't get enough cuddles during the day or because the room looks scary or something like that? I don't know enough about the situation and I don't have any kids of my own and don't have any practical experience of dealing with them. Maybe you can be there with her in her sleeping room when she cries but still make it so that she learns to self-soothe and put herself to sleep? Like, idk, stay with her but don't rock her to sleep or something like that.
Ok, I don't know more than that about addressing children's crying. I just thought that ignoring it is (almost always?) bad but I'm not sure.
I'm not sure how to read this; where are you on the continuum from "I heard it's bad" to "I read all the papers and came to a deep considered view"?
I also thought so when I read your post. I'm at the "The book 'The Boy Who Was Raised as a Dog' says so" point. The book is not about sleep in particular, it's about psychological trauma in childhood, especially the one obtained from neglect.
Also, I think this might cause the child to develop either an avoidant attachment style (there's no point in crying or asking others for help, they won't come anyway).
I also don't know how to find tutors for narrow subjects. For instance, I would like a little bit of tutoring about
- panoptic segmentation
- dependent types
but I don't know how to find one.
The link to the next post in this post is broken.
Is this the beginning of Friendship is Optimal?
What role do I, the data scientist dwarf, have?
In the first part, the two respective properties of the two definitions of chaaness you mentioned apply after rescaling and shifting of utility functions is done, right? I.e., the properties actually say "after rescaling and shifting the points, if you move the Pareto-frontier points for a player up, they should get more utility" and "untaken options are irrelevant if you don't change the scale after removing them". Now, I don't see why these properties are interesting and what they correspond to in real life. In contrast, if they applied before rescaling and shifting, then they would be quite interesting. So, can you please elaborate why they are interesting as they are and what they actually mean as they are?
I just want to say that your described solution to "Problem 1: Differentiating effective interventions from unfalsifiable woo" suggests to me that your curriculum would be mostly useless for me, and maybe for many other people as well, because it won't go deep enough. I think either I've already gotten everything I can get from shallow interventions "like better nutrition, using your speaking voice more effectively, improving your personal financial organization, emergency preparedness, and implementing a knowledge management system", or they were never that good in the first place. Personally, I am focusing on psychotherapy right now. It's unfortunate that it consists mostly of borderline-unfalsifiable woo but that's all we've got.
My solution:
I choose Radiant Splendor and Enlightenment simply because out of all champions with personality like mine, it had the highest win frequency. And it even has a solid number of samples - 244. Basically, I narrowed down the dataset to only rows with the same personality like mine. Perhaps I could get some more info from other rows, but that would require spending more time.