Posts
Comments
There is already a lot of automatic censoring happening. I am unsure how much LLMs add on top of existing and fairly successful techniques from spam filtering. And just using LLMs is probably prohibitive at the scale of social media (definitely for tech companies, maybe not for governments), but perhaps you can get an edge for some use-case with them.
But I (and I think others on LW team although for slightly different reasons) have been thinking about building a feature directly into LW to facilitate it.
Maybe consider making it super easy (one click easy) to export LW posts to google docs?
ACX is probably a better reference class: https://astralcodexten.substack.com/p/2023-subscription-drive-free-unlocked. In Jan, ACX had 78.2k readers, of which 6.0k subscribers for a 7.7% subscription rate.
Consider https://thepeteplan.wordpress.com/beginner-training/.
I think it might be good to normalize "just try stuff until they fix your condition" as one of the treatment strategies. I guess it's a bit ironic that Dr. Spray-n-pray's indifference toward which pill worked and why seems so epistemically careless, while actually maybe being a correct way to orient towards success when you optimize for luck and have little reliable information.
- Russian military doctrine allows the usage of nuclear weapons to defend Russian territory.
This is ~false. See: https://forum.effectivealtruism.org/posts/TkLk2xoeE9Hrx5Ziw/nuclear-attack-risk-implications-for-personal-decision?commentId=ukEznwTnD78wFdZip#ukEznwTnD78wFdZip
See:
Here is a google sheet.
I want to mention that Tsvi Benson-Tilsen is a mentor at this summer's PIBBSS. So some readers might consider applying (the deadline is Jan 23rd).
I myself was mentored by Abram Demski once through the FHI SRF, which AFAIK was matching fellows with a large pull of researchers based on mutual interests.
I am looking for text-to-speech tools for various contexts. As of now, I am using
- @Voice Aloud Reader (TTS Reader) and a custom script to extract articles from webpages for Android (supports .epub and .pdf as well);
- Capti Voice on my desktop for everything.
I would appreciate it if the ToC linked to the web versions of the essay.
A follow-up (h/t LW review). I got quite a bit out of the workshop, most importantly
- I found a close friend and collaborator, whom I don't think I would have met otherwise.
- I found a close friend and co-founder, whom I was likely to meet otherwise, but it's unlikely that we would have a good enough bond by covid-times.
There was much more but much less legible and "evaluatable." I think ESE was excellent, and I would have done it even if I knew that I wouldn't get two close friendships out of it.
Or, to change tack: the operating budget of the LessWrong website has historically been ~$600k, and this budget is artificially low because the site has paid extremely below-market salaries. Adjusting for the market value of the labor, the cost is more like $1M/year, or $2,700/day. If I assume LessWrong generates more value than the cost required to run it, I estimate that the site provides at least $2,700/day in value, probably a good deal more.
I think this estimate is mistaken because it ignores marginalism: basically, the cost of disabling LW for a year is much larger than 365 * the cost of disabling LW for only a day. The same goes for disabling the whole website vs. disabling only the frontpage.
(Sorry for adding salt to hurt feelings; posting because impact evaluation of longtermism projects is important.)
Maybe reading Gelman's self-contained comments on SSC's More Confounders would make you more confused in a good way.
Hey! Could you say more about a causal link between Sequences and writing these papers, please:
I was able to do from muscle memory certain calculations about conditional probability and expectation that might have taken weeks otherwise (if we figured them out at all). I attribute this ability in large part to reading the Sequences.
I think my confusion comes from (a) having enough math background (read some chapters of The Probabilistic Method yers ago); (b) while reading Sequences and more so AF discussions added to my understanding of formal epistemology, I am surprised that your emphasis how Sequences affected your muscle memory and ability to do calculations.
As this answer got upvoted, I collected some Dubna's courses read in English, for which recordings are available (look for "Доступны 4 видеозаписи курса.")
- https://web.evanchen.cc/napkin.html
- https://t.me/mathtabletalks (in Russian)
- https://www.mccme.ru/dubna/eng/ (a summer school aimed at teaching advanced topic to high schoolers and early undergrads in Russia; there are hundreds of recordings and a dozen of edited notes from the school available in Russian)
- http://math.jacobs-university.de/summerschool/ (European version of above; don't know much about them adding just in case)
Metaculus 2020 U.S. Election Risks Survey doesn't give >1% for >5000 deaths, but I think it is justified to infer something like that from it:
While large-scale violence and military intervention to quell civil unrest seem unlikely, experts still judged these possibilities to be far from remote. Experts predicted a median of 60 deaths occurring due to election-related violence, with an 80% confidence interval of 0 to 912 fatalities that reflects a high degree of uncertainty. Still, the real possibility of violence is a notable departure from the peaceful transitions that have been the hallmark of past U.S. elections. Results indicate an 8% probability of over 1,000 election-related deaths — suggesting that while widespread sustained clashes are unlikely, this possibility warrants real concern. Experts assigned a 10% median prediction that President Trump will invoke the Insurrection Act to mobilize troops during the transition period.
A better example: one might criticize CDC for lack of advice aimed at the vulnerable demographics. But absence might result not from lack of judgment but from political constraints. E.g. jimrandomh writes:
Addendum: A whistleblower claims that CDC wanted to advise elderly and fragile people to not fly on commercial airlines, but removed this advice at the White House's direction.
Upd: this might be indicative of other negative characteristics of CDC (which might contribute to unreliability) but I don't know enough about the US gov to asses it.
While for me it is, indeed, a reason to put less weight on their analysis or expect less useful work/analysis to be done by them in a short/medium-term.
But I think this consideration, also, weakens certain types of arguments about the CDC's lack of judgment/untrustworthiness. For example, arguments like "they did this, but should have done better" loses part of its bayesian weight as the organization likely made a lot of decisions under time pressure and other constraints. And things are more likely to go wrong if you're under-stuffed and hence prioritize more aggressively.
I don't expect to have a good judgment here, but it seems to me that "testing kits the CDC sent to local labs were unreliable" might fall here. It might have been a right call for them to distribute tests quickly and ~skip ensuring that tests didn't have a false positive problem.
Unless there are large enough demographics for which this post looks credible while FB conspiracies do not.
If the only issue is tone, you could write something like: 'Initially, I was confused/surprised by the core claim you made but reading this, this, and that [or thinking for 15 minutes/further research] made me believe that your position is basically correct'. This looks quite
[...] "Yes, you are correct about that" comes across as quite arrogant [...]
I attended Epistea Summer Experiment and greatly enjoyed it. (At the same time I am quite skeptical about value of any rationality workshops for EA-inspired work.)
I think Nuno's time-capped analysis is good.
Thanks for the post. I would recommend reading the original blog post by Noam Brown as it has the proper level of exposition and more details/nuances.
Overall, it seems that Pluribus is conceptually very similar to Libratus; sadly, no new insights about >2-player games. My impression is that because poker players don't collude/cooperate too much, playing something close to an equilibrium against them will make you rich.
If one has 2 possible models to fit a data set, by how much should one penalize the model which has an additional free parameter?
Penalization might not be necessary if your learning procedure is stochastic and favors simple explanations. I encourage you to take a look on the nice poster/paper «Deep learning generalizes because the parameter-function map is biased towards simple functions» (PAC-Bayesian learning theory + empirical intuitions).
Rohin, thank you for the especially long and informative newsletter.
When there are more samples, we get a lower validation loss [...]
I guess you've meant a higher validation loss
?
Alexey, happy birthday to your podcast! I've just subscribed and hope you would post consistently in the future. How many subscribers do you have?
If you are curious why Russian chatroom is so big I encourage you to read about Kocherga. With 174 karma and 54 votes, it is the highest rated non-curated LW post at the moment.
I would like to highlight Russian LessWrong Slack, which has 2000+ registered users, ~150 WAU (among which ~50 are posting) and ~80 DAU (~25 are posting).
I augment Pomodoros with
EA Forum: Donating effectively is usually better than impact investing.
I am quite sure, that Moscow's LW will celebrate a Secular Solstice on 21 or 22 of Dec.
An example from Feynman's «The Character of Physical Law»:
The next guy who did something great was Maxwell, who obtained the laws of electricity and magnetism. What he did was this. He put together all the laws of electricity, due to Faraday and other people who came before him, and he looked at them and realized that they were mathematically inconsistent. In order to straighten it out he had to add one term to an equation. He did this by inventing for himself a model of idler wheels and gears and so on in space. He found what the new law was – but nobody paid much attention because they did not believe in the idler wheels. We do not believe in the idler wheels today, but the equations that he obtained were correct. So the logic may be wrong but the answer right.
Wikipedia page for 'Cognitive behavioral therapy for insomnia' is a great source of useful sleep related habits.
Divestment and mission hedging are examples of politically motivated finance activity. Divestment seems to be somewhat popular, but inefficient. Mission hedging is not well-known, but probably quite good.
A very successful crowdfunding for printing HPMoR has happened in Russia. 21k books are going to be printed: some of them will go to public/university libraries, some to gifted students. More good HPMoR related news are coming from Russia, but too early to announce them.
I think this paper, which models winner-takes-all, public knowledge situations (ex. the space race between the US and USSR) by «Guess Who?» game, is interesting formal model of the first half of this post.
“Guess Who?” is a popular two player game where players ask “Yes”/“No” questions to search for their opponent’s secret identity from a pool of possible candidates. This is modeled as a simple stochastic game. Using this model, the optimal strategy is explicitly found. Contrary to popular belief, performing a binary search is not always optimal. Instead, the optimal strategy for the player who trails is to make certain bold plays in an attempt catch up. This is discovered by first analyzing a continuous version of the game where players play indefinitely and the winner is never decided after finitely many rounds.
You are welcome! A general concern about the pace of scientific progress.
The most in-depth, but a bit outdated (c. 2012) article on sleep is written by Piotr Wozniak, whom you might know as a pioneer of spaced repetition software. The article is ~300 pages long. It includes summary & myths sections which are a bit longer than this post.
Somehow related papers in ML / DL:
- Keeping NN Simple by Minimizing the Description Length of the Weights (Hinton, 1997);
- Binarized Neural Networks (Courbariaux, 2016).
It seems to me, that Dacyn's code executes [stuff]
at least once for any n
. But iff n <= 0
, original while
loop does not execute its body. Dacyn's code looks like a do-while
loop.
I associate myself with the unconscious-self more and more (note: an unconscious-self is bigger than an elephant-self because some modules in a brain are deliberate & analytical, but not directly available to the verbal/conscious rider; I very much agree with @moridinamael's comment above).
Conscious-self seems more like press secretary for more hard-working unconscious-self, who is in charge of most of the decision-making. But, ugh, everyone experienced how «conscious ruled unconscious» (≈ will-power). I think the role of conscious-self in «the use of willpower» is to communicate from long-term modules to short-term modules of unconscious-self.
«Inner Game of Tennis» contains some recommendations on how to augment communication between the modules. I also found TDT-mindset helpful to tell early-evolved modules what later-evolved modules think is worth doing.
Youtube
I permanently blocked the website in all browsers I use. I use command line tool youtube-dl to download the videos I want/need to watch. This workflow gives me an option to watch videos (and also some friction to reevaluate the decision to watch a video); but prevents me from engaging with youtube, the risky game I might 'loss' otherwise.
I predict that a lot of people who would take rationalist lent's advice seriously would try to quite the same things and there are others who has hit on a good diet of experience that they could try to emulate. So It would be helpful to have a list of diets for quitting unwanted behaviour. Feel free to leave your recipes as a reply to this comment.
See also: the guide by Alex Vermeer is fruitful for reviewing the past and planning the following years in an analytical and systematic way.
Two interesting questions arise:
- could Alpha Zero beat the best human-computer team;
- would human-AZ team systematically beat AZ.
I think the answer to the first question is positive, but unfortunately, I couldn't make much sense of the available raw data on Freestyle chess, so my opinion is based on the marginal revolution blog-post. The negative answer to the second question might make some optimists about human-AI cooperation like Kasparov less optimistic.