LessWrong 2.0 Reader

Caution when interpreting Deepmind's In-context RL paper
Sam Marks (samuel-marks) · 2022-11-01T02:42:06.766Z · comments (6)
EA & LW Forums Weekly Summary (24 - 30th Oct 22')
Zoe Williams (GreyArea) · 2022-11-01T02:58:09.914Z · comments (1)
ML Safety Scholars Summer 2022 Retrospective
ThomasW (ThomasWoodside) · 2022-11-01T03:09:10.305Z · comments (0)
Conversations on Alcohol Consumption
Annapurna (jorge-velez) · 2022-11-01T05:09:34.374Z · comments (6)
[link] Remember to translate your thoughts back again
brook · 2022-11-01T08:49:12.812Z · comments (11)
Auditing games for high-level interpretability
Paul Colognese (paul-colognese) · 2022-11-01T10:44:07.630Z · comments (1)
Clarifying AI X-risk
zac_kenton (zkenton) · 2022-11-01T11:03:01.144Z · comments (24)
Threat Model Literature Review
zac_kenton (zkenton) · 2022-11-01T11:03:22.610Z · comments (4)
[link] a casual intro to AI doom and alignment
Tamsin Leake (carado-1) · 2022-11-01T16:38:31.230Z · comments (0)
On the correspondence between AI-misalignment and cognitive dissonance using a behavioral economics model
Stijn Bruers · 2022-11-01T17:39:10.433Z · comments (0)
[link] Progress links and tweets, 2022-11-01
jasoncrawford · 2022-11-01T17:48:45.562Z · comments (4)
Mildly Against Donor Lotteries
jefftk (jkaufman) · 2022-11-01T18:10:06.458Z · comments (9)
Open & Welcome Thread - November 2022
MondSemmel · 2022-11-01T18:47:40.682Z · comments (46)
AI as a Civilizational Risk Part 4/6: Bioweapons and Philosophy of Modification
PashaKamyshev · 2022-11-01T20:50:54.078Z · comments (1)
[question] Which Issues in Conceptual Alignment have been Formalised or Observed (or not)?
ojorgensen · 2022-11-01T22:32:25.243Z · answers+comments (0)
All AGI Safety questions welcome (especially basic ones) [~monthly thread]
Robert Miles (robert-miles) · 2022-11-01T23:23:04.146Z · comments (105)
[link] Real-Time Research Recording: Can a Transformer Re-Derive Positional Info?
Neel Nanda (neel-nanda-1) · 2022-11-01T23:56:06.215Z · comments (16)
Sequence Reread: Fake Beliefs [plus sequence spotlight meta]
Raemon · 2022-11-02T00:09:11.755Z · comments (3)
Information Markets
eva_ · 2022-11-02T01:24:11.639Z · comments (6)
Why is fiber good for you?
braces · 2022-11-02T02:04:35.579Z · comments (2)
AI Safety Needs Great Product Builders
goodgravy · 2022-11-02T11:33:59.283Z · comments (2)
Mind is uncountable
Filip Sondej · 2022-11-02T11:51:52.050Z · comments (22)
Housing and Transit Thoughts #1
Zvi · 2022-11-02T12:10:00.575Z · comments (5)
[link] Far-UVC Light Update: No, LEDs are not around the corner (tweetstorm)
Davidmanheim · 2022-11-02T12:57:23.445Z · comments (27)
Humans do acausal coordination all the time
Adam Jermyn (adam-jermyn) · 2022-11-02T14:40:39.730Z · comments (35)
"Are Experiments Possible?" Seeds of Science call for reviewers
rogersbacon · 2022-11-02T20:05:17.334Z · comments (0)
[question] Is there a good way to award a fixed prize in a prediction contest?
jchan · 2022-11-02T21:37:45.111Z · answers+comments (5)
AI as a Civilizational Risk Part 5/6: Relationship between C-risk and X-risk
PashaKamyshev · 2022-11-03T02:19:46.847Z · comments (0)
Lazy Python Argument Parsing
jefftk (jkaufman) · 2022-11-03T02:20:05.466Z · comments (3)
Open Letter Against Reckless Nuclear Escalation and Use
Max Tegmark (MaxTegmark) · 2022-11-03T05:34:44.529Z · comments (23)
The Mirror Chamber: A short story exploring the anthropic measure function and why it can matter
mako yass (MakoYass) · 2022-11-03T06:47:56.376Z · comments (13)
The Rational Utilitarian Love Movement (A Historical Retrospective)
CBiddulph (caleb-biddulph) · 2022-11-03T07:11:28.679Z · comments (0)
Information Markets 2: Optimally Shaped Reward Bets
eva_ · 2022-11-03T11:08:49.126Z · comments (0)
K-types vs T-types — what priors do you have?
Cleo Nardo (strawberry calm) · 2022-11-03T11:29:00.809Z · comments (25)
[link] Adversarial Policies Beat Professional-Level Go AIs
sanxiyn · 2022-11-03T13:27:00.059Z · comments (35)
Covid 11/3/22: Asking Forgiveness
Zvi · 2022-11-03T13:50:00.448Z · comments (3)
Multiple Deploy-Key Repos
jefftk (jkaufman) · 2022-11-03T15:10:03.820Z · comments (0)
Why do we post our AI safety plans on the Internet?
Peter S. Park · 2022-11-03T16:02:21.428Z · comments (4)
A Mystery About High Dimensional Concept Encoding
Fabien Roger (Fabien) · 2022-11-03T17:05:56.034Z · comments (13)
AI as a Civilizational Risk Part 6/6: What can be done
PashaKamyshev · 2022-11-03T19:48:52.376Z · comments (4)
Further considerations on the Evidentialist's Wager
Martín Soto (martinsq) · 2022-11-03T20:06:31.997Z · comments (9)
[Video] How having Fast Fourier Transforms sooner could have helped with Nuclear Disarmament - Veritaserum
mako yass (MakoYass) · 2022-11-03T21:04:35.839Z · comments (1)
[question] Could a Supreme Court suit work to solve NEPA problems?
ChristianKl · 2022-11-03T21:10:48.344Z · answers+comments (0)
Mechanistic Interpretability as Reverse Engineering (follow-up to "cars and elephants")
David Scott Krueger (formerly: capybaralet) (capybaralet) · 2022-11-03T23:19:20.458Z · comments (3)
[question] Don't you think RLHF solves outer alignment?
Charbel-Raphaël (charbel-raphael-segerie) · 2022-11-04T00:36:36.527Z · answers+comments (23)
[question] Are alignment researchers devoting enough time to improving their research capacity?
Carson Jones · 2022-11-04T00:58:21.349Z · answers+comments (3)
A newcomer’s guide to the technical AI safety field
zeshen · 2022-11-04T14:29:46.873Z · comments (3)
A new place to discuss cognitive science, ethics and human alignment
Daniel_Friedrich (Hominid Dan) · 2022-11-04T14:34:15.632Z · comments (4)
Weekly Roundup #4
Zvi · 2022-11-04T15:00:01.096Z · comments (1)
Monthly Shorts 10/22
Celer · 2022-11-04T16:30:07.616Z · comments (0)
next page (older posts) →