Embarrassingly, that was a semi-unintended reaction: I would bet a small amount against that statement if someone gave me a resolution method, but I'm not motivated to figure one out. I realized this a second after reacting, and hadn't figured out how to remove the reaction by the time you made that comment. Sorry.
It sounds to me like the model is 'the candidate needs to have a (party-aligned) big blind spot in order to be acceptable to the extremists(/base)'. (Which is what you'd expect, if those voters are bucketing 'not-seeing A' with 'seeing B'.)
(Riffing off from that: I expect there's also something like, Motive Ambiguity-style, 'the candidate needs to have some, familiar/legible(?), big blind spot, in order to be acceptable/non-triggering to people who are used to the dialectical conflict'.)
if my well-meaning children successfully implement my desire never to die, by being uploaded, and "turn me on" like this with sufficient data and power backups but lack of care; or if something else goes wrong with the technicians involved not bothering to check if the upload was successful in setting up a fully virtualized existence complete with at least emulated body sensations, or do not otherwise check from time to time to ensure this remains the case;
These don't seem like plausible scenarios to me. Why would someone go to the trouble of running an upload, but be this careless? Why would someone running an upload not try to communicate with it at all?
A shell in a Matrioshka brain (more generally, a Dyson sphere being used for computation) reradiates 100% of the energy it captures, just at a lower temperature.
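For concreteness, a minimal energy-balance sketch (assuming the shell is a blackbody that radiates only from its outer surface):

$$L_\star = 4\pi R^2 \sigma T^4 \quad\Longrightarrow\quad T = \left(\frac{L_\star}{4\pi R^2 \sigma}\right)^{1/4}$$

For a Sun-like star and a shell at 1 AU, that works out to about 394 K: all the captured power comes back out, just at longer wavelengths.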
The AI industry people aren't talking much about solar or wind, and they would be if they thought it was more cost effective.
I don't see them talking about natural gas either, but nuclear or even fusion, which seems like an indication that whatever's driving their choice of what to talk about, it isn't short-term cost-effectiveness.
I doubt it (or at least, doubt that power plants will be a bottleneck as soon as this analysis says). Power generation/use varies widely over the course of a day and of a year (seasons), so the 500 GW number is an average, and generating capacity is overbuilt; this graph on the same EIA page shows generation capacity > 1000 GW and non-stagnant (not counting renewables, it declined slightly from 2005 to 2022 but is still > 800 GW):
This seems to indicate that a lot of additional demand[1] could be handled without building new generation, at least (and maybe not only) if it's willing to shut down at infrequent times of peak load. (Yes, operators will want to run as much as possible, but would accept some downtime if necessary to operate at all.)
This EIA discussion of cryptocurrency mining (estimated at 0.6% to 2.3% of US electricity consumption!) is highly relevant, and seems to align with the above. (E.g. it shows increased generation at existing power plants with attached crypto mining operations, mentions curtailment during peak demand, and doesn't mention new plant construction.)
[1] Probably not as much as implied by the capacity numbers, since some of that capacity is peaking plants and/or just old, meaning not only inefficient but sometimes limited by regulations in how many hours it can operate per year. But still.
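As a rough sanity check on the headroom implied above, some illustrative arithmetic (the 500 GW average and ~1000 GW capacity figures are the ones cited above; the 90% usable-capacity cap is my assumption, gesturing at the footnote's caveat):

```python
# Illustrative arithmetic only; inputs are the figures cited above.
avg_load_gw = 500      # average US generation/load, per the EIA figure
capacity_gw = 1000     # conservative reading of the capacity graph

print(f"Implied fleet-wide capacity factor: {avg_load_gw / capacity_gw:.0%}")

# Assume (my number, not EIA's) that only ~90% of nameplate capacity is
# really usable, given peaking plants, old units, and regulatory limits.
usable_gw = 0.9 * capacity_gw
print(f"Average headroom before new construction: ~{usable_gw - avg_load_gw:.0f} GW")
```

Even with the discount, that leaves on the order of hundreds of gigawatts of average headroom for flexible loads.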
Confidentiality: Any information you provide will not be personally linked back to you. Any personally identifying information will be removed and not published. By participating in this study, you are agreeing to have your anonymized responses and data used for research purposes, as well as potentially used in writeups and/or publications.
Will the names (or other identifying information if it exists, I haven't taken the survey) of the groups evaluated potentially be published? I'm interested in this survey, but only willing to take it if there's a confidentiality assurance for that information, even independent of my PII. (E.g., I might want to take it about a group without potentially contributing to public association between that group and 'being a cult'.)
The hypothetical bunker people could easily perform the Cavendish experiment to test Newtonian gravity, there just (apparently) isn't any way they'd arrive at the hypothesis.
As a counterpoint, I use Firefox as my primary browser (I prefer a bunch of little things about its UI), and this is a complete list of glitches I've noticed:
- The Microsoft account login flow sometimes goes into a loop of asking me for my password
- Microsoft Teams refuses to work ('you must use Edge or Chrome')
- Google Meet didn't use to support background blurring, but does now
- A coworker reported that a certain server BMC web interface didn't work in Firefox, but did in Chrome (on Mac) — I found (on Linux, idk if that was the relevant difference) it broke the same way in both, which I could get around by deleting a modal overlay in the inspector
(I am not a lawyer)
The usual argument (e.g.) for warrant canaries being meaningful is that the (US) government has much less legal ability to compel speech (especially false speech) than to prohibit it. I don't think any similar argument holds for private contracts; AFAIK they can require speech, and I don't know whether anything is different if the required speech is known by both parties to be false. (The one relevant search result I found doesn't say there's anything preventing such a contract; Claude says there isn't, but it could be thrown out on grounds of public policy or unconscionability.)
I would think this 'canary' still works, because it's hard to imagine OpenAI suing, or getting anywhere with a suit, for someone not proactively lying (when silence could mean things besides 'I am subject to an NDA'). But, if a contract requiring false speech would be valid,
- insofar as this works it works for different reasons than a warrant canary
- it could stop working, if future NDAs are written with it in mind
(Quibbles aside, this is a good idea; thanks for making it!)
Upvoted, but weighing in the other direction: Average Joe also updates on things he shouldn't, like marketing. I expect the doctor to have moved forward some in resistance to BS (though in practice, not as much as he would if he were consistently applying his education).
And the correct reaction (and the study's own conclusion) is that the sample is too small to say much of anything.
(Also, the "something else" was "conventional treatment", not another antiviral.)
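To make "too small" concrete, a quick sketch (the 7-of-10 figure is hypothetical and of the same order as a small trial arm, not taken from the study):

```python
import math

def wilson_ci(successes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """95% Wilson score interval for a binomial proportion."""
    p = successes / n
    denom = 1 + z**2 / n
    center = (p + z**2 / (2 * n)) / denom
    half = (z / denom) * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2))
    return center - half, center + half

# Hypothetical small-trial numbers: 7 recoveries out of 10 patients.
lo, hi = wilson_ci(7, 10)
print(f"Observed 70%, but the 95% CI spans {lo:.0%} to {hi:.0%}")
```

An observed 70% rate with n = 10 is consistent with anything from roughly 40% to 89%, which is the quantitative version of "too small to say much of anything".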
I find the 'backfired through distrust'/'damaged their own credibility' claim plausible, it agrees with my prejudices, and I think I see evidence of similar things happening elsewhere; but the article doesn't contain evidence that it happened in this case, and even though it's a priori likely and worth pointing out, the claim that it did happen should come with evidence. (This is a nitpick, but I think it's an important nitpick in the spirit of sharing likelihood ratios, not posterior beliefs.)
if there's a domain where the model gives two incompatible predictions, then as soon as that's noticed it has to be rectified in some way.
What do you mean by "rectified", and are you sure you mean "rectified" rather than, say, "flagged for attention"? (A bounded approximate Bayesian approaches consistency by trying to be accurate, but doesn't try to be consistent. I believe 'immediately update your model somehow when you notice an inconsistency' is a bad policy for a human [and part of a weak-man version of rationalism that harms people who try to follow it], and I don't think this belief is opposed to "rationalism", which should only require not indefinitely tolerating inconsistency.)
We found that viable virus could be detected... up to 4 hours on copper...
Here's a study using a different coronavirus.
Brasses containing at least 70% copper were very effective at inactivating HuCoV-229E (Fig. 2A), and the rate of inactivation was directly proportional to the percentage of copper. Approximately 10³ PFU in a simulated wet-droplet contamination (20 µl per cm²) was inactivated in less than 60 min. Analysis of the early contact time points revealed a lag in inactivation of approximately 10 min followed by very rapid loss of infectivity (Fig. 2B).
That paper only looks at bacteria and does not knowably carry over to viruses.
I don't see you as having come close to establishing, beyond the (I claim weak) argument from the single-word framing, that the actual amount or parts of structure or framing that Dragon Army has inherited from militaries are optimized for attacking the outgroup to a degree that makes worrying justified.
Doesn't work in incognito mode either. There appears to be an issue with lesserwrong.com when accessed over HTTPS — over HTTP it sends back a reasonable-looking 301 redirect, but on port 443 the TCP connection just hangs.
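A minimal sketch of the symptom (the hostname is the one under discussion; behavior is as I observed it at the time, and the timeout values are arbitrary):

```python
import socket
import http.client

HOST = "lesserwrong.com"

# Plain HTTP: the server answers with a redirect.
conn = http.client.HTTPConnection(HOST, 80, timeout=10)
conn.request("GET", "/")
resp = conn.getresponse()
print("port 80:", resp.status, resp.getheader("Location"))

# HTTPS: if the server is broken as described, the TCP handshake
# never completes and this times out.
try:
    socket.create_connection((HOST, 443), timeout=10)
    print("port 443: TCP connect succeeded")
except socket.timeout:
    print("port 443: connection hung (timed out)")
```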
Similar meta: none of the links to lesserwrong.com currently work due to, well, being to lesserwrong.com rather than lesswrong.com.
Further-semi-aside: "common knowledge that we will coordinate to resist abusers" is actively bad and dangerous to victims if it isn't true. If we won't coordinate to resist abusers, making that fact (/ a model of when we will or won't) common knowledge is doing good in the short run by not creating a false sense of security, and in the long run by allowing the pattern to be deliberately changed.
This post may not have been quite correct Bayesianism (... though I don't think I see any false statements in its body?), but regardless there are one or more steel versions of it that are important to say, including:
- persistent abuse can harm people in ways that make them more volatile, less careful, more likely to say things that are false in some details, etc.; this needs to be corrected for if you want to reach accurate beliefs about what's happened to someone
- arguments are soldiers; if there are legitimate reasons (that people are responding to) to argue against someone or see them as dangerous, this is likely to bleed over to dismissing other things they say more than is justified, especially if there are other motivations to do so
- the intelligent social web makes some people both more likely to be abused, and less likely to be believed
- whether someone seems "off" depends to some extent on how the social web wants them to be perceived, independent of what they're doing
- seriously I don't know how to communicate using words just how powerful (I claim) this class of effects is
- there are all kinds of reasons that not believing claims about abuse is often just really convenient; this sounds obvious but I don't see people accounting for it well; this motivation will take advantage of whatever rationalizations it can
IMO, the "legitimate influence" part of this comment is important and good enough to be a top-level post.
This is simply instrumentally wrong, at least for most people in most environments. Maybe people and an environment could be shaped so that this was a good strategy, but the shaping would actually have to be done and it's not clear what the advantage would be.
My consistent experience of your comments is one of people giving what I believe (insofar as I understand what they're saying) to be the actual best explanations they can, and of you not understanding things that I believe to be comprehensible, and continuing to ask for explanations and evidence that, on their model, they shouldn't necessarily be able to provide.
(to be upfront, I may not be interested in explaining this further, due to limited time and investment + it seeming like a large tangent to this thread)
I don't see how we know, or anything close to knowing, that deep NNs with ‘sufficient training data’ would be sufficient for all problems. We've seen them be sufficient for many different problems and can expect them to be sufficient for many more, but all?
A tangential note on third-party technical contributions to LW (if that's a thing you care about): the uncertainty about whether changes will be accepted, uncertainty about and lack of visibility into how that decision is made or even who makes it, and lack of a known process for making pull requests or getting feedback on ideas are incredibly anti-motivating.
Other possible implications of this scenario have been discussed on LW before.
This shouldn't lead to rejection of the mainstream position, exactly, but rejection of the evidential value of mainstream belief, and reversion to your prior belief / agnosticism about the object-level question.
Solving that problem seems to require some flavor of Paul's "indirect normativity", but that's broken and might be unfixable as I've discussed with you before.
Do you have a link to this discussion?
Why not go a step further and say that 1 copy is the same as 0, if you think there's a non-moral fact of the matter? The abstract computation doesn't notice whether it's instantiated or not. (I'm not saying this isn't itself really confused - it seems like it worsens and doesn't dissolve the question of why I observe an orderly universe - but it does seem to be where the GAZP points.)
I wonder if it would be fair to characterize the dispute summarized in/following from this comment on that post (and elsewhere) as over whether the resolutions to (wrong) questions about anticipation/anthropics/consciousness/etc. will have the character of science/meaningful non-moral philosophy (crisp, simple, derivable, reaching consensus across human reasoners to the extent that settled science does), or that of morality (comparatively fuzzy, necessarily complex, not always resolvable in principled ways, not obviously on track to reach consensus).
Where Recursive Justification Hits Bottom and its comments should be linked for their discussion of anti-inductive priors.
(Edit: Oh, this is where the first quote in the post came from.)
Measuring optimization power requires a prior over environments. Anti-inductive minds optimize effectively in anti-inductive worlds.
(Yes, this partially contradicts my previous comment. And yes, the idea of a world or a proper probability distribution that's anti-inductive in the long run doesn't make sense as far as I can tell; but you can still define a prior/measure that orders any finite set of hypotheses/worlds however you like.)
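For concreteness, one standard formalization (following Eliezer's "Measuring Optimization Power"; the notation here is mine):

$$\mathrm{OP}(s^*) = -\log_2 \mu\{\, s : s \succeq s^* \,\}$$

i.e., the negative log-measure of outcomes at least as preferred as the outcome actually achieved. The measure $\mu$ is exactly the prior over environments: reorder it and you change which minds count as powerful optimizers.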
I agree with the message, but I'm not sure whether I think things with a binomial monkey prior, or an anti-inductive prior, or that don't implement (a dynamic like) modus ponens on some level even if they don't do anything interesting with verbalized logical propositions, deserve to be called "minds".
Have Eliezer's views (or anyone else's who was involved) on the Anthropic Trilemma changed since that discussion in 2009?
So my guess is that a given dollar is probably more valuable at CFAR right this instant, and we hope this changes very soon (due to CFAR having its own support base)...
an added dollar of marginal spending is more valuable at CFAR (in my estimates).
Is this still your view?
I didn't, and still don't... but now I'm a little bit disturbed that I don't, and want to look a lot more closely at Hermione for ways she's awesome.
Upvoted; whatever its relationship to what the OP actually meant, this is good.
Saying that it's good because it's vague, because it's harder to screw up when you don't know what you're talking about, is contrary to the spirit of LessWrong.
Reminding yourself of your confusion, and avoiding privileging hypotheses, by using vague terms as long as you remember that they're vague doesn't seem so bad.
I kept expecting someone to object that "this Turing machine never halts" doesn't count as a prediction, since you can never have observed it to run forever.
More sympathetically, people might (well, I'm sure some people do) see avoiding stereotype-based jokes as a step towards there being things you can't say, and prefer some additional risk of saying harmful things to moving in that direction (possibly down a slippery slope).
But the blogger's position is one that is often met with hostility round these parts, for reasons that are unclear to me.
I think some of it is a defensive reaction to perceived possible vaguely-defined moral demands/condemnation. Here's a long comment I wrote about that in a different context.
Also simple contrarianism, though that's not much of an explanation absent a theory of why this is the thing people are contrarian against.
the parts of social engineering that I think LW is worst at.
What are those?
It's a lesswrongian prejudice that the only game anyone would want to play is Highly Competent But Criminally Underappreciated Backroom Boffin.
Yes. The general case of this prejudice is probably something like 'behavior morally should be evaluated according to its stated far-mode purpose; other purposes are possible and important, but dirty'. Of course, this has the large upside of making us seriously evaluate things according to their stated purpose at all....
Out of curiosity, what are the connotations of the word "rube" that make you suspicious?
Low status, contemptibility, etc. I expect making status hierarchies salient to make people less rational (hence fully generic suspicion), and I had the specific hypothesis that you might see people using 'signaling' models as judging others as contemptible and be offended by this.
Relatedly, I dislike calling the behavior in question "pandering", since I expect using condemnatory terms for phenomena to make them aversive to look at closely, and to lead to bias in attribution (against seeing them in oneself/'good' people and towards seeing them in 'bad' people, as well as towards seeing people who unambiguously exhibit them as 'bad').
I have a hard time telling whether you're trying to say that 'signaling' models are inaccurate, or just that calling them 'signaling' is misleading. I agree with the latter insofar as 'signaling' means this specific economic model, because the behaviors in question aren't directed at economically rational agents. I also can't tell if you dislike models that postulate stupidity (the strong status connotations of the word "rube" make me suspicious).
If you mean the former: I think you greatly overestimate median rationality in your take on the manager and butcher examples. All positive traits get conflated with each other by default. People can and do override their affective impressions with explicit reasoning, but more often than not they don't, especially when evaluating performance is difficult — and it's almost always more difficult than evaluating "does this person look like a winner?".
I also used to think that simple non-costly signaling couldn't possibly stably work, but experience (often with my own irrationality) changed my mind. This is less confusing if I think of it as social-primate (rather than general-intelligence) behavior; liking things/people other people like is socially useful. (This would likely be significant in the manager example in real life, e.g., I'll look better to my superiors if I make similar evaluations of my subordinates to them.)
The quality proposed was "status", but outrage is cheap. Any fool can be outraged at a blog post mentioning rape.
Now, status signaling is overused as an explanation. If the "HOW DARE YOU" comments are signaling (or 'signaling') anything, the obvious thing is alignment with the perceived-as-socially-powerful (implicit-Schelling-point-)faction condemning Robin, not status.
I agree with this policy.
I and the one person currently in the room with me immediately took "by all means necessary" to suggest violence. I think you're in a minority in how you interpret it.
Police seek and preserve public favour not by catering to public opinion, but by constantly demonstrating absolute impartial service to the law.
I know this is meant to be an ideal for the police, but it could also be read as a descriptive claim about public favor, and it's worth noting that that claim is sometimes false: how often do people approve of police bashing the heads of $OUTGROUP?
"Apply decision theory to the set of actions you can perform at that point" is underspecified — are you computing counterfactuals the way CDT does, or EDT, TDT, etc?
This question sounds like a fuzzier way of asking which decision theory to use, but maybe I've missed the point.
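To illustrate the difference concretely, a toy sketch (my numbers; the payoffs are the stock Newcomb values, and the predictor accuracy is assumed) of how the two counterfactual rules come apart:

```python
# How CDT- and EDT-style expected values diverge on Newcomb's problem.
ACCURACY = 0.99            # assumed predictor accuracy
BOX_A, BOX_B = 1_000_000, 1_000

# EDT conditions on the action as evidence about the prediction.
edt_one_box = ACCURACY * BOX_A                  # likely predicted one-boxing
edt_two_box = (1 - ACCURACY) * BOX_A + BOX_B    # likely predicted two-boxing

# CDT holds the already-made prediction fixed under the counterfactual:
# for any fixed credence p that box A is full, two-boxing adds BOX_B.
p = 0.5                                         # arbitrary fixed credence
cdt_one_box = p * BOX_A
cdt_two_box = p * BOX_A + BOX_B

print(f"EDT: one-box {edt_one_box:,.0f} > two-box {edt_two_box:,.0f}")
print(f"CDT: two-box {cdt_two_box:,.0f} > one-box {cdt_one_box:,.0f}")
```

Same set of available actions, different answers, purely because of how the counterfactual is computed.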
Can you give an example of circular preferences that aren't contextual and therefore only superficially circular (like Benja's Alice and coin-flipping examples are contextual and only superficially irrational), and that you endorse, rather than regarding as bugs that should be resolved somehow? I'm pretty sure that any time I feel like I have intransitive preferences, it's because of things like framing effects or loss aversion that I would rather not be subject to.
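To make "bugs that should be resolved" concrete, here is a toy money-pump sketch (my example, not the parent's): an agent with strictly circular preferences pays a small fee for each "upgrade" and cycles back to where it started, strictly poorer.

```python
# Circular preferences A > B > C > A, exploited by a sequence of trades.
prefers = {("A", "B"), ("B", "C"), ("C", "A")}  # (X, Y) means X preferred to Y

def accepts_trade(have: str, offered: str) -> bool:
    return (offered, have) in prefers

wealth, holding = 100.0, "A"
for offered in ["C", "B", "A"]:     # each offer is one step "up" the cycle
    if accepts_trade(holding, offered):
        holding = offered
        wealth -= 1.0               # small fee per trade
print(holding, wealth)              # back to "A", 3 units poorer
```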