LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[link] How Big a Deal are MatMul-Free Transformers?
JustisMills · 2024-06-27T22:28:40.888Z · comments (6)

[link] [Linkpost] A Case for AI Consciousness
cdkg · 2024-07-06T14:52:21.704Z · comments (2)

Sustainability of Digital Life Form Societies
Hiroshi Yamakawa (hiroshi-yamakawa) · 2024-07-19T13:59:13.973Z · comments (1)

[question] If I wanted to spend WAY more on AI, what would I spend it on?
Logan Zoellner (logan-zoellner) · 2024-09-15T21:24:46.742Z · answers+comments (7)

[link] what becoming more secure did for me
Chipmonk · 2024-08-22T17:44:48.525Z · comments (5)

Looking for Goal Representations in an RL Agent - Update Post
CatGoddess · 2024-08-28T16:42:19.367Z · comments (0)

[question] What are the best resources for building gears-level models of how governments actually work?
adamShimi · 2024-08-19T14:05:02.590Z · answers+comments (6)

[question] What should we do about COVID in 2024?
ChristianKl · 2024-08-04T10:57:24.140Z · answers+comments (2)

Tokenized SAEs: Infusing per-token biases.
tdooms · 2024-08-04T09:17:46.755Z · comments (20)

A Second Wetsuit Summer
jefftk (jkaufman) · 2024-07-13T02:00:05.412Z · comments (2)

Announcing the PIBBSS Symposium '24!
DusanDNesic · 2024-09-03T11:19:47.568Z · comments (0)

[link] Compression Moves for Prediction
adamShimi · 2024-09-14T17:51:12.004Z · comments (0)

[link] AI existential risk probabilities are too unreliable to inform policy
Oleg Trott (oleg-trott) · 2024-07-28T00:59:59.497Z · comments (5)

Bryan Johnson and a search for healthy longevity
NancyLebovitz · 2024-07-27T15:28:13.117Z · comments (17)

[question] Karma votes: blind to or accounting for score?
cata · 2024-06-22T21:40:34.143Z · answers+comments (4)

Finding Deception in Language Models
Esben Kran (esben-kran) · 2024-08-20T09:42:13.060Z · comments (4)

The Bar for Contributing to AI Safety is Lower than You Think
Chris_Leong · 2024-08-16T15:20:19.055Z · comments (1)

PSA: Consider alternatives to AUROC when reporting classifier metrics for alignment
rpglover64 (alex-rozenshteyn) · 2024-06-24T17:53:28.705Z · comments (1)

[link] Imbue (Generally Intelligent) continue to make progress
Nathan Helm-Burger (nathan-helm-burger) · 2024-06-26T20:41:18.413Z · comments (0)

[link] Nuclear War, Map and Territory, Values | Guild of the Rose Newsletter, May 2024
moridinamael · 2024-06-21T17:39:24.119Z · comments (0)

"Real AGI"
Seth Herd · 2024-09-13T14:13:24.124Z · comments (18)

[link] Green and golden: a meditation
Richard_Ngo (ricraz) · 2024-08-18T01:36:43.613Z · comments (0)

Computational Complexity as an Intuition Pump for LLM Generality
aribrill (Particleman) · 2024-06-25T20:25:36.751Z · comments (6)

Training a Sparse Autoencoder in < 30 minutes on 16GB of VRAM using an S3 cache
Louka Ewington-Pitsos (louka-ewington-pitsos) · 2024-08-24T07:39:00.057Z · comments (0)

[link] Pronouns are Annoying
ymeskhout · 2024-09-18T13:30:04.620Z · comments (17)

[link] Why Swiss watches and Taylor Swift are AGI-proof
Kevin Kohler (KevinKohler) · 2024-09-05T13:23:27.033Z · comments (11)

Games of My Childhood: The Troops
Kaj_Sotala · 2024-07-08T11:20:03.033Z · comments (0)

Initial Experiments Using SAEs to Help Detect AI Generated Text
Aaron_Scher · 2024-07-22T05:16:20.516Z · comments (0)

OpenAI Boycott Revisit
Jake Dennie · 2024-07-22T01:44:55.094Z · comments (2)

[link] To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-09-19T16:13:55.835Z · comments (1)

Travel Buffer
jefftk (jkaufman) · 2024-07-06T02:20:02.723Z · comments (3)

"Which Future Mind is Me?" Is a Question of Values
dadadarren · 2024-08-09T18:17:09.884Z · comments (12)

[link] Minimalist And Maximalist Type Systems
adamShimi · 2024-07-05T16:25:59.448Z · comments (6)

[link] The Dumbification of our smart screens
Itay Dreyfus (itay-dreyfus) · 2024-07-04T06:32:36.672Z · comments (0)

Invitation to lead a project at AI Safety Camp (Virtual Edition, 2025)
Linda Linsefors · 2024-08-23T14:18:24.327Z · comments (2)

[link] How to choose what to work on
jasoncrawford · 2024-09-18T20:39:12.316Z · comments (2)

What program structures enable efficient induction?
Daniel C (harper-owen) · 2024-09-05T10:12:14.058Z · comments (4)

[question] Self-censoring on AI x-risk discussions?
Decaeneus · 2024-07-01T18:24:15.759Z · answers+comments (2)

Why I'm bearish on mechanistic interpretability: the shards are not in the network
tailcalled · 2024-09-13T17:09:25.407Z · comments (35)

[question] Is this voting system strategy proof?
Donald Hobson (donald-hobson) · 2024-09-06T20:44:46.691Z · answers+comments (9)

[link] My lukewarm take on GLP-1 agonists
George3d6 · 2024-08-26T12:34:27.929Z · comments (0)

Podcasts: AGI Show, Consistently Candid, London Futurists
KatjaGrace · 2024-06-23T13:50:03.676Z · comments (0)

[link] AI Safety Newsletter #39: Implications of a Trump Administration for AI Policy Plus, Safety Engineering
Corin Katzke (corin-katzke) · 2024-07-29T17:50:52.454Z · comments (1)

Interview with Robert Kralisch on Simulators
WillPetillo · 2024-08-26T05:49:15.543Z · comments (0)

[question] Is there any rigorous work on using anthropic uncertainty to prevent situational awareness / deception?
David Scott Krueger (formerly: capybaralet) (capybaralet) · 2024-09-04T12:40:07.678Z · answers+comments (6)

[link] AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space
Bogdan Ionut Cirstea (bogdan-ionut-cirstea) · 2024-09-14T23:23:26.296Z · comments (1)

"... than average" is (almost) meaningless
jwfiredragon · 2024-06-21T04:42:26.682Z · comments (6)

[link] Meta Alignment: Communication Wack-a-Mole
Bridgett Kay (bridgett-kay) · 2024-06-22T20:12:16.412Z · comments (2)

[link] Announcing The Techno-Humanist Manifesto: A new philosophy of progress for the 21st century
jasoncrawford · 2024-07-08T16:33:02.194Z · comments (4)

[link] CultFrisbee
Gauraventh (aryangauravyadav) · 2024-08-11T21:36:36.550Z · comments (3)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

bokov-1 on My simple AGI investment & insurance strategy

Maybe the key is not to assume the entire economy will win, but make some attempt to distinguish winners from losers and then find ETFs and other instruments that approximate these sectors.

So, some wild guesses...

AI labs and their big-tech partners: winners
Cloud hosting: winners
Commercial real estate specializing in server farms: winners
Whoever comes up with tractable ways to power all these server farms: winners
AI-enabling hardware companies: winners until the Chinese blockade Taiwan and impose an embargo on raw materials... after that... maybe losers except the ones that have already started diversifying their supply-chains?
Companies which inherently depend on aggregating and reselling labor: tricky, because if they do nothing, they're toast, but some of them can turn themselves into resellers of AI... e.g. a temp agency rolling out AI services as a cheaper product line
Professional services: same as above but less exposed
Businesses that are needed only in proportion to other businesses having human employees: travel, office real estate, office furniture and supplies: losers

As the effects ripple out and more and more workers are displaced...

Low to mid-end luxury goods and eventually anything that depends on mass discretionary spending: losers

Though what I really would like to do is create some sort of rough model of an individual non-AI company with the following parameters:

Recurring costs attributable to employees
Other recurring costs
Revenue
Fraction of employees whose jobs can be automated at the current state of the art
Variables representing of how far along this company is in planning or implementing AI-driven consolidation and how quickly it is capable of cutting over to AI
Fixed costs of cut-over to AI
Variable costs of cut-over to AI (depending on aggregate workload being automated)
Whatever other variables people who unlike me actually know something about fundamental analysis would put in such a model.

...and then be able to make a principled guess about where on the AI-winners vs AI-losers spectrum a given company is. I even started sketching out a model like this until I realized that someone with relevant expertise must have already written a general-purpose model of this sort and I should find it and adapt it to the AI-automation scenario instead of making up my own.

rhollerith_dot_com on My simple AGI investment & insurance strategy

Our situation is analogous to someone who has been diagnosed with cancer and told he has a low probability of survival, but at least there's a nifty investment opportunity he can buy that pays off big if he does survive.

steve2152 on [Intuitive self-models] 1. Preliminaries

Thanks for the kind words!

The thing you quoted was supposed to be very silly and self-deprecating, but I wrote it very poorly, and it actually wound up sounding kinda judgmental. Oops, sorry. I just rewrote it. I agree with everything you wrote in this comment.

mark-xu on My AI Model Delta Compared To Christiano

I don’t think Paul thinks verification is generally easy or that delegation is fundamentally viable. He, for example, doesn’t suck at hiring because he thinks it’s in fact a hard problem to verify if someone is good at their job.

I liked Rohins comment elsewhere on this general thread.

I’m happy to answer more specific questions, although provide would generally feel more comfortable answering questions about my views then about Paul’s.

linda-linsefors on [Intuitive self-models] 1. Preliminaries

I tried it and it works for me too.

For me the dancer was spinning contraclockwise and would not change. With your screwing trick I could change rotation, and where now stably stuck in the clockwise direction. Until I screwed in the other direction. I've now done this back and forth a few times.

paradiddle on [Intuitive self-models] 1. Preliminaries

Section 1.6 is another appendix about how this series relates to Philosophy Of Mind. My opinion of Philosophy Of Mind is: I’m against it! Or rather, I’ll say plenty in this series that would be highly relevant to understanding the true nature of consciousness, free will, and so on, but the series itself is firmly restricted in scope to questions that can be resolved within the physical universe (including physics, neuroscience, algorithms, and so on). I’ll leave the philosophy to the philosophers.

At the risk of outing myself as a thin-skinned philosopher, I want to push back on this a bit. If we are taking "philosophy of mind" to mean, "the kind of work philosophers of mind do" (which I think we should), then your comment seems misplaced. Crucially, one need not be defending particular views on "big questions" about the true nature of consciousness, free will, and so on to be doing philosophy of mind. Rather, much of the work philosophers of mind do is continuous with scientific inquiry. Indeed, I would say some philosophy of mind is close to indistinguishable from what you do in this post! For example, lots of this work involves trying to carve up conceptual space in a way that coheres with empirical findings, suggests avenues for further research, and renders fruitful discussion easier. Your section 1.3 in this post features exactly the kind of conceptual work that is the bread-and-butter of philosophy. So, far from leaving philosophy to the philosophers, I actually think your work would fit comfortably into the more empirically informed end of contemporary philosophy of mind. To end on a positive note, I think it's really clearly written, fascinating, and fun to read. So thanks!

bokov-1 on My simple AGI investment & insurance strategy

I'm trying out this strategy on Investopedia's simulator (https://www.investopedia.com/simulator/trade/options)

The January 15 2027 call options on QQQ look like this as of posting (current price 481.48):

Strike	Black-Scholes	Ask
485	64.244	77.4
500	57.796	69.83
...	...	...
675	14.308	14
680	13.693	13.5
685	13.077	12.49
...	...	...
700	11.446	10.5
...	...	...
720	9.702	8.5

So, if you were following this strategy and buying today, would you buy 485 because it has the lowest OOM strike price? Would you buy 675 because it's the lowest strike price where the ask is lower than the theoretical Black-Sholes fair price? Would you go for 720 because it's the cheapest available? Would you look for the out-of-money option with the largest difference between Black-Sholes and the ask?

What would be your thought process? I'm definitely hoping to hear from @lc but am interested in hearing from anybody who found this line of reasoning worth investigating and has opinions about it.

dagon on What you know when you know nothing

I think this is mixing up colloquial "know nothing" and literal "know nothing". It's impossible to identify a thing about which one knows nothing, as that identification is something about the thing. It can be wrong, and it can be very imprecise, but it's not nothing.

50/50 are the odds of A when we know nothing about A.

No. 50/50 is a reasonable universal prior, but that's both very theoretical and deeply unclear how to categorize quantum waveforms into things over which a probability is even applicable. In most real cases, 50/50 are the odds to start with when all you know is that it's common enough to come to your attention, and that it "feels" balanced whether or not it'll happen.

In other words, "undefined and inapplicable" is the probability for things you know nothing about. Almost all things you can apply probability to, you know SOMETHING about.

You add another layer of mixing literal and figurative "don't know anything" to the term "singularity". Also, don't forget to multiply by the probability that a singularity-on-relevant-factors might not have happened for the thing you're predicting.

dacyn on Pronouns are Annoying

No, because John could be speaking about himself administering the medication.

If it's about John administering the medication then you'd have to say "... he refused to let him".

It’s also possible to refuse to do something you’ve already acknowledged you should do, so the 3rd he could still be John regardless of who is being told what.

But the sentence did not claim John merely acknowledged that he should administer the medication, it claimed John was the originator of that statement. Is John supposed to be refusing his own requests?

cubefox on What you know when you know nothing

If we know nothing about them, the statements could equally be true or false, and positively or negatively dependent. The same argument which makes us assume 50% probability to individual statements would also make us assume independence between statements. The possibilities cancel out, so to speak.