LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

Video and transcript of presentation on Otherness and control in the age of AGI
Joe Carlsmith (joekc) · 2024-10-08T22:30:38.054Z · comments (1)

ARENA4.0 Capstone: Hyperparameter tuning for MELBO + replication on Llama-3.2-1b-Instruct
25Hour (aaron-kaufman) · 2024-10-05T11:30:11.953Z · comments (2)

Augmenting Statistical Models with Natural Language Parameters
jsteinhardt · 2024-09-20T18:30:10.816Z · comments (0)

[link] Book review: On the Edge
PeterMcCluskey · 2024-08-30T22:18:39.581Z · comments (0)

Reflective consistency, randomized decisions, and the dangers of unrealistic thought experiments
Radford Neal · 2023-12-07T03:33:16.149Z · comments (25)

Adam Smith Meets AI Doomers
James_Miller · 2024-01-31T15:53:03.070Z · comments (10)

D&D.Sci (Easy Mode): On The Construction Of Impossible Structures
abstractapplic · 2024-05-17T00:25:42.950Z · comments (12)

Intransitive Trust
Screwtape · 2024-05-27T16:55:29.294Z · comments (15)

Unpicking Extinction
ukc10014 · 2023-12-09T09:15:41.291Z · comments (10)

If You Can Climb Up, You Can Climb Down
jefftk (jkaufman) · 2024-07-30T00:00:06.295Z · comments (9)

[link] GPT2, Five Years On
Joel Burget (joel-burget) · 2024-06-05T17:44:17.552Z · comments (0)

[link] The last era of human mistakes
owencb · 2024-07-24T09:58:42.116Z · comments (2)

[link] My Apartment Art Commission Process
jenn (pixx) · 2024-08-26T18:36:44.363Z · comments (4)

[link] The $100B plan with "70% risk of killing us all" w Stephen Fry [video]
Oleg Trott (oleg-trott) · 2024-07-21T20:06:39.615Z · comments (8)

AXRP Episode 33 - RLHF Problems with Scott Emmons
DanielFilan · 2024-06-12T03:30:05.747Z · comments (0)

[link] Romae Industriae
Maxwell Tabarrok (maxwell-tabarrok) · 2024-07-19T13:03:31.536Z · comments (2)

Motivating Alignment of LLM-Powered Agents: Easy for AGI, Hard for ASI?
RogerDearnaley (roger-d-1) · 2024-01-11T12:56:29.672Z · comments (4)

[link] Suffering Is Not Pain
jbkjr · 2024-06-18T18:04:43.407Z · comments (45)

[link] Robin Hanson & Liron Shapira Debate AI X-Risk
Liron · 2024-07-08T21:45:40.609Z · comments (4)

Finding the Wisdom to Build Safe AI
Gordon Seidoh Worley (gworley) · 2024-07-04T19:04:16.089Z · comments (10)

Confusing the metric for the meaning: Perhaps correlated attributes are "natural"
NickyP (Nicky) · 2024-07-23T12:43:18.681Z · comments (3)

[link] Twitter thread on open-source AI
Richard_Ngo (ricraz) · 2024-07-31T00:26:11.655Z · comments (6)

Effectively Handling Disagreements - Introducing a New Workshop
Camille Berger (Camille Berger) · 2024-04-15T16:33:50.339Z · comments (2)

[link] The Cancer Resolution?
PeterMcCluskey · 2024-07-24T00:25:17.322Z · comments (24)

Boston Solstice 2023 Retrospective
jefftk (jkaufman) · 2024-01-02T03:10:05.694Z · comments (0)

2024 ACX Predictions: Blind/Buy/Sell/Hold
Zvi · 2024-01-09T19:30:06.388Z · comments (2)

One way violinists fail
Solenoid_Entity · 2024-05-29T04:08:17.675Z · comments (5)

Musings on LLM Scale (Jul 2024)
Vladimir_Nesov · 2024-07-03T18:35:48.373Z · comments (0)

Monthly Roundup #16: March 2024
Zvi · 2024-03-19T13:10:05.529Z · comments (4)

UDT1.01: Logical Inductors and Implicit Beliefs (5/10)
Diffractor · 2024-04-18T08:39:13.368Z · comments (2)

[link] patent process problems
bhauth · 2024-07-14T21:12:04.953Z · comments (13)

Experimentation (Part 7 of "The Sense Of Physical Necessity")
LoganStrohl (BrienneYudkowsky) · 2024-03-18T21:25:56.527Z · comments (0)

[link] AI Safety Memes Wiki
plex (ete) · 2024-07-24T18:53:04.977Z · comments (1)

Love, Reverence, and Life
Elizabeth (pktechgirl) · 2023-12-12T21:49:04.061Z · comments (7)

Mech Interp Lacks Good Paradigms
Daniel Tan (dtch1997) · 2024-07-16T15:47:32.171Z · comments (0)

DIY LessWrong Jewelry
Fluffnutt (Pear) · 2024-08-25T21:33:56.173Z · comments (0)

[link] FTX expects to return all customer money; clawbacks may go away
Mikhail Samin (mikhail-samin) · 2024-02-14T03:43:13.218Z · comments (1)

Monthly Roundup #20: July 2024
Zvi · 2024-07-23T12:50:07.991Z · comments (9)

[link] Information dark matter
Logan Kieller (logan-kieller) · 2024-10-01T15:05:41.159Z · comments (4)

My disagreements with "AGI ruin: A List of Lethalities"
Noosphere89 (sharmake-farah) · 2024-09-15T17:22:18.367Z · comments (44)

Proveably Safe Self Driving Cars [Modulo Assumptions]
Davidmanheim · 2024-09-15T13:58:19.472Z · comments (26)

The Cognitive Bootcamp Agreement
Raemon · 2024-10-16T23:24:05.509Z · comments (0)

Disentangling four motivations for acting in accordance with UDT
Julian Stastny · 2023-11-05T21:26:22.514Z · comments (3)

5. Moral Value for Sentient Animals? Alas, Not Yet
RogerDearnaley (roger-d-1) · 2023-12-27T06:42:09.130Z · comments (41)

One True Love
Zvi · 2024-02-09T15:10:05.298Z · comments (7)

AGI will be made of heterogeneous components, Transformer and Selective SSM blocks will be among them
Roman Leventov · 2023-12-27T14:51:37.713Z · comments (9)

More on the Apple Vision Pro
Zvi · 2024-02-13T17:40:05.388Z · comments (5)

Templates I made to run feedback rounds for Ethan Perez’s research fellows.
Henry Sleight (ResentHighly) · 2024-03-28T19:41:15.506Z · comments (0)

[link] Fake Deeply
Zack_M_Davis · 2023-10-26T19:55:22.340Z · comments (7)

Helpful examples to get a sense of modern automated manipulation
trevor (TrevorWiesinger) · 2023-11-12T20:49:57.422Z · comments (3)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

skybluecat on Open Thread Fall 2024

Should AI safety people/funds focus more on boring old human problems like (especially cyber-and bio-)security instead of flashy ideas like alignment and decision theory? The possible impact of vulnerabilities will only increase in the future with all kinds of technological progress, with or without sudden AI takeoff, but they are much of what makes AGI dangerous in the first place. Security has clear benefits regardless and people already have a good idea how to do it, unlike with AGI or alignment.

If any actor with or without AGI can quickly gain lots of money and resources without alarming anyone, can take over infrastructure and weaponry, or can occupy land and create independent industrial systems and other countries cannot stop it, our destiny is already not in our hands, and it would be suicidal to think we don't need to fix these first because we expect to create an aligned AGI to save us.

If we grow complacent about the fragility of our biology and ecosystem, and continue to allow the possibility of any actor releasing pandemics and arbitrary malwares and deadly radiation etc (for example by allowing global transport without reliable pathogen removal, or using operating systems and open-source libraries that have not been formally proven to be safe), and keep thinking the universe should keep our environment safe and convenient by default, it would be naive to complain when these things happen and hope AGI would somehow preserve human lives and values without having to change our lifestyle or biology to adapt to new risks.

Yes, fixing vulnerabilities of our biology and society is hard and inconvenient and not as glamorous as creating a friendly god to do whatever you want, but we shouldn't let motivated reasoning and groupthink lead us into thinking the latter is feasible when we don't have a good idea about how to do it, just because the former requires sacrifices and investments and we'd prefer if it's not needed. After all, it's a fact that there exist small configurations of matter and information that can completely devastate our world, and just wishing it wasn't true is not going to make it go away.

hmys on BIG-Bench Canary Contamination in GPT-4

But the probability? :O

avturchin on avturchin's Shortform

"Bird Flu H5N1: Not Chaos, but Conspiracy?" By Alexander Pruss
Two months ago, I was puzzled how bird flu, potentially capable of killing tens of millions, went rampant on American livestock farms and began infecting workers, yet no urgent measures were being taken. Even standard epidemiological threat monitoring was happening unsystematically, with months-long delays, and results weren't being made public for months afterward. What happened to the bitter lessons from the coronavirus pandemic? Why such chaos? Since then, the sense of criminal inaction has only intensified. Missouri discovered the first outbreak of human cases unrelated to farm workers, but molecular testing was neglected and infection paths remained undiscovered.

In California, a more pathogenic variant of bird flu spread to hundreds of dairy farms, reportedly killing up to 15% of cows, with almost daily new cases of virus transmission to humans. The virus apparently came to California through cattle transportation from Idaho, despite belatedly introduced rules formally prohibiting the transport of infected cows across state lines. The problem was that infection in transported cows was checked through selective testing, and as reported, the sampling wasn't random: before government testing, farmers secretly tested cows for bird flu in private laboratories and selected only healthy ones for official testing. Here's the continuation of the translation:

A new Vanity Fair investigation shows this isn't random chaos. The USDA (U.S. Department of Agriculture) has been blocking research and data about the new infection in America's dairy herds from the start to protect the multi-billion-dollar American dairy export industry and the interests of giant national dairy processing companies. The idea was simple: most cows recover after a few weeks, and while the bird flu virus does get into milk in huge quantities, it should die during pasteurization. Therefore, the economic losses from the pathogen aren't that severe. However, if consumers in America and especially abroad raise the alarm, it could result in much greater dollar losses. USDA Secretary Thomas Vilsack knows this firsthand: before his government appointment, he worked as president of the U.S. Dairy Export Council.

And immediately after it was finally discovered in March 2024 that dairy farms in Texas and Kansas were hit by bird flu, veterinarians and state officials began receiving calls from personal mobile phones of USDA veterinary institute workers: "we're officially forbidden to discuss this problem without permission from the very top, and unofficially we're asking you to keep quiet about it too." But what about the danger that the virus, having settled in mammals and especially humans, could recombine with our seasonal flu viruses and produce hybrid viruses that combine the infectious and pathogenic potential of human viruses with immunity to our regular antibodies inherited from their avian ancestor?

This, generally speaking, isn't USDA's concern. This alarm was raised by the White House Office of Pandemic Preparedness and Response (OPPR), created in 2023, under the leadership of military doctor and biosecurity expert Paul Friedrichs. In early April, dairy industry representatives raised concerns that some upstart from the White House was muddying the waters. USDA's response was their new policy of official secrecy. Secretary Vilsack responded only a month later to state veterinarians' inquiries about the sudden communication breakdown, and his response was essentially a brush-off. And his ally in Texas, state agriculture commissioner Sid Miller, even hinted that if Friedrichs' people stick their noses into Texas farms, they might be met with bullets.

A number of veterinarians who disagreed with USDA's actions soon lost their jobs, and the country fell into an atmosphere of "work-to-rule," where veterinary authorities appear to be doing their job, but as slowly as possible and with all the red tape that can be justified by regulations. Meanwhile, flu season is approaching, and encounters between bird and human flu in people infected with both viruses are inevitable in the near future.

I forgot to add that by May, a vaccine for bird flu became available for cows, but the USDA chose not to use it.

anthonyc on Word Spaghetti

100% agreed. And you're definitely not alone. Someone once told me, when they asked me to describe the company I worked for, that "Your 5 minute pitch is much better than your 30 second pitch." The reasons are similar. And when I do try to condense and refine my language, I tend to run into the problem that most readers/listeners almost completely ignore qualifiers and caveats, however explicitly stated. I like to say that most of the meaning of a sentence is buried in the small words that are easy to ignore. It's very hard to express a web or network of meanings. That's also why people talk about "unpacking" and "close reading."

From CS Lewis, in the Space Trilogy:

“Of course I realise it’s all rather too vague for you to put into words,” when he took me up rather sharply, for such a patient man, by saying, “On the contrary, it is words that are vague. The reason why the thing can’t be expressed is that it’s too definite for language.”

And of course, from HPMOR ch 70:

Godric Gryffindor's autobiography had been a lot more compressed than the books Hermione was used to reading, he used one sentence to say things that should've taken thirty inches just by themselves, and then there was another sentence after that...

lblack on Lucius Bushnaq's Shortform

Why aren’t there 2^{1000} less programs with such dead code and a total length below10^90 for p_2, compared to p_1?

samshap on Lucius Bushnaq's Shortform

Yes, you are missing something.

Any DEADCODE that can be added to a 1kb program can also be added to a 2kb program. The net effect is a wash, and you will end up with a ratio over priors

leon-lang on Alexander Gietelink Oldenziel's Shortform

I have a similar feeling, but there are some forces in the opposite direction:

Nvidia seems to limit how many GPUs a single competitor can acquire.
training frontier models becomes cheaper over time. Thus, those that build competitive models some time later than the absolute frontier have to invest much less resources.

alexander-gietelink-oldenziel on Alexander Gietelink Oldenziel's Shortform

AGI companies merging within next 2-3 years inevitable?

There are currently about a dozen major AI companies racing towards AGI with many more minor AI companies. The way the technology shakes out this seems like unstable equilibrium.

It seems by now inevitable that we will see further mergers, joint ventures - within two years there might only be two or three major players left. Scale is all-dominant. There is no magic sauce, no moat. OpenAI doesn't have algorithms that her competitors can't copy within 6-12 months. It's all leveraging compute. Whatever innovations smaller companies make can be easily stolen by tech giants.

e.g. we might have xAI- Meta, Anthropic- DeepMind-SSI-Google, OpenAI-Microsoft-Apple.

Actuallly, although this would be deeply unpopular in EA circles it wouldn't be all that surprising if Anthropic and OpenAI would team up.

And - of course - a few years later we might only have two competitors: USA, China.

xizneb on The Personal Implications of AGI Realism

I agree that there are significant uncertainties on the specific consequences of AI accelerating bio/medicine R&D, but I think even without buying into Amodei's specific speculations on life extension, you would still get wildly transformative breakthroughs and unforeseen consequences. I do agree it seems to make sense to be wary of just extrapolating past increases in life expectancy.

Time will tell!

dusandnesic on Could randomly choosing people to serve as representatives lead to better government?

I think an important thing here is:

A random person gets selected for office. Maybe they need to move to the capital city, but their friends are still "back home." Once they serve their term, they will want to come back to their community most likely. So lobbying needs to be able to pay to get you out of your community, break all your bonds and all that during your short stint in power. Currently, politicians slowly come to power and their social clique is used to being lobbies and getting rich and selling out ideals.

This would cut down on corruption a lot (see also John Huang's comment https://www.lesswrong.com/posts/veebprDdTbq2Xmnyj/could-randomly-choosing-people-to-serve-as-representatives?commentId=NEtq8QtayXZY5a38J [LW(p) · GW(p)]) and would undo a lot of the damage done from politicians not having to live normal lives under the current system.