LessWrong 2.0 Reader

View: New · Old · Top

Restrict date range: Today · This week · This month · Last three months · This year · All time

← previous page (newer posts) · next page (older posts) →

[question] Cryonics considerations: how big of a problem is ischemia?
kman · 2024-12-04T04:45:06.629Z · answers+comments (1)

[link] Are SAE features from the Base Model still meaningful to LLaVA?
Shan23Chen (shan-chen) · 2024-12-05T20:21:55.501Z · comments (2)

[question] Set Theory Multiverse vs Mathematical Truth - Philosophical Discussion
Wenitte Apiou (wenitte-apiou) · 2024-11-01T18:56:06.900Z · answers+comments (25)

Another UFO Bet
codyz · 2024-11-01T01:55:27.301Z · comments (11)

Post-Quantum Investing: Dump Crypto for Index Funds and Real Estate?
G (g-1) · 2024-12-11T11:59:11.062Z · comments (5)

[link] The Dissolution of AI Safety
Roko · 2024-12-12T10:34:14.253Z · comments (44)

[link] Nerdtrition: simple diets via spreadsheet abuse
dkl9 · 2024-10-27T21:45:15.117Z · comments (0)

Dario Amodei's "Machines of Loving Grace" sound incredibly dangerous, for Humans
Super AGI (super-agi) · 2024-10-27T05:05:13.763Z · comments (1)

Reanalyzing the 2023 Expert Survey on Progress in AI
AI Impacts (AI Imacts) · 2024-12-16T06:10:04.563Z · comments (0)

Where do you put your ideas?
CstineSublime · 2024-12-17T07:26:06.685Z · comments (20)

What conclusions can be drawn from a single observation about wealth in tennis?
Trevor Cappallo (trevor-cappallo) · 2024-12-18T09:55:34.923Z · comments (3)

Meta AI (FAIR) latest paper integrates system-1 and system-2 thinking into reasoning models.
happy friday (happy-friday) · 2024-10-24T16:54:15.721Z · comments (0)

A Brief Explanation of AI Control
Aaron_Scher · 2024-10-22T07:00:56.954Z · comments (1)

[link] What is autonomy? Why boundaries are necessary.
Chipmonk · 2024-10-21T17:56:33.722Z · comments (1)

[question] Why don't we currently have AI agents?
ChristianKl · 2024-12-26T15:26:35.682Z · answers+comments (10)

On Intentionality, or: Towards a More Inclusive Concept of Lying
Cornelius Dybdahl (Kalciphoz) · 2024-10-18T10:37:32.201Z · comments (0)

[question] Change My Mind: Thirders in "Sleeping Beauty" are Just Doing Epistemology Wrong
DragonGod · 2024-10-16T10:20:22.133Z · answers+comments (67)

Thoughts On the Nature of Capability Elicitation via Fine-tuning
Theodore Chapman · 2024-10-15T08:39:19.909Z · comments (0)

Zombies among us
Declan Molony (declan-molony) · 2024-12-31T05:14:07.929Z · comments (3)

[link] It's important to know when to stop: Mechanistic Exploration of Gemma 2 List Generation
Gerard Boxo (gerard-boxo) · 2024-10-14T17:04:57.010Z · comments (0)

[link] Riffing on Machines of Loving Grace
an1lam · 2025-01-01T01:06:45.122Z · comments (0)

[link] Contagious Beliefs—Simulating Political Alignment
James Stephen Brown (james-brown) · 2024-10-13T00:27:08.084Z · comments (0)

HDBSCAN is Surprisingly Effective at Finding Interpretable Clusters of the SAE Decoder Matrix
Jaehyuk Lim (jason-l) · 2024-10-11T23:06:14.340Z · comments (2)

[question] why won't this alignment plan work?
KvmanThinking (avery-liu) · 2024-10-10T15:44:59.450Z · answers+comments (7)

[link] Triangulating My Interpretation of Methods: Black Boxes by Marco J. Nathan
adamShimi · 2024-10-09T19:13:26.631Z · comments (0)

MIT FutureTech are hiring for a Head of Operations role
peterslattery · 2024-10-02T17:11:42.960Z · comments (0)

Three main arguments that AI will save humans and one meta-argument
avturchin · 2024-10-02T11:39:08.910Z · comments (8)

Foresight Vision Weekend 2024
Allison Duettmann (allison-duettmann) · 2024-10-01T21:59:55.107Z · comments (0)

[link] AI Safety Newsletter #42: Newsom Vetoes SB 1047 Plus, OpenAI’s o1, and AI Governance Summary
Corin Katzke (corin-katzke) · 2024-10-01T20:35:32.399Z · comments (0)

[link] In-Context Learning: An Alignment Survey
alamerton · 2024-09-30T18:44:28.589Z · comments (0)

[link] Models of life
Abhishaike Mahajan (abhishaike-mahajan) · 2024-09-29T19:24:40.060Z · comments (0)

Interpreting the effects of Jailbreak Prompts in LLMs
Harsh Raj (harsh-raj-ep-037) · 2024-09-29T19:01:10.113Z · comments (0)

[link] Jailbreaking language models with user roleplay
loops (smitop) · 2024-09-28T23:43:10.870Z · comments (0)

Two new datasets for evaluating political sycophancy in LLMs
alma.liezenga · 2024-09-28T18:29:49.088Z · comments (0)

Steering LLMs' Behavior with Concept Activation Vectors
Ruixuan Huang (sprout_ust) · 2024-09-28T09:53:19.658Z · comments (0)

[link] Experts' AI timelines are longer than you have been told?
Vasco Grilo (vascoamaralgrilo) · 2025-01-16T18:03:18.958Z · comments (4)

Thoughts on the conservative assumptions in AI control
Buck · 2025-01-17T19:23:38.575Z · comments (0)

[link] Deconstructing arguments against AI art
DMMF · 2024-12-27T19:40:13.015Z · comments (5)

A small improvement to Wikipedia page on Pareto Efficiency
ektimo · 2024-11-18T02:13:49.151Z · comments (0)

Do Antidepressants work? (First Take)
Jacob Goldsmith (jacgoldsm) · 2025-01-12T17:11:55.417Z · comments (8)

[question] What actual bad outcome has "ethics-based" RLHF AI Alignment already prevented?
Roko · 2024-10-19T06:11:12.602Z · answers+comments (16)

[link] A Heuristic Proof of Practical Aligned Superintelligence
Roko · 2024-10-11T05:05:58.262Z · comments (6)

Ethical Implications of the Quantum Multiverse
Jonah Wilberg (jrwilb@googlemail.com) · 2024-11-18T16:00:20.645Z · comments (22)

[link] Paper Highlights, November '24
gasteigerjo · 2024-12-07T19:15:11.859Z · comments (0)

[link] [Linkpost] Hawkish nationalism vs international AI power and benefit sharing
jakub_krys (kryjak) · 2024-10-18T18:13:19.425Z · comments (5)

2025 Q1 Pivotal Research Fellowship (Technical & Policy)
Tobias H (clearthis) · 2024-11-12T10:56:24.858Z · comments (0)

[question] Recommendations on communities that discuss AI applications in society
Annapurna (jorge-velez) · 2024-12-24T13:37:49.821Z · answers+comments (2)

[link] Progress links and short notes, 2024-12-16
jasoncrawford · 2024-12-16T17:24:31.398Z · comments (0)

[link] My Experience With A Magnet Implant
Vale · 2025-01-07T03:01:21.410Z · comments (2)

[link] An Epistemological Nightmare
Ariel Cheng (arielcheng218) · 2024-11-21T02:08:56.942Z · comments (0)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

the-gears-to-ascension on Numberwang: LLMs Doing Autonomous Research, and a Call for Input

Aa I said elsewhere, https://www.lesswrong.com/posts/LfQCzph7rc2vxpweS/introducing-the-weirdml-benchmark?commentId=q86ogStKyge9Jznpv [LW(p) · GW(p)]

This is a capabilities game. It is neither alignment or safety. To the degree it's forecasting, it helps cause the thing it forecasts. This has been the standard pattern in capabilities research for a long time: someone makes a benchmark (say, imagenet 1.3m 1000class), and this produces a leaderboard that allows people to show how good their learning algorithm is at novel datasets. In some cases this even produced models directly that were generally useful, but it traditionally was used to show how well an algorithm would work in a new context from scratch. Building benchmarks like this gives teams a new way to brag - they may have a better source of training data (eg, google always had a better source of training data than imagenet), but it allows them to brag that they scored well on the benchmark, which among other things helps them get funding.

Perhaps it also helps convince people to be concerned. That might trade off against this. Perhaps it sucks in some way as a bragging rights challenge. That would trade off against this

Hopefully it sucks as a bragging rights challenge.

jkaufman on Tax Price Gouging?

The thing that I think would be overall better (no price controls) is politically unpopular, strongly socially discouraged, and often illegal. This is a proposal that tries to move us in a direction I think is better, while addressing some of what price gouging opponents dislike.

jkaufman on Tax Price Gouging?

one of the things the public hates more than price increases during a shortage is higher taxes any time

Maybe? Though in this case what we're taxing is the disliked activity--price increases during a shortage. So possibly this would be popular, like taxes on alcohol, tobacco, or gambling?

make emergencies a tax holiday

The main good bit of market pricing this would miss is the demand reduction and reallocation caused by the higher prices. I might be willing to buy 100lb of ice at $1/lb but only 10lb of ice at $5/lb: it's easier for me to just dump a bunch of ice into my fridge, but if I prioritize and put the important stuff into a cooler I can make do with much less. If the government is subsidizing suppliers to keep the price at the pre-disaster rate I don't have this incentive to ice more efficiently.

abandon on dirk's Shortform

Language can only ever approximate reality and that's Fine Actually. The point of maps is to have a simplified representation of the territory you can use for navigation (or avoiding water mains as you dig, or assessing potential weather conditions, or deciding which apartment to rent—and maps for different purposes include or leave out different features of the territory depending on which matter to the task at hand); including all the detail would mean the details that actually matter for our goals are lost in the noise (not to mention requiring, in the limit, a map which is an identical copy of the territory and therefore intractably large). So too is language a compression of reality in order to better communicate that subset of its features which matter to the task at hand; it’s that very compression which lets us choose which part of the territory we point to.

viliam on ahmadzaidi12's Shortform

Yeah, this is why rationality is a group effort (on top of the individual effort). There is not enough time to make a precise map of everything from scratch. It is better to hang out with people whose maps are generally good.

d0themath on The purposeful drunkard

The paper you're thinking of is probably The Developmental Landscape of In-Context Learning.

christiankl on Unregulated Peptides: Does BPC-157 hold its promises?

Having with Cerebrolysin and BPC-157 the two top-rated peptides to be bogus, does suggest that the whole field is untrustworthy. It also makes me more skeptical about self-reporting.

linda-linsefors on The quantum red pill or: They lied to you, we live in the (density) matrix

ϕt=Utϕ0U−1t.

I think you mean here, not $ψ$

d0themath on Lecture Series on Tiling Agents

@abramdemski [LW · GW] I think I'm the biggest agree vote for alexander (without me alexander would have -2 agree), and I do see this because I follow both of you on my subscribe tab.

I basically endorse Alexander's elaboration.

On the "prep for the model that is coming tomorrow not the model of today" front, I will say that LLMs are not always going to be as dumb as they are today. Even if you can't get them to understand or help with your work now, their rate of learning still makes them in some sense your most promising mentee, and that means trying to get as much of the tacit knowledge you have into their training data as possible (if you want them to be able to more easily & sooner build on your work). Or (if you don't want to do that for whatever reason) just generally not being caught flat-footed once they are smart enough to help you, as all your ideas are in videos or otherwise in high context understandable-only-to-abram notes.

In the words of gwern [LW(p) · GW(p)],

Should you write text online now in places that can be scraped? You are exposing yourself to 'truesight' and also to stylometric deanonymization or other analysis, and you may simply have some sort of moral objection to LLM training on your text.
This seems like a bad move to me on net: you are erasing yourself (facts, values, preferences, goals, identity) from the future, by which I mean, LLMs. Much of the value of writing done recently or now is simply to get stuff into LLMs. I would, in fact, pay money to ensure Gwern.net is in training corpuses, and I upload source code to Github, heavy with documentation, rationale, and examples, in order to make LLMs more customized to my use-cases. For the trifling cost of some writing, all the worlds' LLM providers are competing to make their LLMs ever more like, and useful to, me.

dagon on Tax Price Gouging?

This is a novel (to me) line of thinking, and I'm happy to hear about it! I'm not sure it's feasible, as one of the things the public hates more than price increases during a shortage is higher taxes any time.

That said, the REVERSE of this - slightly raise taxes in normal times, and make emergencies a tax holiday, might really work. This gives room for producers/distributors to raise prices WITHOUT as much impact on the consumers. Gets some of the good bits of market pricing, with less of the bad bits (both limited to the magnitude of the tax change relative to the scarcity-based price change).