LessWrong 2.0 Reader

[question] hypnosis question
KvmanThinking (avery-liu) · 2025-02-06T02:41:53.314Z · answers+comments (5)

Scanless Whole Brain Emulation
Knight Lee (Max Lee) · 2025-01-27T10:00:08.036Z · comments (4)

Use computers as powerful as in 1985 or AI controls humans or ?
jrincayc (nerd_gatherer) · 2025-02-03T00:51:05.706Z · comments (0)

[question] Whose track record of AI predictions would you like to see evaluated?
Jonny Spicer (jonnyspicer) · 2025-01-29T12:05:30.311Z · answers+comments (3)

[link] Credit Suisse collapse obfuscated Parreaux, Thiébaud & Partners scandal
pocock · 2025-02-24T21:28:39.617Z · comments (0)

Safe Search is off: root causes of AI catastrophic risks
Jemal Young (ghostwheel) · 2025-01-31T18:22:43.947Z · comments (0)

Is it ethical to work in AI "content evaluation"?
anon_databoy123 (noob1234) · 2025-01-27T19:58:26.176Z · comments (2)

[link] New LLM Scaling Law
wrmedford · 2025-02-19T20:21:17.475Z · comments (0)

[question] How do biological or spiking neural networks learn?
Dom Polsinelli (dom-polsinelli) · 2025-01-31T16:03:38.425Z · answers+comments (1)

[question] Strong, Stable, Open: Choose Two - in search of an article
Eli_ · 2025-01-31T14:48:21.438Z · answers+comments (0)

[link] Modularity and assembly: AI safety via thinking smaller
D Wong (d-nell) · 2025-02-20T00:58:39.714Z · comments (0)

arch-anarchist reading list
Peter lawless · 2025-02-16T22:47:00.273Z · comments (1)

Arguing for the Truth? An Inference-Only Study into AI Debate
denisemester · 2025-02-11T03:04:58.852Z · comments (0)

Can someone, anyone, make superintelligence a more concrete concept?
Ori Nagel (ori-nagel) · 2025-02-04T02:18:51.718Z · comments (8)

AI acceleration, DeepSeek, moral philosophy
Josh H (joshua-haas) · 2025-02-02T00:08:11.593Z · comments (0)

[link] Probability of AI-Caused Disaster
Alvin Ånestrand (alvin-anestrand) · 2025-02-12T19:40:11.121Z · comments (2)

[link] The future of humanity is in management
jasoncrawford · 2025-01-30T22:14:46.765Z · comments (5)

Visualizing Interpretability
Darold Davis (darold) · 2025-02-03T19:36:38.938Z · comments (0)

Workshop: Interpretability in LLMs Using Geometric and Statistical Methods
Karthik Viswanathan (vkarthik095) · 2025-02-22T09:39:26.446Z · comments (0)

Making alignment a law of the universe
juggins · 2025-02-25T10:44:11.632Z · comments (0)

[link] Forecasting Uncontrolled Spread of AI
Alvin Ånestrand (alvin-anestrand) · 2025-02-22T13:05:57.171Z · comments (0)

Artificial Static Place Intelligence: Guaranteed Alignment
ank · 2025-02-15T11:08:50.226Z · comments (2)

ChatGPT: Exploring the Digital Wilderness, Findings and Prospects
Bill Benzon (bill-benzon) · 2025-02-02T09:54:26.008Z · comments (0)

Updating and Editing Factual Knowledge in Language Models
Dhananjay Ashok (dhananjay-ashok) · 2025-01-23T19:34:37.121Z · comments (2)

Intrinsic Dimension of Prompts in LLMs
Karthik Viswanathan (vkarthik095) · 2025-02-14T19:02:49.464Z · comments (0)

if you're not happy single, you won't be happy immortal
daijin · 2025-02-24T13:23:52.204Z · comments (1)

The many failure modes of consumer-grade LLMs
dereshev · 2025-01-26T19:01:09.891Z · comments (0)

Starting Thoughts on RLHF
Michael Flood (michael-flood) · 2025-01-23T22:16:49.793Z · comments (0)

The Outer Levels
Jerdle (daniel-amdurer) · 2025-02-03T14:30:29.230Z · comments (3)

Should Art Carry the Weight of Shaping our Values?
Krishna Maneesha Dendukuri (krishna_maneesha-d) · 2025-01-28T18:43:32.517Z · comments (0)

LW/ACX social meetup
Stefan (stefan-1) · 2025-02-10T21:12:39.092Z · comments (0)

[link] Language Models and World Models, a Philosophy
kyjohnso · 2025-02-03T02:55:36.577Z · comments (0)

Locating and Editing Knowledge in LMs
Dhananjay Ashok (dhananjay-ashok) · 2025-01-24T22:53:40.559Z · comments (0)

Part 1: Enhancing Inner Alignment in CLIP Vision Transformers: Mitigating Reification Bias with SAEs and Grad ECLIP
Gilber A. Corrales (mysticdeepai) · 2025-02-03T19:30:52.505Z · comments (0)

[question] Programming Language Early Funding?
J Thomas Moros (J_Thomas_Moros) · 2025-02-16T17:34:06.058Z · answers+comments (5)

Interpreting autonomous driving agents with attention based architecture
Manav Dahra (manav-dahra) · 2025-02-01T23:20:27.162Z · comments (0)

Exploring the coherence of features explanations in the GemmaScope
Mattia Proietti (mattia-proietti) · 2025-02-01T21:28:33.690Z · comments (0)

Biological humans collectively exert at most 400 gigabits/s of control over the world.
benwr · 2025-02-20T23:44:06.509Z · comments (1)

The Domain of Orthogonality
mgfcatherall · 2025-02-05T08:14:32.793Z · comments (0)

[question] Why do we have the NATO logo?
KvmanThinking (avery-liu) · 2025-02-19T22:59:41.755Z · answers+comments (4)

[question] Why isn't AI containment the primary AI safety strategy?
OKlogic · 2025-02-05T03:54:58.171Z · answers+comments (3)

[link] Ideas for CoT Models: A Geometric Perspective on Latent Space Reasoning
Rohan Ganapavarapu (rohan-ganapavarapu) · 2025-01-24T19:01:47.339Z · comments (0)

Nationwide Action Workshop: Contact Congress about AI safety!
Felix De Simone (BobusChilc) · 2025-02-24T19:36:09.084Z · comments (0)

Quantifying the Qualitative: Towards a Bayesian Approach to Personal Insight
Pruthvi Kumar (pruthvi-kumar) · 2025-02-15T19:50:42.550Z · comments (0)

Upcoming Neuroscience Workshop - Functionalizing Brain Data, Ground-Truthing, and the Role of Artificial Data in Advancing Neuroscience
Devin Ward (Carboncopies Foundation) · 2025-01-30T23:02:00.681Z · comments (0)

Demystifying the Pinocchio Paradox
Novak Zukowski (Zantarus) · 2025-02-25T06:16:57.219Z · comments (0)

Opinion Article Scoring System
ciaran · 2025-02-10T14:32:19.030Z · comments (0)

Dayton, Ohio, HPMOR 10 year Anniversary meetup
Lunawarrior · 2025-02-24T12:55:59.484Z · comments (0)

[link] The Capitalist Agent
henophilia · 2025-02-04T15:32:39.694Z · comments (10)

[link] Request for proposals: improving capability evaluations
cb · 2025-02-07T18:51:34.926Z · comments (0)

Recent comments

niplav on shortplav

Huh, cool. Intuitively, I'd expect those character-level similarities not to matter too much since the tokenization makes these end up in very different parts of embedding space, unless "kwiecień" or "kviten" are often misspelled as words with the prefix "kwiet". (I checked with Google Translate, which ~always translates "kwiet" as "quiet" for Slavic languages & Maltese, and as "flower" in Polish).

whestler on Historical mathematicians exhibit a birth order effect too

I'm surprised to see so little discussion of educational attainment and its relation to birth order here. It seems that a lot of the discussion is around biological differences. Did I miss something?

Families may only have enough money to send one child to school or university, and this is commonly the first born. As a result, I'd expect to see a trend of more first-borns in academic fields like mathematics, as well as on LessWrong.

As a quick example to back up this hunch, this paper seems to reach the same conclusion:

https://www.sciencedirect.com/science/article/abs/pii/S0272775709001368

"birth order turns out to have a significant negative effect on educational attainment. This decline in years of schooling with birth order turns out to be approximately linear."

I'd be interested to know whether the effect still exists if we control for educational attendance/resources somehow.

niplav on shortplav

Yeah, definitely not the least likely trajectories, instead it's just the next token with the smallest probability. I was thinking of doing beam search with minimizing logits, but that looked difficult to implement. Still surprised that it produces things like prü|stor|oire| which are pretty pronounceable.
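
The distinction in the comment above (picking the least probable token at each step versus searching for a low-probability trajectory) can be sketched on a toy example. Everything here is an assumption for illustration: `toy_model` is a made-up three-token distribution rather than a real LLM, the function names are hypothetical, and the search minimizes summed log-probabilities rather than raw logits.

```python
import math
from typing import Callable, Dict, List, Tuple

Prefix = Tuple[str, ...]
NextProbs = Callable[[Prefix], Dict[str, float]]


def toy_model(prefix: Prefix) -> Dict[str, float]:
    """Hypothetical next-token distribution over a 3-token vocabulary."""
    if not prefix:
        return {"a": 0.6, "b": 0.3, "c": 0.1}
    if prefix[-1] == "b":
        return {"a": 0.9, "b": 0.08, "c": 0.02}
    return {"a": 0.5, "b": 0.3, "c": 0.2}


def greedy_min(model: NextProbs, length: int) -> Tuple[Prefix, float]:
    """At each step, append the single least probable next token."""
    prefix: Prefix = ()
    logp = 0.0
    for _ in range(length):
        probs = model(prefix)
        token = min(probs, key=probs.get)
        logp += math.log(probs[token])
        prefix += (token,)
    return prefix, logp


def beam_min(model: NextProbs, length: int, width: int) -> Tuple[Prefix, float]:
    """Keep the `width` prefixes with the lowest total log-probability."""
    beams: List[Tuple[Prefix, float]] = [((), 0.0)]
    for _ in range(length):
        # Extend every surviving prefix by every possible next token.
        candidates = [
            (prefix + (token,), logp + math.log(p))
            for prefix, logp in beams
            for token, p in model(prefix).items()
        ]
        # Lowest total log-probability first, then prune to the beam width.
        candidates.sort(key=lambda item: item[1])
        beams = candidates[:width]
    return beams[0]


if __name__ == "__main__":
    print(greedy_min(toy_model, 2))   # stepwise least likely token
    print(beam_min(toy_model, 2, 2))  # lower total probability with a wider beam
```

In this toy setup the stepwise choice commits to "c" first and ends up with a sequence of probability 0.02, while a beam of width 2 finds "b" then "c" at 0.006. Beam search is still a heuristic, so with a real model it would not be guaranteed to find the least likely trajectory either.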

lsusr on List of most interesting ideas I encountered in my life, ranked

+1 to Taleb's Extremistan vs Mediocristan model

philh on Microplastics: Much Less Than You Wanted To Know

One question I have that might be relatively tractable: if I'm using plastic containers for leftovers, how much difference is there between

1. Store in the container, put on plate to microwave and eat.
2. Store and microwave in the container, put on plate to eat.
3. Who needs a plate anyway? Just eat from the container.

The bit about plastic chopping boards kind of hints that (3) might give a lot more microplastics than (2)? But you're probably less violent to the container than the chopping board.

ebenezer-dukakis on Ebenezer Dukakis's Shortform

it will also set off the enemy rhetoric detectors among liberals

I'm not sure about that. Does Bernie Sanders's rhetoric set off that detector?

james-oofou on Have LLMs Generated Novel Insights?

Current LLMs are capable of solving novel problems when the user does most of the work: when the user lays the groundwork and poses the right question for the LLM to answer.

So, if we can get LLMs to lay the groundwork and pose the right questions then we'll have autonomous scientists in whatever fields LLMs are OK at problem solving.

This seems like something LLMs will learn to do as inference-time compute is scaled up. Reasoners benefit from coming up with sub-problems whose solutions can be built upon to solve the problem posed by the user.

LLMs will learn that in order to solve difficult questions, they must pose and solve novel sub-questions.

So, once given an interesting research problem, the LLM will hum away for days doing good, often-novel work.

silentbob on List of most interesting ideas I encountered in my life, ranked

In no particular order, because interestingness is multi-dimensional and they are probably all to some degree on my personal interesting Pareto frontier:

We're not as 3-dimensional as we think [LW · GW]
Replacing binary questions with "under which circumstances" [LW · GW]
Almost everything is causally linked [LW · GW], saying "A has no effect on B" is almost always wrong (unless you very deliberately search for A and B that fundamentally cannot be causally linked). If you ran a study with a bazillion subjects for long enough, practically anything you can measure would reach statistical significance
Many disagreements are just disagreements about labels ("LLMs are not truly intelligent", "Free will does not exist") and can be easily resolved / worked around once you realize this (see also [LW · GW])
Selection biases of all kinds
Intentionality bias [LW · GW], it's easy to explain human behavior with supposed intentions, but there is much more randomness and ignorance everywhere than we think
Extrapolations tend to work locally, but extrapolating further into the future very often gets things wrong; kind of obvious, applies to e.g. resource shortages ("we'll run out of X and then there won't be any X anymore!"), but also Covid (I kind of assumed Covid cases would just exponentially climb until everything went to shit, and forgot to take into account that people would get afraid and change their behavior on a societal scale, at least somewhat, and politicians would eventually do things, even if later than I would), and somewhat AI (we likely won't just "suddenly" end up with a flawless superintelligence)
"If only I had more time/money/whatever" style thinking is often misguided, as often when people say/think this, the sentence continues with "then I could spend that time/money/whatever in other/more ways than currently", meaning as soon as you get more of X, you would immediately want to spend it, so you'll never sustainably end up in a state of "more X". So better get used to X being limited and having to make trade-offs and decisions on how to use that limited resource rather than daydreaming about a hypothetical world of "more X". (This does not mean you shouldn't think about ways to increase X, but you should probably distance yourself from thinking about a world in which X is not limited)
Taleb's Extremistan vs Mediocristan model
+1 to Minimalism that lsusr already mentioned
The mindblowing weirdness [LW(p) · GW(p)] of very high-dimensional spaces
Life is basically an ongoing coordination problem between your past/present/future selves
The realization that we're not smart enough to be true consequentialists, i.e. consequentialism is somewhat self-defeating
The teleportation paradox, and thinking about a future world in which a) teleportation is just a necessity to be successful in society (and/or there is just social pressure, e.g. all your friends do it and you get excluded from doing cool things if you don't join in) and b) anyone who has teleported before has convincing memories of having gone through teleportation and come out on the other side. In such a world, anyone with worries about teleportation would basically be screwed. Not sure if I should believe in any kind of continuity of consciousness, but that certainly feels like a thing. So I'd probably prefer not to be forced to give that up just because the societal trajectory happens to lead through ubiquitous teleportation.

purplehermann on Export Surplusses

USA is the world government from a money perspective. They can simply tax the world by printing dollars and sending them overseas.

Any lesson learned about deficits/surpluses from the US is suspect.

China's Belt and Road Initiative/New Silk Road means owning parts of other countries is a terminal value.

Other countries mostly have net neutral imports/exports if I remember correctly.

The way you get rich in an economy is by producing more valuable things and trading for what you want and storing surplus currency (dollars at the world stage).

At times you need to give away your labor so as to start participating and get access, but you need to be getting stuff back to get richer in real terms.

This is no different from individuals in standard economies.

richard_kennaway on Perry Cai's Shortform

Anyone have a logical solution to exactly why we should act altruistically?

"Logical ... should" sounds like a type error, setting things up for a contradiction. While there are adherents of moral naturalism, I doubt there are many moral naturalists around here. Even given moral naturalism, I believe it would still be true that any amount of intelligence can coexist with any goals [? · GW]. So no, there is no reason why unconstrained intelligences should be altruistic, or even be the sort of thing that "altruism" could meaningfully be asserted or denied of them.

I know it makes sense evolutionarily through game theory and statistics, but human decision making is still controlled by emotions

...which came about through evolution, so what work is the "but" doing? The urge to do good for others is what the game theory feels like from inside [LW · GW].

it's still most advantageous for an individual actor to follow their own self-interest to a degree in a social community.

Each knows their own needs and desires better than anyone else, so it's primarily up to each person to ensure their own are fulfilled. Ensuring this often involves working with others. We do things for each other that we may individually prosper.

So, what type of altruism are you asking about? I expect Peter Singer would dismiss reciprocal altruism as weak sauce, a pale and perverted imitation of what he preaches. The EA variety inspired by Singer? Utilitarianism that values all equally to oneself, and feels another's pain as intensely as one's own? Saintliness that values everyone else above oneself who am nothing? There's a long spectrum there, and people inhabiting all parts of it.