LessWrong 2.0 Reader

Why Care About Natural Latents?
johnswentworth · 2024-05-09T23:14:30.626Z · comments (3)

Skepticism About DeepMind's "Grandmaster-Level" Chess Without Search
Arjun Panickssery (arjun-panickssery) · 2024-02-12T00:56:44.944Z · comments (13)

How do you actually obtain and report a likelihood function for scientific research?
Peter Berggren (peter-berggren) · 2024-02-11T17:42:49.956Z · comments (4)

Genetic fitness is a measure of selection strength, not the selection target
Kaj_Sotala · 2023-11-04T19:02:13.783Z · comments (43)

Why I no longer identify as transhumanist
Kaj_Sotala · 2024-02-03T12:00:04.389Z · comments (33)

Conditional prediction markets are evidential, not causal
philh · 2024-02-07T21:52:47.476Z · comments (10)

Secret Collusion: Will We Know When to Unplug AI?
schroederdewitt · 2024-09-16T16:07:01.119Z · comments (7)

The Best of Don’t Worry About the Vase
Zvi · 2023-12-13T12:50:02.510Z · comments (4)

[link] OpenAI releases GPT-4o, natively interfacing with text, voice and vision
Martín Soto (martinsq) · 2024-05-13T18:50:52.337Z · comments (23)

... Wait, our models of semantics should inform fluid mechanics?!?
johnswentworth · 2024-08-26T16:38:53.924Z · comments (12)

Complexity of value but not disvalue implies more focus on s-risk. Moral uncertainty and preference utilitarianism also do.
Chi Nguyen · 2024-02-23T06:10:05.881Z · comments (18)

AI #44: Copyright Confrontation
Zvi · 2023-12-28T14:30:10.237Z · comments (13)

[link] Google Gemini Announced
Jacob G-W (g-w1) · 2023-12-06T16:14:07.192Z · comments (22)

Math-to-English Cheat Sheet
nahoj · 2024-04-08T09:19:40.814Z · comments (5)

[link] In Defense of Epistemic Empathy
Kevin Dorst · 2023-12-27T16:27:06.320Z · comments (19)

[link] the micro-fulfillment cambrian explosion
bhauth · 2023-12-04T01:15:34.342Z · comments (5)

[link] Unlocking Solutions—By Understanding Coordination Problems
James Stephen Brown (james-brown) · 2024-07-27T04:52:13.435Z · comments (4)

On Anthropic’s Sleeper Agents Paper
Zvi · 2024-01-17T16:10:05.145Z · comments (5)

[link] Theories of Change for AI Auditing
Lee Sharkey (Lee_Sharkey) · 2023-11-13T19:33:43.928Z · comments (0)

Safe Stasis Fallacy
Davidmanheim · 2024-02-05T10:54:44.061Z · comments (2)

[link] Questions are usually too cheap
Nathan Young · 2024-05-11T13:00:54.302Z · comments (19)

[link] AI, centralization, and the One Ring
owencb · 2024-09-13T14:00:16.126Z · comments (11)

Dating Roundup #2: If At First You Don’t Succeed
Zvi · 2024-01-02T16:00:04.955Z · comments (29)

FixDT
abramdemski · 2023-11-30T21:57:11.950Z · comments (14)

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
leogao · 2023-12-16T05:39:10.558Z · comments (5)

What if a tech company forced you to move to NYC?
KatjaGrace · 2024-06-09T06:30:03.329Z · comments (22)

[link] Land Reclamation is in the 9th Circle of Stagnation Hell
Maxwell Tabarrok (maxwell-tabarrok) · 2024-01-12T13:36:27.159Z · comments (6)

Cooperating with aliens and AGIs: An ECL explainer
Chi Nguyen · 2024-02-24T22:58:47.345Z · comments (8)

[link] Come to Manifest 2024 (June 7-9 in Berkeley)
Saul Munn (saul-munn) · 2024-03-27T21:30:17.306Z · comments (2)

[Closed] PIBBSS is hiring in a variety of roles (alignment research and incubation program)
Nora_Ammann · 2024-04-09T08:12:59.241Z · comments (0)

[link] [Closed] Agent Foundations track in MATS
Vanessa Kosoy (vanessa-kosoy) · 2023-10-31T08:12:50.482Z · comments (1)

On “first critical tries” in AI alignment
Joe Carlsmith (joekc) · 2024-06-05T00:19:02.814Z · comments (8)

Ten Modes of Culture War Discourse
jchan · 2024-01-31T13:58:20.572Z · comments (15)

Monthly Roundup #17: April 2024
Zvi · 2024-04-15T12:10:03.126Z · comments (4)

Fat Tails Discourage Compromise
niplav · 2024-06-17T09:39:16.489Z · comments (5)

How the AI safety technical landscape has changed in the last year, according to some practitioners
tlevin (trevor) · 2024-07-26T19:06:47.126Z · comments (6)

Human wanting
TsviBT · 2023-10-24T01:05:39.374Z · comments (1)

[link] LLMs seem (relatively) safe
JustisMills · 2024-04-25T22:13:06.221Z · comments (24)

AI #76: Six Shorts Stories About OpenAI
Zvi · 2024-08-08T13:50:04.659Z · comments (10)

AI #50: The Most Dangerous Thing
Zvi · 2024-02-08T14:30:13.168Z · comments (4)

Zvi's Manifold Markets House Rules
Zvi · 2023-11-13T00:28:02.147Z · comments (6)

Trading off Lives
jefftk (jkaufman) · 2024-01-03T03:40:05.603Z · comments (12)

[link] Open Phil releases RFPs on LLM Benchmarks and Forecasting
LawrenceC (LawChan) · 2023-11-11T03:01:09.526Z · comments (0)

2022 (and All Time) Posts by Pingback Count
Raemon · 2023-12-16T21:17:00.572Z · comments (14)

AI #71: Farewell to Chevron
Zvi · 2024-07-04T13:40:05.905Z · comments (9)

AI #37: Moving Too Fast
Zvi · 2023-11-09T17:50:04.324Z · comments (5)

[question] Can we get an AI to do our alignment homework for us?
Chris_Leong · 2024-02-26T07:56:22.320Z · answers+comments (33)

Calendar feature geometry in GPT-2 layer 8 residual stream SAEs
Patrick Leask (patrickleask) · 2024-08-17T01:16:53.764Z · comments (0)

AMA: Earning to Give
jefftk (jkaufman) · 2023-11-07T16:20:10.972Z · comments (8)

Recent comments

tailcalled on We Don't Know Our Own Values, but Reward Bridges The Is-Ought Gap

So you're basically working with a maximally-shattered model of agency where life consists of a bunch of independent activities that can be fully observed post-hoc and which have no connection between them?

So e.g. if you sometimes feel like eating one kind of food and other times feel like eating another kind of food, you just think "ah, my food preference arbitrarily changed", not "my situation changed so that the way to objectively improve my food intake is different now than it was in the past"?

richard_kennaway on We Don't Know Our Own Values, but Reward Bridges The Is-Ought Gap

Expected utility is what you have before the outcome of an action is known. Actual utility is what you have after the outcome is known. Here, the utility function has remained the same and you have acquired knowledge of the outcome.

Someone no longer finding a thing valuable that they used to, has either re-evaluated the thing in the light of new information about it, or changed the value they (their utility function) put on it.

j-bostock on The Best Lay Argument is not a Simple English Yud Essay

Yeah, I agree we need improvement. I don't know how many people it's important to reach, but I am willing to believe you that this will hit maybe 10%. I expect the 10% to be people with above-average impact on the future, but I don't know what %age of people is enough.

90% is an extremely ambitious goal. I would be surprised if 90% of the population can be reliably convinced by logical arguments in general.

dacyn on Pronouns are Annoying

I don't agree that I am making unwarranted assumptions; I think what you call "assumptions" are merely observations about the meanings of words. I agree that it is hard to program an AI to determine who the "he"s refer to, but I think as a matter of fact the meanings of those words don't allow for any other possible interpretation. It's just hard to explain to an AI what the meanings of words are. Anyway I'm not sure if it is productive to argue this any further as we seem to be repeating ourselves.

niplav on Sodium's Shortform

When will this be revealed?

aysja on Skills from a year of Purposeful Rationality Practice

Similarly in Baba is You: when people don't have a crisp understanding of the puzzle, they tend to grasp at straws and motivatedly-reason their way into accepting sketchy sounding premises. But, the true solution to a level often feels very crisp and clear and inevitable.

A few of the scientists I’ve read about have realized their big ideas in moments of insight (e.g., Darwin for natural selection, Einstein for special relativity). My current guess about what’s going on is something like: as you attempt to understand a concept you don’t already have, you’re picking up clues about what the shape of the answer is going to look like (i.e., constraints). Once you have these constraints in place, your mind is searching for something which satisfies all of them (both explicitly and implicitly), and insight is the thing that happens when you find a solution that does.

At least, this is what it feels like for me when I play Baba is You (i.e., when I have the experience you’re describing here). I always know when a fake solution is fake, because it’s really easy to tell that it violates one of the explicit constraints the game has set out (although sometimes in desperation I try it anyway :p). But it’s immediately clear when I've landed on the right solution (even before I execute it), because all of the constraints I’ve been holding in my head get satisfied at once. I think that’s the “clicking” feeling.

Darwin’s insight about natural selection was also shaped by constraints. His time on the Beagle had led him to believe that “species gradually become modified,” but he was pretty puzzled as to how the changes were being introduced. If you imagine a beige lizard that lives in the sand, for instance, it seems pretty clear that it isn’t the lizard itself (its will) which causes its beigeness, nor is it the sand that directly causes the coloring (as in, physically causes it within the lizard’s lifetime). But then, how are changes introduced, if not by the organism, and not by the environment directly? He was stuck on this for a while, when: “I can remember the very spot in the road, whilst in my carriage, when to my joy the solution occurred to me.”

There’s more to Darwin’s story than that, but I do think it has elements of the sort of thing you're describing here. Jeff Hawkins also describes insight as a constraint satisfaction problem pretty explicitly (I might’ve gotten this idea from him), and he experienced it when coming up with the idea of a thousand brains.

Anyway, I don’t have a strong sense of how crucial this sort of thing is to novel conceptual inquiry in general, but I do think it’s quite interesting. It seems like one of the ways that someone can go from a pre-paradigmatic grasping around for clues sort of thing to a fully formed solution.

tailcalled on tailcalled's Shortform

Thinking further, a key part of it is that temperature has a tendency to mix stuff together, due to the associated microscopic kinetic energy.

james-oofou on AGI Ruin: A List of Lethalities

with a less than fifty percent change of killing more than one billion people

Typo, 'change' should be 'chance'

aditya-prasad-1 on We're already in AI takeoff

predictable

damn that hit me

javier-marin-valenzuela on Heartless Genius: The Peril of Emotionally Blind AI

Thank you very much for the criticism, Jeremias; I genuinely appreciate it. Please allow me to provide some context before commenting. Professor W.J. Wukmir developed the orectic theory, which I used in this post to provide a theory of emotions. He lived in Barcelona, where he arrived around 1960 and was stateless until his death in 1981. Unfortunately, his theories have been lost to the course of time. A few months ago, the last of his disciples passed away. I'm afraid his theory will be forgotten in some second-hand bookstore as a rarity. His disciples said that in his youth (in the early 1930s), Wukmir had intense disputes with Freud and Adler in the Vienna circle to which the three belonged. Freud saw him as a revolutionary, while Wukmir believed that Freud was unable to understand him. I apologize for this preamble, but my purpose was only to shed light on the origins of this theory.

The critique raises some important issues that should be carefully considered. However, I believe there are some fundamental misunderstandings concerning the nature and purpose of emotions as described by Wukmir's orectic theory, which I will address. I apologize for not being more clear in explaining them.

Emotions, according to this theory, are not just "shortcut heuristics" but also essential systems of vital orientation. Emotions are more than just quick evaluations in the absence of detail; they are complex, multidimensional evaluations that incorporate multiple aspects of an organism's internal state, external environment, and previous experiences. This valuation process takes place at all levels of an organism, from individual cells to complex brain systems. The goal here is to emphasize the relevance of the "valuation" process rather than focusing solely on the emotions involved.

Your critique implies a distinction between emotion and cognition, which my argument explicitly rejects. According to the orectic theory, emotion and cognition are inextricably related parts of the same process of vital orientation. Every cognitive act entails emotional valuation, and every emotional response contains cognitive components. This integration is not a "ghost in the machine," but rather an essential component of how living systems process information and make decisions. This is what we require from AI: process information and make decisions.

Your claim that AI can make decisions without emotions ignores the importance of emotions in decision-making, according to orectic theory. It's probably my fault for not explaining it more concisely. Emotions are not an add-on to decision-making; they are essential to the process of analyzing possibilities and deciding courses of action. According to this viewpoint, assigning meaning and value to stimuli and prospective actions is an emotional process.

The notion that ethics can be "arbitrarily programmed" without emotional capacity misses the importance of emotions in moral judgment. In my opinion, ethical decisions are fundamentally dependent on emotional valuations of situations and the resulting outcomes. A strictly logical system of ethics without emotional components would lack the critical ability to attach meaning and value to different outcomes.

Your argument appears to presuppose a restricted definition of intelligence that focuses on information processing and decision-making in a mechanical sense. The theory I described, as well as many modern perspectives in cognitive research, advocate for a more embodied and embedded conception of intelligence that must include emotional processes. From this standpoint, a "fully functioning artificial intelligence" would have to have emotion-like mechanisms.

While it is true that more precise prompts may elicit different answers, the purpose of the GPT-4o example was to demonstrate the limitations of purely language-based AI in grasping emotional context. Human communication is highly reliant on emotional understanding that extends beyond the literal meaning of words, and this understanding is rooted in shared embodied experiences.

Finally, the notion that language can completely compensate for the lack of nonverbal emotional cues overlooks the embodied character of emotional cognition. In my opinion, our emotional knowledge is largely based on our bodily experiences and cannot be completely reduced to linguistic descriptions.