LessWrong 2.0 Reader

View: New · Old · Top

← previous page (newer posts) · next page (older posts) →

The "AI Dungeons" Dragon Model is heavily path dependent (testing GPT-3 on ethics)
Rafael Harth (sil-ver) · 2020-07-21T12:14:32.824Z · comments (9)

Uncalibrated quantum experiments act clasically
justinpombrio · 2020-07-21T05:31:06.377Z · comments (12)

[link] The Rediscovery of Interiority in Machine Learning
DanB · 2020-07-21T05:02:33.324Z · comments (4)

Chains, Bottlenecks and Optimization
curi · 2020-07-21T02:07:27.953Z · comments (12)

"Can you keep this confidential? How do you know?"
Raemon · 2020-07-21T00:33:27.974Z · comments (41)

[link] Parallels Between AI Safety by Debate and Evidence Law
Cullen (Cullen_OKeefe) · 2020-07-20T22:52:09.185Z · comments (1)

[link] Thiel on Progress and Stagnation
Richard_Ngo (ricraz) · 2020-07-20T20:27:59.112Z · comments (32)

Learning Values in Practice
Stuart_Armstrong · 2020-07-20T18:38:50.438Z · comments (0)

Inefficient doesn't mean indifferent, but it might mean wimpy.
DirectedEvolution (AllAmericanBreakfast) · 2020-07-20T18:27:48.332Z · comments (3)

[question] To what extent is GPT-3 capable of reasoning?
TurnTrout · 2020-07-20T17:10:50.265Z · answers+comments (73)

Selling real estate: should you overprice or underprice?
Steven Byrnes (steve2152) · 2020-07-20T15:54:09.478Z · comments (5)

[question] "Do Nothing" utility function, 3½ years later?
niplav · 2020-07-20T11:09:36.946Z · answers+comments (3)

Operationalizing Interpretability
[deleted] · 2020-07-20T05:22:14.798Z · comments (0)

[link] Use resilience, instead of imprecision, to communicate uncertainty
habryka (habryka4) · 2020-07-20T05:08:52.759Z · comments (1)

What Would I Do? Self-prediction in Simple Algorithms
Scott Garrabrant · 2020-07-20T04:27:25.490Z · comments (12)

"Should Blackmail Be Legal" Hanson/Zvi Debate (Sun July 26th, 3pm PDT)
Ben Pace (Benito) · 2020-07-20T04:06:26.275Z · comments (13)

The 8 Techniques to Tolerify the Dark World
adamShimi · 2020-07-20T00:58:04.621Z · comments (5)

Praise of some popular LW articles
DirectedEvolution (AllAmericanBreakfast) · 2020-07-20T00:32:35.849Z · comments (1)

Types Of Online Meetups
Dan B (dan-b-1) · 2020-07-19T23:51:27.048Z · comments (2)

Musical Outgroups
[deleted] · 2020-07-19T22:55:12.007Z · comments (1)

Forum Assisted Discussion
Dan B (dan-b-1) · 2020-07-19T22:38:10.727Z · comments (0)

Pulse and Glide Cycling
jefftk (jkaufman) · 2020-07-19T19:02:24.070Z · comments (5)

[question] Math. proof of the superiority of independent guesses?
Milton · 2020-07-19T02:38:39.725Z · answers+comments (7)

Criticism of some popular LW articles
DirectedEvolution (AllAmericanBreakfast) · 2020-07-19T01:16:50.230Z · comments (19)

Swiss Political System: More than You ever Wanted to Know (I.)
Martin Sustrik (sustrik) · 2020-07-19T01:11:54.756Z · comments (39)

[question] Why is pseudo-alignment "worse" than other ways ML can fail to generalize?
nostalgebraist · 2020-07-18T22:54:50.957Z · answers+comments (9)

Against Reopening Ottawa
[deleted] · 2020-07-18T20:08:07.151Z · comments (2)

[link] Collection of GPT-3 results
Kaj_Sotala · 2020-07-18T20:04:50.027Z · comments (24)

[question] Is there an easy way to turn a LW sequence into an epub?
ChristianKl · 2020-07-18T18:20:03.795Z · answers+comments (9)

Calibrate words, not just probabilities
MikkW (mikkel-wilson) · 2020-07-18T05:56:11.120Z · comments (3)

[question] Erving Goffman’s ‘paper’
Saffron · 2020-07-18T01:12:25.587Z · answers+comments (2)

Lessons on AI Takeover from the conquistadors
Daniel Kokotajlo (daniel-kokotajlo) · 2020-07-17T22:35:32.265Z · comments (31)

[question] Can an agent use interactive proofs to check the alignment of succesors?
PabloAMC · 2020-07-17T19:07:54.072Z · answers+comments (2)

Anthropomorphizing Humans
johnswentworth · 2020-07-17T17:49:37.086Z · comments (6)

Telling more rational stories
DirectedEvolution (AllAmericanBreakfast) · 2020-07-17T17:47:31.831Z · comments (20)

Solving Math Problems by Relay
bgold · 2020-07-17T15:32:00.985Z · comments (26)

[question] What are the best tools you have seen to keep track of knowledge around testable statements?
migueltorrescosta · 2020-07-17T15:02:06.490Z · answers+comments (1)

Environments as a bottleneck in AGI development
Richard_Ngo (ricraz) · 2020-07-17T05:02:56.843Z · comments (19)

My Dating Plan ala Geoffrey Miller
snog toddgrass · 2020-07-17T04:52:29.612Z · comments (57)

Meta-preferences are weird
jacobjacob · 2020-07-16T23:03:40.226Z · comments (2)

Sunday July 19, 1pm (PDT) — talks by Raemon, ricraz, mr-hire, Jameson Quinn
jacobjacob · 2020-07-16T20:04:37.974Z · comments (6)

[question] What should be the topic of my LW mini-talk this Sunday (July 18th)?
Jameson Quinn (jameson-quinn) · 2020-07-16T16:32:54.241Z · answers+comments (3)

Covid 7/16: Becoming the Mask
Zvi · 2020-07-16T12:40:00.663Z · comments (20)

[link] Why associative operations?
Sunny from QAD (Evan Rysdam) · 2020-07-16T12:36:47.802Z · comments (7)

[question] How big of an issue are patent trolls to the average startup?
ChristianKl · 2020-07-16T11:31:05.934Z · answers+comments (4)

[AN #107]: The convergent instrumental subgoals of goal-directed agents
Rohin Shah (rohinmshah) · 2020-07-16T06:47:55.532Z · comments (1)

[AN #108]: Why we should scrutinize arguments for AI risk
Rohin Shah (rohinmshah) · 2020-07-16T06:47:38.322Z · comments (6)

Alignment proposals and complexity classes
evhub · 2020-07-16T00:27:37.388Z · comments (26)

[question] How should AI debate be judged?
abramdemski · 2020-07-15T22:20:33.950Z · answers+comments (26)

Automatically Turning Off Computer at Night
Raemon · 2020-07-15T20:42:10.021Z · comments (13)

← previous page (newer posts) · next page (older posts) →

Archive

Recent comments

weightt-an on LLMs could be as conscious as human emulations, potentially

you're making a token-predicting transformer out of a virtual system with a human emulation as a component.

Should it make a difference? Same iterative computation.

In the system, the words "what's your earliest memory?" appearing on the paper are going to trigger all sorts of interesting (emulated) neural mechanisms that eventually lead to a verbal response, but the token predictor doesn't necessarily need to emulate any of that.

Yes, I talked about optimizations a bit. I think you are missing a point of this example. The point is that if you are trying to conclude from the fact that this system is doing next token prediction then it's definitely not conscious, you are wrong. And my example is an existence proof, kind of.

review-bot on Paul Christiano named as US AI Safety Institute Head of AI Safety

The LessWrong Review [? · GW] runs every year to select the posts that have most stood the test of time. This post is not yet eligible for review, but will be at the end of 2025. The top fifty or so posts are featured prominently on the site throughout the year. Will this post make the top fifty?

weightt-an on LLMs could be as conscious as human emulations, potentially

>It seems you are arguing that anything that presents like it is conscious implies that it is conscious.

No? That's definitely not what I'm arguing.

>But what ultimately matters is what this thing IS, not how it became in that way. If, this thing internalized that conscious type of processing from scratch, without having it natively, then resulting mind isn't worse than the one that evolution engineered with more granularity. Doesn't matter if this human was assembled atom by atom on molecular assembler, it's still a conscious human.

Look, here I'm talking about pathways to acquire that "structure" inside you. Not outlook of it.

avturchin on Magic by forgetting

non-disease copies do not need to perform any changes in their meditation routine in this model, assuming that they naturelly forget their disease status during meditation.

mondsemmel on Thoughts on seed oil

You might appreciate the perspective in the short post Statistical models & the irrelevance of rare exceptions [LW · GW]. (I previously commented [LW(p) · GW(p)] something similar on a post by Duncan.)

ablue on LLMs could be as conscious as human emulations, potentially

I don't think that in the example you give, you're making a token-predicting transformer out of a human emulation; you're making a token-predicting transformer out of a virtual system with a human emulation as a component. In the system, the words "what's your earliest memory?" appearing on the paper are going to trigger all sorts of interesting (emulated) neural mechanisms that eventually lead to a verbal response, but the token predictor doesn't necessarily need to emulate any of that. In fact, if the emulation is deterministic, it can just memorize whatever response is given. Maybe gradient descent is likely to make the LLM conscious in order to efficiently memorize the outputs of a partly conscious system, but that's not obvious.

If you have a brain emulation, the best way to get a conscious LLM seems to me like it would be finding a way to tokenize emulation states and training it on those.

gunnar_zarncke on LLMs could be as conscious as human emulations, potentially

Ok. It seems you are arguing that anything that presents like it is conscious implies that it is conscious. You are not arguing whether or not the structure of LLMs can give rise to consciousness.

But then your argument is a social argument. I'm fine with a social definition of consciousness - after all, our actions depend to a large degree on social feedback and morals (about what beings have value) at different times have been very different and thus been socially construed.

But then why are you making a structural argument about LLMs in the end?

PS. In fact, I commented on the filler symbol paper when Xixidu posted about it and I don't think that's a good comparison.

seth-herd on The Prop-room and Stage Cognitive Architecture

I agree with all of that. Even being sceptical that LLMs plus search will reach AGI. The lack of constraint satisfaction as the human brain does it could be a real stumbling block.

But LLMs have copied a good bit of our reasoning and therefore our semantic search. So they can do something like constraint satisfaction.

Put the constraints into a query, and the answer will satisfy those constraints. The process used is different than a human brain, but for every problem I can think of, the results are the same.

Now, that's partly because every problem I can think of is one I've already seen solved. But my ability to do truly novel problem solving is rarely used and pretty limitted. So I'm not sure the LLM can't do just as good a job if it had a scaffolded script to explore its knowledge base from a few different angles.

metachirality on What is the easiest/funnest way to build up a comprehensive understanding of AI and AI Safety?

Vanessa Kosoy has a list specifically for her alignment agenda but is probably applicable to agent foundations in general: https://www.alignmentforum.org/posts/fsGEyCYhqs7AWwdCe/learning-theoretic-agenda-reading-list [AF · GW]

review-bot on Express interest in an "FHI of the West"

The LessWrong Review [? · GW] runs every year to select the posts that have most stood the test of time. This post is not yet eligible for review, but will be at the end of 2025. The top fifty or so posts are featured prominently on the site throughout the year. Will this post make the top fifty?