LessWrong 2.0 Reader

[link] Towards Monosemanticity: Decomposing Language Models With Dictionary Learning
Zac Hatfield-Dodds (zac-hatfield-dodds) · 2023-10-05T21:01:39.767Z · comments (21)

Politics is way too meta
Rob Bensinger (RobbBB) · 2021-03-17T07:04:42.187Z · comments (46)

Social Dark Matter
[DEACTIVATED] Duncan Sabien (Duncan_Sabien) · 2023-11-16T20:00:00.000Z · comments (112)

Mysteries of mode collapse
janus · 2022-11-08T10:37:57.760Z · comments (57)

Study Guide
johnswentworth · 2021-11-06T01:23:09.552Z · comments (48)

Hooray for stepping out of the limelight
So8res · 2023-04-01T02:45:31.397Z · comments (24)

[link] My hour of memoryless lucidity
Eric Neyman (UnexpectedValues) · 2024-05-04T01:40:56.717Z · comments (19)

[link] Intentionally Making Close Friends
Neel Nanda (neel-nanda-1) · 2021-06-27T23:06:49.269Z · comments (35)

A central AI alignment problem: capabilities generalization, and the sharp left turn
So8res · 2022-06-15T13:10:18.658Z · comments (53)

We Choose To Align AI
johnswentworth · 2022-01-01T20:06:23.307Z · comments (16)

Is AI Progress Impossible To Predict?
alyssavance · 2022-05-15T18:30:12.103Z · comments (39)

OpenAI: The Battle of the Board
Zvi · 2023-11-22T17:30:04.574Z · comments (82)

What Are You Tracking In Your Head?
johnswentworth · 2022-06-28T19:30:06.164Z · comments (81)

Sazen
[DEACTIVATED] Duncan Sabien (Duncan_Sabien) · 2022-12-21T07:54:51.415Z · comments (83)

My May 2023 priorities for AI x-safety: more empathy, more unification of concerns, and less vilification of OpenAI
Andrew_Critch · 2023-05-24T00:02:08.836Z · comments (39)

Guide to rationalist interior decorating
mingyuan · 2023-06-19T06:47:13.704Z · comments (45)

What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)
Andrew_Critch · 2021-03-31T23:50:31.620Z · comments (64)

Notes on Teaching in Prison
jsd · 2023-04-19T01:53:00.427Z · comments (12)

Don't die with dignity; instead play to your outs
Jeffrey Ladish (jeff-ladish) · 2022-04-06T07:53:05.172Z · comments (59)

The Base Rate Times, news through prediction markets
vandemonian · 2023-06-06T17:42:56.718Z · comments (40)

Toni Kurz and the Insanity of Climbing Mountains
GeneSmith · 2022-07-03T20:51:58.429Z · comments (67)

Gentleness and the artificial Other
Joe Carlsmith (joekc) · 2024-01-02T18:21:34.746Z · comments (33)

Humans are very reliable agents
alyssavance · 2022-06-16T22:02:10.892Z · comments (35)

We don’t trade with ants
KatjaGrace · 2023-01-10T23:50:11.476Z · comments (109)

Seven Years of Spaced Repetition Software in the Classroom
tanagrabeast · 2021-03-04T02:42:01.475Z · comments (38)

OpenAI: Facts from a Weekend
Zvi · 2023-11-20T15:30:06.732Z · comments (158)

Accidentally Load Bearing
jefftk (jkaufman) · 2023-07-13T16:10:00.806Z · comments (14)

Core Pathways of Aging
johnswentworth · 2021-03-28T00:31:49.698Z · comments (123)

[link] Scale Was All We Needed, At First
Gabe M (gabe-mukobi) · 2024-02-14T01:49:16.184Z · comments (31)

12 interesting things I learned studying the discovery of nature's laws
Ben Pace (Benito) · 2022-02-19T23:39:47.841Z · comments (40)

Your Cheerful Price
Eliezer Yudkowsky (Eliezer_Yudkowsky) · 2021-02-13T05:41:53.511Z · comments (82)

The 6D effect: When companies take risks, one email can be very powerful.
scasper · 2023-11-04T20:08:39.775Z · comments (40)

A Brief Introduction to Container Logistics
Vitor · 2021-11-11T15:58:11.510Z · comments (22)

[link] Where do your eyes go?
alkjash · 2021-09-19T22:43:47.491Z · comments (22)

Basics of Rationalist Discourse
[DEACTIVATED] Duncan Sabien (Duncan_Sabien) · 2023-01-27T02:40:52.739Z · comments (180)

Omicron Variant Post #1: We’re F***ed, It’s Never Over
Zvi · 2021-11-26T19:00:00.988Z · comments (95)

Express interest in an "FHI of the West"
habryka (habryka4) · 2024-04-18T03:32:58.592Z · comments (41)

Constellations are Younger than Continents
Jeffrey Heninger (jeffrey-heninger) · 2023-12-19T06:12:40.667Z · comments (22)

"Carefully Bootstrapped Alignment" is organizationally hard
Raemon · 2023-03-17T18:00:09.943Z · comments (22)

On green
Joe Carlsmith (joekc) · 2024-03-21T17:38:56.295Z · comments (34)

Changing the world through slack & hobbies
Steven Byrnes (steve2152) · 2022-07-21T18:11:05.636Z · comments (13)

AI Timelines
habryka (habryka4) · 2023-11-10T05:28:24.841Z · comments (74)

Your Dog is Even Smarter Than You Think
StyleOfDog · 2021-05-01T05:16:09.821Z · comments (108)

So, geez there's a lot of AI content these days
Raemon · 2022-10-06T21:32:20.833Z · comments (140)

Safetywashing
Adam Scholl (adam_scholl) · 2022-07-01T11:56:33.495Z · comments (20)

[link] [SEE NEW EDITS] No, *You* Need to Write Clearer
NicholasKross · 2023-04-29T05:04:01.559Z · comments (64)

Sexual Abuse attitudes might be infohazardous
Pseudonymous Otter · 2022-07-19T18:06:43.956Z · comments (71)

The Plan
johnswentworth · 2021-12-10T23:41:39.417Z · comments (78)

larger language models may disappoint you [or, an eternally unfinished draft]
nostalgebraist · 2021-11-26T23:08:56.221Z · comments (31)

[link] Paul Christiano named as US AI Safety Institute Head of AI Safety
Joel Burget (joel-burget) · 2024-04-16T16:22:06.937Z · comments (59)

Recent comments

habryka4 on [deleted]

Hmm, I have sympathy for this tag, but also I do feel like the tagging system probably shouldn't implicitly carry judgement. Seems valuable to keep your map separate from your incentives and all that.

Happy to discuss here what to do. I do think allowing people to somehow tag stuff that seems like it increases capabilities in some dangerous way seems good, but I do think it should come with less judgement in the site's voice (judgement in a user's voice is totally fine, but the tagging system speaks more with the voice of the site than any individual user).

habryka4 on Take the wheel, Shoggoth! (LW frontpage algorithm experiments)

Oh, yeah, admins currently have access to a purely recommended view, and I prefer it. I would be in favor of making that accessible to users (maybe behind a beta flag, or maybe not, depending on uptake).

erik-jenner on MATS Winter 2023-24 Retrospective

I don't know the answer to your actual question, but I'll note there are slightly fewer mech interp mentors than mentors listed in the "AI interpretability" area (though all of them are at least doing "model internals"). I'd say Stephen Casper and I aren't focused on interpretability in any narrow sense, and Nandi Schoots' projects also sound closer to science of deep learning than mech interp. Assuming we count everyone else, that leaves 11 out of 39 mentors, which is slightly less than ~8 out of 23 from the previous cohort (though maybe not by much).

raemon on Raemon's Shortform

New concept for my "qualia-first calibration" app idea that I just crystallized. The following are all the same "type":

1. "this feels 10% likely"

2. "this feels 90% likely"

3. "this feels exciting!"

4. "this feels confusing :("

5. "this is coding related"

6. "this is gaming related"

All of them are a thing you can track: "when I observe this, my predictions turn out to come true N% of the time".

Numerical probabilities are merely a special case (tho they still get additional tooling, since it's easier to visualize graphs and calculate Brier scores for them).

And then a major goal of the app is to come up with good UI to help you visualize and compare results for the "non-numeric-qualia".

Depending on circumstances, it might seem way more important to your prior "this feels confusing" than "this feels 90% likely". (I'm guessing there is some actual conceptual/mathy work that would need doing to build the mature version of this)
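A minimal sketch of the idea in this comment, assuming predictions are logged as a set of free-form tags plus an outcome: every tag, numeric or not, gets the same "came true N% of the time" statistic, and numeric probabilities additionally get a Brier score. The record format and the function names `hit_rate_by_tag` and `brier_score` are hypothetical, chosen only for illustration; they are not part of any existing app.

```python
from collections import defaultdict


def hit_rate_by_tag(records):
    """Return {tag: fraction of predictions carrying that tag that came true}.

    records: iterable of (tags, came_true) pairs, where tags is a list of
    strings such as "feels exciting" or "coding" (hypothetical format).
    """
    totals = defaultdict(lambda: [0, 0])  # tag -> [hits, count]
    for tags, came_true in records:
        for tag in tags:
            totals[tag][0] += int(came_true)
            totals[tag][1] += 1
    return {tag: hits / count for tag, (hits, count) in totals.items()}


def brier_score(forecasts):
    """Mean squared error between stated probabilities and 0/1 outcomes.

    forecasts: non-empty list of (probability, came_true) pairs.
    Lower is better; always answering 0.5 scores 0.25.
    """
    return sum((p - float(outcome)) ** 2 for p, outcome in forecasts) / len(forecasts)


# Made-up example log, for illustration only.
records = [
    (["feels exciting", "coding"], True),
    (["feels confusing"], False),
    (["feels exciting"], False),
    (["feels confusing", "coding"], True),
]
rates = hit_rate_by_tag(records)
score = brier_score([(0.9, True), (0.1, False)])
```

Treating "feels 90% likely" as just another tag means the same `hit_rate_by_tag` call covers both the numeric and the non-numeric cases; only the Brier score needs an actual number.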

nathan-helm-burger on Open Thread Spring 2024

Yeah, I should use that. I'd need to remember to unbookmark after reading it I suppose.

nicolas-lacombe on ACX Meetups Everywhere Spring 2024, Montreal, QC

we are outside in front of the entrance

nicholaskross on Please stop publishing ideas/insights/research about AI

Further observation about that second sentence.

lc on Dating Roundup #3: Third Time’s the Charm

It might very well be cultural. I am American and I literally cannot imagine doing either of those things with a woman I am not in a relationship with. I've never seen anybody else do them either.

christiankl on ChristianKl's Shortform

In addition to that from my perspective, I think that if every day of the year you consume the same amount of potassium you (as a typical office worker) likely consume either too much or too little on some days.

jacques-thibodeau on [deleted]

I think you are assuming optical transistors and photonic computing are the same thing, but they are not. Optical transistors are a component that could be used for photonic computing, but they are not necessary, and companies may have a better shot at getting photonic computing to work at scale without them.

Optical transistors try to function similarly to electronic transistors but use photons instead of electrons for signal processing. You are correct that optical transistors are currently not great, and getting them to work is an active area of research.

However, photonic computing is a broader concept that may or may not involve optical transistors as some of its components. Given the limitations of current optical transistors (as you point out), I understand that companies working on this typically use alternative photonic techniques to make it more feasible and practical for deep learning matrix multiplication.

Optical transistors are just not as technologically mature (and may never be) as photonic components like modulators and waveguides. For example, the paper I linked in the post, titled "Experimentally realized in situ backpropagation for deep learning in photonic neural networks", does not use optical transistors. Instead, the authors use some of the following components: Mach-Zehnder interferometers, thermo-optic phase shifters, photonic integrated circuits, and silicon photonic waveguides.

The final setup allows for matrix operations for backpropagation.
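To illustrate why interferometers and phase shifters are enough for matrix operations, here is a sketch of the textbook idealized (lossless) model of a single Mach-Zehnder interferometer: two 50:50 beamsplitters with a phase shifter before and between them act on two optical modes as a tunable 2x2 unitary matrix. This is an assumption-laden toy model, not the linked paper's actual setup, and the names `mzi`, `phase_shifter`, and `is_unitary` are hypothetical.

```python
import cmath
import math


def matmul2(a, b):
    """Multiply two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def phase_shifter(angle):
    """Phase shift applied to the first of the two modes."""
    return [[cmath.exp(1j * angle), 0], [0, 1]]


# Ideal lossless 50:50 beamsplitter.
S = 1 / math.sqrt(2)
BEAMSPLITTER = [[S, 1j * S], [1j * S, S]]


def mzi(theta, phi):
    """Transfer matrix: input phase phi, splitter, internal phase theta, splitter."""
    u = phase_shifter(phi)
    u = matmul2(BEAMSPLITTER, u)
    u = matmul2(phase_shifter(theta), u)
    u = matmul2(BEAMSPLITTER, u)
    return u


def is_unitary(u, tol=1e-9):
    """Check that the conjugate transpose of u times u is the identity."""
    dagger = [[u[j][i].conjugate() for j in range(2)] for i in range(2)]
    prod = matmul2(dagger, u)
    return all(abs(prod[i][j] - (1 if i == j else 0)) < tol
               for i in range(2) for j in range(2))


# In this model, the fraction of power staying in the first mode
# is sin^2(theta / 2), so theta tunes the splitting ratio.
u = mzi(theta=0.7, phi=1.3)
power_kept = abs(u[0][0]) ** 2
```

In the idealized model, meshes of such 2x2 blocks can be composed into larger unitary matrices, which is the sense in which phase settings play the role of trainable weights.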