LessWrong 2.0 Reader

View: New · Old · Top

← previous page (newer posts) · next page (older posts) →

The "AI Dungeons" Dragon Model is heavily path dependent (testing GPT-3 on ethics)
Rafael Harth (sil-ver) · 2020-07-21T12:14:32.824Z · comments (9)
Uncalibrated quantum experiments act clasically
justinpombrio · 2020-07-21T05:31:06.377Z · comments (12)
[link] The Rediscovery of Interiority in Machine Learning
DanB · 2020-07-21T05:02:33.324Z · comments (4)
Chains, Bottlenecks and Optimization
curi · 2020-07-21T02:07:27.953Z · comments (12)
"Can you keep this confidential? How do you know?"
Raemon · 2020-07-21T00:33:27.974Z · comments (41)
[link] Parallels Between AI Safety by Debate and Evidence Law
Cullen (Cullen_OKeefe) · 2020-07-20T22:52:09.185Z · comments (1)
[link] Thiel on Progress and Stagnation
Richard_Ngo (ricraz) · 2020-07-20T20:27:59.112Z · comments (32)
Learning Values in Practice
Stuart_Armstrong · 2020-07-20T18:38:50.438Z · comments (0)
Inefficient doesn't mean indifferent, but it might mean wimpy.
DirectedEvolution (AllAmericanBreakfast) · 2020-07-20T18:27:48.332Z · comments (3)
[question] To what extent is GPT-3 capable of reasoning?
TurnTrout · 2020-07-20T17:10:50.265Z · answers+comments (73)
Selling real estate: should you overprice or underprice?
Steven Byrnes (steve2152) · 2020-07-20T15:54:09.478Z · comments (5)
[question] "Do Nothing" utility function, 3½ years later?
niplav · 2020-07-20T11:09:36.946Z · answers+comments (3)
Operationalizing Interpretability
[deleted] · 2020-07-20T05:22:14.798Z · comments (0)
[link] Use resilience, instead of imprecision, to communicate uncertainty
habryka (habryka4) · 2020-07-20T05:08:52.759Z · comments (1)
What Would I Do? Self-prediction in Simple Algorithms
Scott Garrabrant · 2020-07-20T04:27:25.490Z · comments (12)
"Should Blackmail Be Legal" Hanson/Zvi Debate (Sun July 26th, 3pm PDT)
Ben Pace (Benito) · 2020-07-20T04:06:26.275Z · comments (13)
The 8 Techniques to Tolerify the Dark World
adamShimi · 2020-07-20T00:58:04.621Z · comments (5)
Praise of some popular LW articles
DirectedEvolution (AllAmericanBreakfast) · 2020-07-20T00:32:35.849Z · comments (1)
Types Of Online Meetups
Dan B (dan-b-1) · 2020-07-19T23:51:27.048Z · comments (2)
Musical Outgroups
[deleted] · 2020-07-19T22:55:12.007Z · comments (1)
Forum Assisted Discussion
Dan B (dan-b-1) · 2020-07-19T22:38:10.727Z · comments (0)
Pulse and Glide Cycling
jefftk (jkaufman) · 2020-07-19T19:02:24.070Z · comments (5)
[question] Math. proof of the superiority of independent guesses?
Milton · 2020-07-19T02:38:39.725Z · answers+comments (7)
Criticism of some popular LW articles
DirectedEvolution (AllAmericanBreakfast) · 2020-07-19T01:16:50.230Z · comments (19)
Swiss Political System: More than You ever Wanted to Know (I.)
Martin Sustrik (sustrik) · 2020-07-19T01:11:54.756Z · comments (39)
[question] Why is pseudo-alignment "worse" than other ways ML can fail to generalize?
nostalgebraist · 2020-07-18T22:54:50.957Z · answers+comments (9)
Against Reopening Ottawa
[deleted] · 2020-07-18T20:08:07.151Z · comments (2)
[link] Collection of GPT-3 results
Kaj_Sotala · 2020-07-18T20:04:50.027Z · comments (24)
[question] Is there an easy way to turn a LW sequence into an epub?
ChristianKl · 2020-07-18T18:20:03.795Z · answers+comments (9)
Calibrate words, not just probabilities
MikkW (mikkel-wilson) · 2020-07-18T05:56:11.120Z · comments (3)
[question] Erving Goffman’s ‘paper’
Saffron · 2020-07-18T01:12:25.587Z · answers+comments (2)
Lessons on AI Takeover from the conquistadors
Daniel Kokotajlo (daniel-kokotajlo) · 2020-07-17T22:35:32.265Z · comments (31)
[question] Can an agent use interactive proofs to check the alignment of succesors?
PabloAMC · 2020-07-17T19:07:54.072Z · answers+comments (2)
Anthropomorphizing Humans
johnswentworth · 2020-07-17T17:49:37.086Z · comments (6)
Telling more rational stories
DirectedEvolution (AllAmericanBreakfast) · 2020-07-17T17:47:31.831Z · comments (20)
Solving Math Problems by Relay
bgold · 2020-07-17T15:32:00.985Z · comments (26)
[question] What are the best tools you have seen to keep track of knowledge around testable statements?
migueltorrescosta · 2020-07-17T15:02:06.490Z · answers+comments (1)
Environments as a bottleneck in AGI development
Richard_Ngo (ricraz) · 2020-07-17T05:02:56.843Z · comments (19)
My Dating Plan ala Geoffrey Miller
snog toddgrass · 2020-07-17T04:52:29.612Z · comments (57)
Meta-preferences are weird
jacobjacob · 2020-07-16T23:03:40.226Z · comments (2)
Sunday July 19, 1pm (PDT) — talks by Raemon, ricraz, mr-hire, Jameson Quinn
jacobjacob · 2020-07-16T20:04:37.974Z · comments (6)
[question] What should be the topic of my LW mini-talk this Sunday (July 18th)?
Jameson Quinn (jameson-quinn) · 2020-07-16T16:32:54.241Z · answers+comments (3)
Covid 7/16: Becoming the Mask
Zvi · 2020-07-16T12:40:00.663Z · comments (20)
[link] Why associative operations?
Sunny from QAD (Evan Rysdam) · 2020-07-16T12:36:47.802Z · comments (7)
[question] How big of an issue are patent trolls to the average startup?
ChristianKl · 2020-07-16T11:31:05.934Z · answers+comments (4)
[AN #107]: The convergent instrumental subgoals of goal-directed agents
Rohin Shah (rohinmshah) · 2020-07-16T06:47:55.532Z · comments (1)
[AN #108]: Why we should scrutinize arguments for AI risk
Rohin Shah (rohinmshah) · 2020-07-16T06:47:38.322Z · comments (6)
Alignment proposals and complexity classes
evhub · 2020-07-16T00:27:37.388Z · comments (26)
[question] How should AI debate be judged?
abramdemski · 2020-07-15T22:20:33.950Z · answers+comments (26)
Automatically Turning Off Computer at Night
Raemon · 2020-07-15T20:42:10.021Z · comments (13)
← previous page (newer posts) · next page (older posts) →