Comments

Comment by knowsnothing on Alignment Implications of LLM Successes: a Debate in One Act · 2024-03-20T19:13:51.630Z · LW · GW

Any reason not to just run the experiment?

Comment by knowsnothing on Alignment Implications of LLM Successes: a Debate in One Act · 2024-03-20T18:58:52.669Z · LW · GW

Why not just run that experiment?

Comment by knowsnothing on Instrumental deception and manipulation in LLMs - a case study · 2024-02-26T17:56:23.456Z · LW · GW

Thank you for doing this. Would you mind if this is added to the Misalignment Database?

Comment by knowsnothing on Everything Wrong with Roko's Claims about an Engineered Pandemic · 2024-02-25T17:19:10.743Z · LW · GW

"For the most part, Roko's posts not only fail to engage with any scientific literature on the subject, but employ an extremely naive and ultimately misleading model that does not hold up to empirical and theoretical scrutiny."

Can be applied generally.

Comment by knowsnothing on Does literacy remove your ability to be a bard as good as Homer? · 2024-01-26T12:34:37.190Z · LW · GW

Been doing this. Reading less. Writing a LOT less. Memory has improved a lot.

Comment by knowsnothing on Do you know of any reliable DIY compendium of home physical therapy exercises? · 2023-09-16T18:38:56.035Z · LW · GW

Check out: https://m.youtube.com/@BobandBrad

Comment by knowsnothing on 6 non-obvious mental health issues specific to AI safety · 2023-08-18T21:38:48.637Z · LW · GW

The alienation is something I felt for a bit, until I started working on my project and working with folk, talking to folk, etc. Also, I've been very pleasantly surprised by how receptive non-AI/non-tech folk are when talking to them about AI risk, as long as it's framed in a down-to-earth, relatable manner, introduced organically, etc.

Comment by knowsnothing on An Ignorant View on Ineffectiveness of AI Safety · 2023-07-24T18:54:52.602Z · LW · GW

I disagree with this now.

Comment by knowsnothing on The Waluigi Effect (mega-post) · 2023-03-04T17:16:38.567Z · LW · GW

I think a lot of people think Sydney/Bing Chat is GPT-4.

Comment by knowsnothing on We don’t trade with ants · 2023-01-13T09:33:50.195Z · LW · GW

Humans can manipulate animals and make them do what they want. So could AI.

Comment by knowsnothing on How it feels to have your mind hacked by an AI · 2023-01-13T09:32:08.946Z · LW · GW

Manipulating lonely people is easy.

Comment by knowsnothing on How it feels to have your mind hacked by an AI · 2023-01-13T09:30:53.272Z · LW · GW

Sounds like you lack understanding of people.

Comment by knowsnothing on The Feeling of Idea Scarcity · 2023-01-13T09:28:25.585Z · LW · GW

Or they're desperate. And/or don't trust themselves to be ok if the idea fails.

Comment by knowsnothing on The Feeling of Idea Scarcity · 2023-01-13T09:27:30.318Z · LW · GW

"mistrust" is not the best approach here. Mistrusting yourself or your ideas can lead to misery and feeling lackadaisical. Could lead to lacking motivation to pursue an idea as hard as you otherwise might.

"Openness" is a better idea imo. Openness to the idea failing or taking some adjustment to reach success, openness to it succeeding as well. Looking at ideas not just as a way to achieve success, but as a way to test your view of the world and to learn something new about it by testing and working on them.

Comment by knowsnothing on All AGI Safety questions welcome (especially basic ones) [~monthly thread] · 2023-01-05T11:26:56.617Z · LW · GW

Is trying to reduce internet usage, and maybe the amount of data AI companies have to work with, at all feasible?