Posts

ChatGPT Plugins - The Beginning of the End 2023-03-25T11:45:32.877Z

Comments

Comment by Bary Levy (bary-levi) on Tiny Mech Interp Projects: Emergent Positional Embeddings of Words · 2023-07-20T01:19:38.684Z · LW · GW

I want to generally encourage this kind of experiment-and-publish-quickly project. This might require a post of its own, but as someone with a background in both hacking and entrepreneurship, this kind of quick feedback loop is, in my opinion, an incredible strength of both, and I hope can be used to accelerate scientific progress, which is exactly what we need in alignment.

Comment by bary-levi on [deleted post] 2023-04-16T10:47:57.671Z

I actually don't think it has much impact on superintelligence. I shared this mostly because I thought it's a cool idea that we can implement now and can later be turned into a policy. Compared to existing policy proposals that don't limit training/usage, I think this can have a much larger impact

Comment by Bary Levy (bary-levi) on Spectrum of Independence · 2022-11-07T22:06:18.868Z · LW · GW

Might also be interesting to look at this from a Learned Helplessness point of view. Especially with helicopter parenting. Perhaps children aren't learning to solve their own problems independenly. I wouldn't be surprised if this contributes to the mental health epidemic.

Comment by Bary Levy (bary-levi) on Spectrum of Independence · 2022-11-07T22:03:59.466Z · LW · GW

A factor for why children are becoming less independent in the US might be car-centric city design. With unsafe streets, and no way to walk to school, friends or after-school activities, parents have no choice but to drive them around. Not Just Bikes has a great video on this

https://youtu.be/oHlpmxLTxpw

Comment by Bary Levy (bary-levi) on All AGI Safety questions welcome (especially basic ones) [~monthly thread] · 2022-11-02T15:56:46.065Z · LW · GW

I've seen in the term "AI Explainability" floating around in the mainstream ML community. Is there a major difference between that and what we in the AI Safety community call Interpretability?