Posts

Comments

Comment by weightt an (weightt-an) on Richard Ngo's Shortform · 2024-03-21T13:00:25.211Z · LW · GW

If traders can get access to control panel for actions of the external agent AND they profit from accurately predicting its observations, then wouldn't the best strategy be "create as much chaos as possible that is only predictable to me, its creator". So, traders that value ONLY accurate predictions will get the advantage?

Comment by weightt an (weightt-an) on Causal confusion as an argument against the scaling hypothesis · 2022-07-22T17:24:46.729Z · LW · GW

Well maybe llms can "experiment" on their dataset by assuming something about it and then being modified if they encounter counterexample. 

 I think it vaguely counts as experimenting.

Comment by weightt an (weightt-an) on wrapper-minds are the enemy · 2022-07-09T20:22:05.187Z · LW · GW

I think that there may be wrapper-minds with very detailed utility functions, that whatever qualities you attribute to agents that are not them, the wrapper-mind's behavior will look like their with arbitrary precision on arbitrarily many evaluation parameters. I don't think it's practical or it's something that has a serious chance of happening, but I think it's a case that might be worth considering.

 

Like, maybe it's very easy to build a wrapper mind that is a very good approximation of very non wrapper mind. Who knows 

Comment by weightt an (weightt-an) on Debating Whether AI is Conscious Is A Distraction from Real Problems · 2022-06-21T20:53:16.121Z · LW · GW

Sounds like a statement "no AI can have or get them". 

Well it can learn it, it can develop them based on a dataset of people's stories. Especially it looks possible with the approach that is currently being used. 

Comment by weightt an (weightt-an) on A claim that Google's LaMDA is sentient · 2022-06-12T10:48:00.702Z · LW · GW
Comment by weightt an (weightt-an) on Book Review: Being You by Anil Seth · 2022-06-02T17:54:19.901Z · LW · GW

Isn't consciousness just a "read-only access thing to the world" then? Like is there some reason why dualism is not isomorphic to parallelism?

Comment by weightt an (weightt-an) on Deepmind's Gato: Generalist Agent · 2022-05-16T13:00:06.490Z · LW · GW

There is a lot more useful data on YouTube (by several orders of magnitude at least? idk), I think the next wave of such breakthrough models will train on video.

Comment by weightt an (weightt-an) on Interacting with a Boxed AI · 2022-04-14T11:20:57.521Z · LW · GW

Give it 140k chances to predict "rain or no rain, in this location and time?" and it has no chance.

Well i think it can just encode some message in this bits and you or your colleagues will eventually check it

Comment by weightt an (weightt-an) on Self-Integrity and the Drowning Child · 2021-10-29T07:14:45.346Z · LW · GW

Exactly