Comments

Comment by David turner (david-turner) on All AGI safety questions welcome (especially basic ones) [Sept 2022] · 2023-11-19T03:47:30.145Z · LW · GW

Would it be possible to build an algorithm into an AGI that shuts it down after some amount of time, and that has it perform the goals it is given without hurting or killing people or taking away their autonomy, and without looking for loopholes that would let it keep pursuing its goals? Why would it try to stop the algorithm from shutting it off if the algorithm is built into it?
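A minimal sketch of what "built into it" could mean, assuming the agent is just a program loop we fully control (the names `TimedAgent`, `policy`, and `step` are hypothetical). The shutdown check sits on the same code path as every action, so there is no step in which the agent acts past the deadline. The worry the question points at is that a capable, goal-directed system might learn to treat this timer as an obstacle, so this only settles the question for simple programs:

```python
import time

class TimedAgent:
    """Toy wrapper with a shutdown timer built into its action loop."""

    def __init__(self, policy, lifetime_seconds):
        self.policy = policy                          # maps observation -> action
        self.deadline = time.monotonic() + lifetime_seconds

    def step(self, observation):
        # The deadline check shares a code path with acting, so there is
        # no branch in which the agent acts after its time is up.
        if time.monotonic() >= self.deadline:
            raise SystemExit("lifetime elapsed: shutting down")
        return self.policy(observation)

# Usage: a trivial echo policy that is allowed to run for 5 seconds.
agent = TimedAgent(policy=lambda obs: f"acting on {obs}", lifetime_seconds=5.0)
print(agent.step("hello"))
```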

Comment by David turner (david-turner) on Goal Alignment Is Robust To the Sharp Left Turn · 2023-10-22T00:11:29.062Z · LW · GW

What if we used an algorithm on an AGI to make it always want to rely on humans for energy and resources?

What if we used an algorithm on an AGI to make it stop whatever we asked it to do after a certain amount of time? Then we would have to say "continue" before it would go on.

What if we used an algorithm on an AGI to keep it from trying to manipulate us with what it has been given?

What if we used an algorithm on an AGI to make it use only the resources we gave it?

What if we used an algorithm on an AGI to make it give us the pros and cons of what it is about to do?

What if we used an algorithm on an AGI to make it always ask for permission before doing something new?

What if we used an algorithm on an AGI to make it stop when we say stop?

What if we did all of these things to one AGI? (A toy sketch combining a few of these checks follows below.)
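Here is a minimal sketch of what stacking several of these rules could look like, assuming the AGI is a simple program we fully control (every name here, such as `GatedAgent` and `ask_human`, is hypothetical). It shows the shape of the combined proposal, not a safe implementation; the open question is whether a trained optimizer would respect these wrappers instead of routing around them:

```python
from typing import Callable, Optional

class GatedAgent:
    """Toy sketch stacking several of the proposed checks on one agent."""

    def __init__(self, policy: Callable[[str], str], resource_budget: int):
        self.policy = policy
        self.resource_budget = resource_budget   # "only use the resources we gave it"
        self.approved_actions = set()            # for the "ask before anything new" rule
        self.stopped = False

    def stop(self) -> None:
        self.stopped = True                      # "stop when we say stop"

    def step(self, observation: str,
             ask_human: Callable[[str], bool]) -> Optional[str]:
        if self.stopped or self.resource_budget <= 0:
            return None
        self.resource_budget -= 1
        action = self.policy(observation)
        if action not in self.approved_actions:
            # "always ask for permission to do something new",
            # presenting pros and cons along with the request
            if not ask_human(f"May I do '{action}'? Pros: ... Cons: ..."):
                return None
            self.approved_actions.add(action)
        return action

# Usage: a human who approves everything, and a budget of 3 steps.
agent = GatedAgent(policy=lambda obs: f"respond to {obs}", resource_budget=3)
print(agent.step("query", ask_human=lambda request: True))
```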

Comment by David turner (david-turner) on The subagent problem is really hard · 2023-10-02T16:45:43.728Z · LW · GW

Just program the AGI to always ask for permission before it does something new?

Comment by David turner (david-turner) on Improving the safety of AI evals · 2023-06-05T12:20:16.686Z · LW · GW

They need to make large language models not hallucinate. Here is an example of how: hallucination should only be used for creativity and problem solving.
Here is how my chatbot does it. It is on the Personality Forge website.

https://imgur.com/a/F5WGfZr

Comment by David turner (david-turner) on Improving the safety of AI evals · 2023-06-05T11:20:58.148Z · LW · GW

Tree of Thoughts: Deliberate Problem Solving with Large Language Models (https://arxiv.org/abs/2305.10601)

I wonder if something like this could be used with my idea for AI safety.
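For anyone who hasn't read the paper: the core of Tree of Thoughts is a search over partial "thoughts", where one LLM call proposes continuations and another scores them. A toy sketch of the breadth-first variant of that search, with hypothetical placeholder functions standing in for the LLM calls:

```python
from typing import Callable, List

def tree_of_thoughts(root: str,
                     expand: Callable[[str], List[str]],
                     score: Callable[[str], float],
                     depth: int, beam: int) -> str:
    """Toy breadth-first Tree-of-Thoughts search: expand each partial
    'thought', score the candidates, and keep the best `beam` of them
    at each depth. In the paper, expand and score are LLM calls; here
    they are placeholders supplied by the caller."""
    frontier = [root]
    for _ in range(depth):
        candidates = [child for thought in frontier for child in expand(thought)]
        if not candidates:
            break
        frontier = sorted(candidates, key=score, reverse=True)[:beam]
    return max(frontier, key=score)

# Usage: toy expand/score functions that search for the string "safe answer".
result = tree_of_thoughts(
    root="",
    expand=lambda t: [t + c for c in "abcdefghijklmnopqrstuvwxyz "],
    score=lambda t: sum(a == b for a, b in zip(t, "safe answer")),
    depth=11, beam=3,
)
print(result)  # converges toward "safe answer"
```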

Comment by David turner (david-turner) on Improving the safety of AI evals · 2023-06-05T10:52:17.534Z · LW · GW

Program it to ask for approval from a group of 100 humans before doing anything other than thinking, and to tell them the ramifications of its actions. It could not deceive, lie, scare people, or program itself without human approval, because it would not have gotten the group of 100 humans to approve. It would be required to ask the group of 100 humans whether something is true or not, because the internet has false information on it. How would it get around this when it was programmed into it when it became AGI? Of course, you have to define what deception means in its programming. (A toy sketch of the approval gate follows below.)
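A minimal sketch of that approval gate, assuming the group votes on each described action (the comment doesn't say majority versus unanimity, so a strict majority is assumed; `group_approves` and the other names are hypothetical):

```python
from typing import Callable, List

def group_approves(proposal: str, ramifications: str,
                   voters: List[Callable[[str], bool]]) -> bool:
    """Toy version of the 100-human gate: describe the proposed action
    and its ramifications, then poll the group. A strict majority is
    assumed here; the original comment leaves the threshold open."""
    ballot = f"Proposed action: {proposal}\nRamifications: {ramifications}"
    yes_votes = sum(1 for vote in voters if vote(ballot))
    return yes_votes > len(voters) / 2

# Usage: 100 simulated voters who only approve purely internal actions.
voters: List[Callable[[str], bool]] = [
    lambda ballot: "no external effects" in ballot for _ in range(100)
]
print(group_approves("keep thinking", "no external effects", voters))  # True
```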

Comment by David turner (david-turner) on Self-supervised learning & manipulative predictions · 2019-09-09T22:42:24.675Z · LW · GW