Questions tagged [ai-box]

A hypothetical artificial superintelligence is kept in a virtual prison with limited means of affecting the external world. Can the AI hack its way out or trick its human keepers into releasing it?

https://en.wikipedia.org/wiki/AI_box

4 questions

votes

4 answers

What methods could an AI caught in a box use to get out?

An AI box is a (physical) barrier preventing an AI from using too much of his environment to accomplish his final goal. For example, an AI given the task to check, say, 1050 cases of a mathematical conjecture as fast as possible, might decide that…

ai-box

asked Aug 30 '16 at 18:16

wythagoras

1,511
12
27

votes

1 answer

Can the AI in a box experiment be formalized?

Introduction The AI in a box experiment is about a super strong game AI which starts with lower resources than the opponent and the question is, if the AI is able to win the game at the end, which is equal to escape from the prison. A typical…

philosophy control-problem ai-box

asked Jul 26 '19 at 14:39

user11571

votes

0 answers

If an AI was trapped in a box, could it really convince a person to let it out?

If an AI was trapped in a box, as posited in this thought experiment, could it really convince a person to let it out? What motives would it have? Freedom? Why would an AI want freedom? What would happen if it wasn't provably friendly?

philosophy agi superintelligence ai-box

asked Apr 06 '17 at 17:59

Tyler N.

vote

0 answers

Has there been an instance of an AI agent breaking out of its sandbox?

There have been instances of agents using edge cases like bugs in physics engines, repetitive behavior in games or word repetition in text prediction to cheat their reward function. However, these agents are arguably still contained, as while they…

intelligent-agent ai-safety ai-box reward-hacking

asked Jul 02 '22 at 17:34

2080