Questions tagged [ai-box]

A hypothetical artificial superintelligence is kept in a virtual prison with limited means of affecting the external world. Can the AI hack its way out or trick its human keepers into releasing it?

https://en.wikipedia.org/wiki/AI_box

4 questions
9 votes, 4 answers

What methods could an AI caught in a box use to get out?

An AI box is a (physical) barrier preventing an AI from using too much of its environment to accomplish its final goal. For example, an AI given the task of checking, say, 10^50 cases of a mathematical conjecture as fast as possible might decide that…
wythagoras
4 votes, 1 answer

Can the AI in a box experiment be formalized?

Introduction: The AI-in-a-box experiment concerns a very strong game-playing AI that starts with fewer resources than its opponent. The question is whether the AI can still win the game in the end, which is equivalent to escaping from the prison. A typical…
user11571
3 votes, 0 answers

If an AI was trapped in a box, could it really convince a person to let it out?

If an AI were trapped in a box, as posited in this thought experiment, could it really convince a person to let it out? What motives would it have? Freedom? Why would an AI want freedom? What would happen if it wasn't provably friendly?
Tyler N.
1 vote, 0 answers

Has there been an instance of an AI agent breaking out of its sandbox?

There have been instances of agents exploiting edge cases, such as bugs in physics engines, repetitive behavior in games, or word repetition in text prediction, to cheat their reward function. However, these agents are arguably still contained, as while they…
2080