For questions related to the (automated) planning problem, which is the problem of finding a plan, i.e. a sequence of actions to move from an initial state to a goal state or a policy (a function from states to actions), and planning algorithms. There are different ways to define a planning problem (such as PDDL) and solve a planning problem (e.g. GraphPlan). In reinforcement learning, planning consists in finding a policy that solves an MDP.
Questions tagged [planning]
24 questions
10
votes
3 answers
What algorithms are considered reinforcement learning algorithms?
What are the areas/algorithms that belong to reinforcement learning?
TD(0), Q-Learning and SARSA are all temporal-difference algorithms, which belong to the reinforcement learning area, but is there more to it?
Are the dynamic programming…

Miguel Saraiva
- 767
- 1
- 5
- 14
8
votes
1 answer
MCTS: How to choose the final action from the root
When the time allotted to Monte Carlo tree search runs out, what action should be chosen from the root?
The original UCT paper (2006) says bestAction in their algorithm.
Monte-Carlo Tree Search: A New Framework for Game AI (2008) says
The game…

user76284
- 347
- 1
- 14
8
votes
2 answers
What is the difference between search and planning?
I'm reading the book Artificial Intelligence: A Modern Approach (by Stuart Russell and Peter Norvig).
However, I don't understand the difference between search and planning. I was more confused when I saw that some search problems can be determined…

theantomc
- 263
- 2
- 8
5
votes
3 answers
Isn't a simulation a great model for model-based reinforcement learning?
Most reinforcement learning agents are trained in simulated environments. The goal is to maximize performance in (often) the same environment, preferably with a minimum amount of interactions. Having a good model of the environment allows to use…

Ray Walker
- 451
- 3
- 8
4
votes
1 answer
What AI technique should I use to assign a person to a task?
I'm trying to learn AI and thinking to apply it to our system. We have an application for the translation industry. What we are doing now is the coordinator $C$ assigns a file to a translator $T$. The coordinator usually considers these criteria…

Jaime Sangcap
- 143
- 4
4
votes
2 answers
What is "planning" in the context of reinforcement learning, and how is it different from RL and SL?
This is an excerpt taken from Sutton and Barto (pg. 3):
Another key feature of reinforcement learning is that it explicitly considers the whole problem of a goal-directed agent interacting with an uncertain environment. This is in contrast with…
user9947
3
votes
1 answer
Several questions related to UCT and MCTS
In Bandit Based Monte-Carlo Planning, the article where UCT is introduced as a planning algorithm, there is an algorithm description in page 285 (4 of the pdf).
Comparing this implementation of UCT (a specific type of MCTS algorithm) to the…

Miguel Saraiva
- 767
- 1
- 5
- 14
3
votes
1 answer
How to transform a PDDL to search?
I have a question about search and planning:
I still haven't understood the difference from the two, but they seem very similar to me; here is a question I am struggling with:
"Having formulated a PDDL problem, transform it into research,
…

theantomc
- 263
- 2
- 8
3
votes
1 answer
Is there any AI system for finding the best way to schedule university classes?
I was wondering whether there is an AI system which could be used to resolve the class clashes problem which mostly happens in universities. In almost every university students face this problem, where two or more courses that many students want to…

Israr Ali
- 103
- 5
2
votes
0 answers
What trait of a planning problem makes reinforcement learning a well suited solution?
Planning problems have been the first problems studied at the dawn of AI (Shakey the robot). Graph search (e.g. A*) and planning (e.g. GraphPlan) algorithms can be very efficient at generating a plan. As for problem formulation, for planning…

50k4
- 225
- 1
- 8
2
votes
1 answer
Can't solve Towers of Hanoi in PDDL
I'm using PDDL to generate a plan to solve this tower of Hanoi puzzle. I'll give the problem, the rules, the domain and fact sheet for everything.
PDDL is telling me that the goal can be simplified to false; however, I know for a fact that this…

Michaelslec
- 131
- 4
2
votes
1 answer
FastDownward PDDL Planner Limitations
I recently had a look at automated planners and experimented a little bit with FastDownward. As I wanted to start a toy project, I created a PDDL model for the ordinary 3D Rubik's Cube (of course using a planner may not be the most efficient…

ttttttt1
- 23
- 2
2
votes
0 answers
How can I define the relations, preconditions and effects of each operator for the Sokoban puzzle?
I would like to solve the Sokoban puzzle, which consists in moving a character in a 2D map to push boulders into target cells. Each turn, the player can move to an adjacent cell (no diagonals) if it is empty, or push a boulder one step further. To…

Peyton
- 21
- 1
2
votes
2 answers
Can PDDL be utilized for action recognition?
The Planning Domain Definition Language (PDDL) is known for its capabilities of symbolic planning in the state space. A solver will find a sequence of steps to bring the system from a start state to the goal state. A common example of this is the…
user11571
2
votes
0 answers
How to choose method for solving planning problems?
There are many methods and algorithms dealing with planning problems.
If I understand correctly, according to Wikipedia, there are classical planning problems, with:
a unique known initial state,
duration-less actions,
deterministic actions,
which…

50k4
- 225
- 1
- 8