Highest Voted 'continuous-tasks' Questions - Artificial Intelligence Stack Exchange

6

votes

1 answer

What are the advantages of RL with actor-critic methods over actor-only methods?

In general, what are the advantages of RL with actor-critic methods over actor-only (or policy-based) methods? This is not a comparison with the Q-learning series, but probably a method of learning the game with only the actor. I think it's…

asked Jan 12 '21 at 22:29

ground clown

111
2

2

votes

4 answers

How can the Cart Pole problem be a continuing task?

In Introduction to Reinforcement Learning (2nd edition) by Sutton and Barto, there is an example of the Pole-Balancing problem (Example 3.4). In this example, they write that this problem can be treated as an episodic task or continuing task. I…

reinforcement-learning sutton-barto continuous-tasks episodic-tasks

asked Aug 04 '18 at 03:53

user3595632

175
4

2

votes

0 answers

Why are agents trained in episodes, even in non-episodic tasks?

Let's consider some non-episodic problem. Maybe a game which can go on forever. My question is: Why are agents still trained in episodes? My understanding is that the agent's neural network is updated in batches depending on the batch size (so every…

reinforcement-learning deep-rl continuous-tasks episodes

asked Jun 08 '22 at 18:03

Vladimir Belik

342
2
12

2

votes

1 answer

For continuing tasks, is the choice of episode length completely arbitrary?

Let's say I'm training a reinforcement learning agent to act in some environment that perpetually continues to give the agent opportunities to earn rewards, and there is no cap on the score and there is no way to "win". That is, there is no natural…

reinforcement-learning hyper-parameters continuous-tasks

asked Jun 05 '22 at 18:29

Vladimir Belik

342
2
12

1

vote

0 answers

Knowing the futility of discounting in continuing problems, how can we say discounting has no role in control problems with function approximation?

Sutton-Barto (Section 10.4, page 254): Based on the futility of discounting in continuing problems, how can we conclude that discounting has no role to play in control problems with function approximation?

reinforcement-learning function-approximation sutton-barto discount-factor continuous-tasks

asked Apr 19 '22 at 17:51

user3489173

179
6

1

vote

1 answer

Predicting continous value with CNN (prediction of fruit maturity)

I want to train some IA algorithm to be able to evaluate the maturity of a fruit (say, measured in numbers of days before rotten) based on an image of the fruit. My first instinct is to go with convolutional neural network (CNN), since those have…

convolutional-neural-networks image-recognition continuous-tasks

asked Jan 18 '21 at 15:58

Antoine Labelle

141
6

Questions tagged [continuous-tasks]

What are the advantages of RL with actor-critic methods over actor-only methods?

How can the Cart Pole problem be a continuing task?

Why are agents trained in episodes, even in non-episodic tasks?

For continuing tasks, is the choice of episode length completely arbitrary?

Knowing the futility of discounting in continuing problems, how can we say discounting has no role in control problems with function approximation?

Predicting continous value with CNN (prediction of fruit maturity)