Questions tagged [multi-objective-rl]
For questions about multi-objective reinforcement learning (MORL).
5 questions
12 votes, 3 answers
Why is the reward in reinforcement learning always a scalar?
I'm reading Reinforcement Learning by Sutton & Barto, and in section 3.2 they state that the reward in a Markov decision process is always a scalar real number. At the same time, I've heard about the problem of assigning credit to an action for a…

Sid Mani · 223
3 votes, 2 answers
Can rewards be decomposed into components?
I'm training a robot to walk to a specific $(x, y)$ point using TD3, and, for simplicity, I have something like reward = distance_x + distance_y + standing_up_straight, which is then added to the replay buffer. However, I think that it…

pinkie pAI · 35
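A decomposed reward like the one in the question above can also be kept as a vector and collapsed to a scalar only at the point where the learner needs it. A minimal sketch of linear scalarization, reusing the question's component names (the values and weights are purely illustrative):

```python
import numpy as np

def vector_reward(distance_x, distance_y, standing_up_straight):
    """Vector-valued reward: one component per objective
    (component names follow the question; values are illustrative)."""
    return np.array([distance_x, distance_y, standing_up_straight])

def scalarize(reward_vec, weights):
    """Linear scalarization: weighted sum of the reward components."""
    return float(np.dot(weights, reward_vec))

r = vector_reward(-0.5, -0.2, 1.0)
# Equal weights reproduce the plain sum used in the question.
scalar_r = scalarize(r, np.array([1.0, 1.0, 1.0]))
print(scalar_r)
```

Storing the vector in the replay buffer and scalarizing at training time keeps the per-objective information available, e.g. for changing the weights later without recollecting data.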
2 votes, 1 answer
Can the rewards be matrices when using DQN?
I have a basic question. I'm working towards developing a reward function for my DQN. I'd like to train an RL agent to edit pixels on an image. I understand that convolutions are ideal for working with images, but I'd like to observe the agent doing…

junfanbl · 323
2 votes, 1 answer
What are preferences and preference functions in multi-objective reinforcement learning?
In RL (reinforcement learning) or MARL (multi-agent reinforcement learning), we have the usual tuple:
(state, action, transition_probabilities, reward, next_state)
In MORL (multi-objective reinforcement learning), we have two more additions to the…

Huan · 161
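A preference function in MORL is commonly taken to be a mapping from a vector of objective returns to a scalar utility. A minimal sketch assuming linear preferences — a non-negative weight vector summing to 1 — with hypothetical objective values:

```python
import numpy as np

def linear_preference(returns, w):
    """Scalar utility under linear preferences: u(R) = w . R,
    where w is a non-negative weight vector summing to 1."""
    w = np.asarray(w, dtype=float)
    assert np.all(w >= 0) and abs(w.sum() - 1.0) < 1e-9
    return float(np.dot(w, returns))

# Two objectives, e.g. speed vs. energy cost; values illustrative.
R = np.array([3.0, -1.0])
u = linear_preference(R, [0.75, 0.25])
print(u)  # 0.75 * 3.0 + 0.25 * (-1.0) = 2.0
```

Linear preferences are only one choice; nonlinear utilities (e.g. thresholds or Chebyshev scalarization) can express trade-offs that no fixed weighted sum captures.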
2 votes, 1 answer
What are some simple open problems in multi-agent RL that would be suited for a bachelor's thesis?
I've decided to do my bachelor's thesis on RL, and I'm currently struggling to find a good problem. I'm interested in multi-agent RL, particularly the dilemma between selfishness and cooperation.
I only have 2 months to complete this and I'm afraid that…

Rom · 139