I ask because PPO is apparently an on-policy algorithm, and the HER paper says that HER can be combined with any off-policy algorithm. Yet I see GitHub projects that have combined them somehow.
How is this done? And is it reasonable?