As someone starting out in RL, could you help me understand the differences between actor-only, critic-only, and actor-critic methods?

Asked Jul 01 '22 at 11:20

Active Jul 01 '22 at 11:20

Viewed 35 times

I have been reading some medium articles and these three methods pop up a lot. I am wondering what the differences between these are, what are the advantages of one over the other, etc. Also from my understanding, actor-only method is synonymous to policy gradient methods, critic-only method is synonymous to value-based methods, and actor-critic methods use policy gradient methods for the actor part and value-based methods for the critic part. Am I correct? Any help would be greatly appreciated. Finally, what should one keep in mind when considering one or the other for their projects?

asked Jul 01 '22 at 11:20

No-Time-To-Day

You're asking many questions here. I think they can barely be in the same post, provided that you edit your post to emphasize what your main question is and the other questions are just sub-questions that would also help to answer the main one. – nbro Jul 02 '22 at 23:24
Hi, thanks for the reply. My main question would be what the difference is between these three: actor-only, critic-only, and actor-critic methods. – No-Time-To-Day Jul 04 '22 at 10:21

As someone starting out in RL, could you help me understand the differences between actor-only, critic-only, and actor-critic methods?

0 Answers0