Hard to understand for a newbie in deep RL. A formal definition of what a skill is ("A skill is simply a policy.") would help.

### Typos:

- "guaranteeing that is has maximum entropy"
- "We discuss the the log p(z) term in Appendix B."
- "so it much first gather momentum"
- "While are skills are learned"
I agree with the previous comment. The article seems to aim at people who are already familiar with reinforcement learning (not necessarily deep, or based on neural networks, I guess) and its usual benchmark. The implementations are not detailed; the authors emphasize the general idea (which is relatively simple to grasp) and the visual results, which look quite spectacular.
> The article seems to aim at people who are already familiar with X and its usual benchmark.

That can be said about quite every article.
>> The article seems to aim at people who are already familiar with X and its usual benchmark.
>
> That can be said about quite every article.

Sure, let me rephrase then. The article does not remind the reader of much contextual information or of the field's standard definitions. So it is not the right paper if you want to get a general idea of the field.
