Can anyone explain what language-conditioned visual reasoning is? I saw this term in this paper and I searched on the internet but I couldn't find a proper explanation.
Asked
Active
Viewed 17 times