Commonsense Reasoning for Natural Language Processing
Probably Approximately a Scientific Blog
JANUARY 12, 2021
Visual question answering models guess the answer based on the frequency of answers for the same type of question in the training set, e.g. replying "2" to any "how many" question. Image captioning models often learn to recognize objects based solely on their typical environment , and fail to recognize them outside their typical environment.
Let's personalize your content