Flickr30k
Emerging, 3 papers using it
First seen: 2021
Flickr30k is a dataset that contains images paired with descriptive captions, used to evaluate multimodal models on tasks such as image captioning and cross-modal retrieval.