Flickr-8k
Emerging3papers using it
2024first seen
Flickr8k is a dataset that contains images and their corresponding textual descriptions, used to evaluate multimodal speech recognition systems.
Flickr8k is a dataset that contains images and their corresponding textual descriptions, used to evaluate multimodal speech recognition systems.