Readme
attention based model that automatically learns to describe the content of images. datasets: Flickr8k
datasets: Flickr8k
This model runs on Nvidia T4 GPU hardware. Predictions typically complete within 1 seconds.
attention based model that automatically learns to describe the content of images. datasets: Flickr8k