nohamoamary/image-captioning-with-visual-attention

Public
datasets: Flickr8k
681 runs

Run time and cost

Predictions run on CPU hardware. Predictions typically complete within 3 minutes. The predict time for this model varies significantly based on the inputs.

Readme

attention based model that automatically learns to describe the content of images. datasets: Flickr8k

Replicate