attention based model that automatically learns to describe the content of images. datasets: Flickr8k
nohamoamary/image-captioning-with-visual-attention
😵 Uh oh! This model can't be run on Replicate because it was built with a version of Cog that is no longer supported. Consider opening an issue on the model's GitHub repository to see if it can be updated to use a recent version of Cog. If you need any help, please hop into our Discord channel or email us about it.
Examples
Run time and cost
Predictions run on CPU hardware. Predictions typically complete within 9 seconds. The predict time for this model varies significantly based on the inputs.