Run time and cost
This model costs approximately $0.099 to run on Replicate, or 10 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.
This model runs on Nvidia A40 (Large) GPU hardware.
Predictions typically complete within 137 seconds.
The predict time for this model varies significantly based on the inputs.
Readme
Proof of concept to use depth-anything on 2D videos to create depth frames to create a 3D SBS video
To convert the final 3D SBS video into a Spatial Video for the Vision Pro check out this Spatial CLI tool for apple silicon: https://blog.mikeswanson.com/spatial