bytedance / sa2va-4b-image
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos (Updated 4 months ago)
Prediction
bytedance/sa2va-4b-image:ccf8e58352ce5b2a6cbf5cb68b947ca3bc62009ce66e82071eeae14555414411
ID: 688a7frg6nrme0cn5g1a7z9pmr
Status: Succeeded
Source: Web
Hardware: L40S

Prediction
bytedance/sa2va-4b-image:ccf8e58352ce5b2a6cbf5cb68b947ca3bc62009ce66e82071eeae14555414411
ID: hc4w2m2vg9rme0cn5g1s6jh1ym
Status: Succeeded
Source: Web
Hardware: L40S

Output
A snowboarder is captured mid-air, performing a jump on a snowy slope. The snowboarder is wearing a white jacket and black pants, with a red snowboard beneath them. The scene is set against a backdrop of a cloudy sky and snow-covered mountains, creating a visually stunning and adventurous atmosphere.

Prediction
bytedance/sa2va-4b-image:ccf8e58352ce5b2a6cbf5cb68b947ca3bc62009ce66e82071eeae14555414411
ID: qj3xwqxeydrm80cn5g1vff1jgc
Status: Succeeded
Source: Web
Hardware: L40S

Prediction
bytedance/sa2va-4b-image:ccf8e58352ce5b2a6cbf5cb68b947ca3bc62009ce66e82071eeae14555414411
ID: 2tz4kzppg1rma0cn5g1v7ngx4r
Status: Succeeded
Source: Web
Hardware: L40S

Output

Prediction
bytedance/sa2va-4b-image:ccf8e58352ce5b2a6cbf5cb68b947ca3bc62009ce66e82071eeae14555414411
ID: vwh1qc1z0xrma0cn5g28fjzxjr
Status: Succeeded
Source: Web
Hardware: L40S

Prediction
bytedance/sa2va-4b-image:ccf8e58352ce5b2a6cbf5cb68b947ca3bc62009ce66e82071eeae14555414411
ID: 43d2wf2yenrme0cn5g2bbj9qsw
Status: Succeeded
Source: Web
Hardware: L40S

Prediction
bytedance/sa2va-4b-image:ccf8e58352ce5b2a6cbf5cb68b947ca3bc62009ce66e82071eeae14555414411
ID: 7v61mpn50drmc0cn5g2ab4cnf0
Status: Succeeded
Source: Web
Hardware: L40S

Prediction
bytedance/sa2va-4b-image:ccf8e58352ce5b2a6cbf5cb68b947ca3bc62009ce66e82071eeae14555414411
ID: 6229hx66xsrma0cn5g29tb0qwr
Status: Succeeded
Source: Web
Hardware: L40S