technillogue/vllm-hf-test | Run with an API on Replicate

Public

16 runs

Run time and cost

This model runs on NVIDIA L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.

This model doesn't have a readme.

Model created over 1 year ago