(Updated 2 months, 1 week ago)
This is the f-lite model from FAL & Freepik optimised for 2x speedups through pruna
This is the fastest Flux Dev endpoint in the world, contact us for more at pruna.ai
This a pruna optimised version of the flux 1.dev model.
This is a 3x faster FLUX.1 [schnell] model from Black Forest Labs, optimised with pruna with minimal quality loss. Contact us for more at pruna.ai
This is the hidream-e1 model accelerated with the pruna optimisation engine.
This is an optimised version of the hidream-l1-dev model using the pruna ai optimisation toolkit!
This is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!
This is an optimised version of the hidream-full model using the pruna ai optimisation toolkit!
hunyuan3d-2 optimised with the pruna toolkit: https://github.com/PrunaAI/pruna
A 2x faster qwen 3 model through pruna oss
This is the fastest sdxl-lightning endpoint in the world on A100, contact us for more at pruna.ai
This model is an optimised version of stable-diffusion by stability AI that is 3x faster and 3x cheaper.
This is VACE-1.3B model optimised with pruna ai. Wan2.1 VACE is an all-in-one model for video creation and editing.
This is a faster VACE-14B model, optimised with pruna, contact us for more at pruna.ai
This model is cold. You'll get a fast response if the model is warm and already running, and a slower response if the model is cold and starting up.
This model runs on A100 (80GB). View more.