camenduru / moe-llava

MoE-LLaVA

  • Public
  • 1.4M runs
  • GitHub
  • Paper
  • License

Input

Output

Run time and cost

This model runs on Nvidia A40 GPU hardware. Predictions typically complete within 8 seconds.