A 40-billion-parameter language model trained to follow human instructions.

This model runs on 4x Nvidia A100 (40GB) GPU hardware. Predictions typically complete within 25 seconds, though the predict time varies significantly with the inputs.
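
As a rough illustration of why a model of this size is served on several GPUs, the sketch below estimates the memory needed for the weights alone. It assumes 16-bit (2-byte) weights, which this page does not state, and it ignores activations, the KV cache, and framework overhead. The helper name `weight_memory_gb` is hypothetical.

```python
def weight_memory_gb(n_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bytes_per_param / 1e9


N_PARAMS = 40e9        # 40 billion parameters, from the description
GPU_COUNT = 4          # 4x A100
GPU_MEMORY_GB = 40     # 40 GB per GPU

# Assumption: weights stored in a 16-bit format (2 bytes per parameter).
weights_gb = weight_memory_gb(N_PARAMS)
total_gpu_gb = GPU_COUNT * GPU_MEMORY_GB
headroom_gb = total_gpu_gb - weights_gb

print(f"weights: ~{weights_gb:.0f} GB")
print(f"GPU total: {total_gpu_gb} GB")
print(f"headroom: ~{headroom_gb:.0f} GB")
```

Under that assumption the weights alone come to roughly 80 GB, more than a single 40 GB card can hold, so the model has to be split across GPUs. The remaining memory is shared by activations and the attention cache, which grow with input and output length.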


Model Description

Falcon-40B-Instruct is a 40B-parameter causal decoder-only model built by TII. It is based on Falcon-40B and finetuned on a mixture of Baize data. It is made available under the Apache 2.0 license.

For more information about this model, see here.


