joehoover/bart-large-mnli | Run with an API on Replicate

Zero-shot classification document classification with a light-weight model.

Public

5.5K runs

License

GitHub

Playground API Examples README Versions

Input

Run this model in Node.js with one line of code:

npx create-replicate --model=joehoover/bart-large-mnli

or set up a project from scratch

Install Replicate’s Node.js client library:

npm install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import and set up the client:

import Replicate from "replicate";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});

Run joehoover/bart-large-mnli using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

const output = await replicate.run(
  "joehoover/bart-large-mnli:35bf0a611ca728bf8d67f42dd54617fabc6296fec8d956ff058814c2cb34e844",
  {
    input: {
      input: "Replicate, I think I might...like you a lot!",
      multi_label: false,
      class_labels: "positive, negative, neutral",
      hypothesis_template: "This example is {}."
    }
  }
);

console.log(output);

To learn more, take a look at the guide on getting started with Node.js.

Install Replicate’s Python client library:

pip install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import the client:

import replicate

Run joehoover/bart-large-mnli using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

output = replicate.run(
    "joehoover/bart-large-mnli:35bf0a611ca728bf8d67f42dd54617fabc6296fec8d956ff058814c2cb34e844",
    input={
        "input": "Replicate, I think I might...like you a lot!",
        "multi_label": False,
        "class_labels": "positive, negative, neutral",
        "hypothesis_template": "This example is {}."
    }
)

print(output)

To learn more, take a look at the guide on getting started with Python.

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Run joehoover/bart-large-mnli using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "joehoover/bart-large-mnli:35bf0a611ca728bf8d67f42dd54617fabc6296fec8d956ff058814c2cb34e844",
    "input": {
      "input": "Replicate, I think I might...like you a lot!",
      "multi_label": false,
      "class_labels": "positive, negative, neutral",
      "hypothesis_template": "This example is {}."
    }
  }' \
  https://api.replicate.com/v1/predictions

To learn more, take a look at Replicate’s HTTP API reference docs.

Output

No output yet! Press "Submit" to start a prediction.

Run time and cost

This model costs approximately $0.00022 to run on Replicate, or 4545 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia T4 GPU hardware. Predictions typically complete within 1 seconds.

Readme

This model performs zero-shot document classification for short documents.

It spins up an instance of the HuggingFace Transformers zero-shot-classification pipeline with a large BART model that’s been fine-tuned on NLI. It’s an old approach (see here) as far as zero shot inference is concerned, but it’s also small and fast, compared to today’s large models.

Model description

This system is a pipeline that uses a BART Large model that’s been fine-tuned on MNLI, a large natural language inference dataset.

The pipeline formulates sequence classification as an NLI problem. Given a set of class labels, an input sequence, and a hypothesis template:

A hypothesis is constructed for each label by piping labels into the hypothesis template.
Each hypothesis is appended to the user input, yielding the complete input sequence
The input sequence is passed to the NLI model, which then predicts whether the hypothesis contradicts, entails, or is irrelevant to the user input (i.e. the premise).
Entailment logits associated with each hypothesis–remember, a hypothesis is constructed for each label–are then normalized with a softmax to yield the final scores over labels that are returned in the output.

Outputs are returned as a dictionary with three keys:

hypothesis_template: The hypothesis template used to operationalize the full input sequence.
labels: Class labels specified by the user ordered by scores
scores: Scores associated with each label
sequence: Input sequence used for classification

Intended use

Prototyping, low-cost zero-shot document classification.

Ethical considerations

This model contains social and cultural biases that may impact predictions. It is also not particularly accurate or well-calibrated. It should not be used in production without serious consideration of these risks.

Caveats and recommendations

For best results, use labels and a hypothesis template that are congruent with each other. Note, also, that the model is not robust to changes in surface forms, so changing characteristics like punctuation and capitalization may change accuracy (for better or worse!).