replicate / gpt-j-6b

A large language model by EleutherAI

  • Public
  • 9.4K runs
  • GitHub
  • License

If you haven’t yet trained a model on Replicate, we recommend reading one of Replicate’s training guides first.

Pricing

Trainings for this model run on 4x Nvidia A100 (80GB) GPU hardware, which costs $0.0056 per second.
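As a rough illustration of what that rate means in practice, the sketch below multiplies the per-second price by a training duration. It assumes billing is linear in wall-clock seconds; the 30-minute duration is a made-up example, not a measured training time for this model.

```python
# Per-second price quoted in the pricing line above.
PRICE_PER_SECOND_USD = 0.0056


def estimate_training_cost(duration_seconds: float) -> float:
    """Return the estimated cost in USD, assuming simple per-second billing."""
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    return duration_seconds * PRICE_PER_SECOND_USD


if __name__ == "__main__":
    # Hypothetical 30-minute training run (1800 seconds).
    print(f"${estimate_training_cost(30 * 60):.2f}")  # prints $10.08
```

At this rate, one hour of training would come to about $20.16.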

Create a training

Install the Python library:

pip install replicate

Then, run this to create a training with replicate/gpt-j-6b:b3546aee as the base model:

import replicate

# Placeholder: set this to the owner of the destination model.
username = "<your-username>"

training = replicate.trainings.create(
  version="replicate/gpt-j-6b:b3546aeec6c9891f0dd9929c2d3bedbf013c12e02e7dd0346af09c37e008c827",
  input={
    ...
  },
  destination=f"{username}/<destination-model-name>"
)

print(training)

Or send the same request with cURL:

curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"destination": "{username}/<destination-model-name>", "input": {...}}' \
  https://api.replicate.com/v1/models/replicate/gpt-j-6b/versions/b3546aeec6c9891f0dd9929c2d3bedbf013c12e02e7dd0346af09c37e008c827/trainings

The API response will look like this:

{
  "id": "zz4ibbonubfz7carwiefibzgga",
  "version": "b3546aeec6c9891f0dd9929c2d3bedbf013c12e02e7dd0346af09c37e008c827",
  "status": "starting",
  "input": {
    "data": "..."
  },
  "output": null,
  "error": null,
  "logs": null,
  "started_at": null,
  "created_at": "2023-03-28T21:47:58.566434Z",
  "completed_at": null
}
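The response above reports a status of "starting", so a caller typically has to check back until the training finishes. The following is a minimal polling sketch, not Replicate's own implementation. The helper name wait_for_training, the polling interval, and the poll limit are all hypothetical, and the set of terminal statuses is an assumption about the API rather than something stated on this page.

```python
import time
from typing import Callable

# Assumed terminal statuses; check the API documentation for the exact set.
TERMINAL_STATUSES = {"succeeded", "failed", "canceled"}


def wait_for_training(
    fetch_status: Callable[[], str],
    interval_seconds: float = 10.0,
    max_polls: int = 360,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch_status repeatedly until it returns a terminal status.

    fetch_status is any zero-argument callable that returns the current
    status string, which keeps this helper independent of the HTTP client.
    """
    for attempt in range(max_polls):
        status = fetch_status()
        if status in TERMINAL_STATUSES:
            return status
        # Skip the pause after the final attempt.
        if attempt < max_polls - 1:
            sleep(interval_seconds)
    raise TimeoutError(f"training still not finished after {max_polls} polls")
```

With the Python library, fetch_status could plausibly be written as lambda: replicate.trainings.get(training.id).status, which re-fetches the training on every call.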

Note that before you can create a training, you’ll need to create a model and use its name as the value for the destination field.