nateraw/codellama-34b-instruct

Public

21 runs

Run nateraw/codellama-34b-instruct with an API

Use one of our client libraries to get started quickly. Clicking on a library will take you to the Playground tab where you can tweak different inputs, see the results, and copy the corresponding code to use in your own project.

Input schema

The fields you can use to run this model with an API. If you don't give a value for a field its default value will be used.

Field	Type	Default value	Description
message	string		None
system_prompt	string	Provide answers in Python	The system prompt to use (for chat/instruct models only)
max_new_tokens	integer	256	The maximum number of tokens the model should generate as output.
temperature	number	0.2	The value used to modulate the next token probabilities.
top_p	number	0.9	A probability threshold for generating the output. If < 1.0, only keep the top tokens with cumulative probability >= top_p (nucleus filtering). Nucleus filtering is described in Holtzman et al. (http://arxiv.org/abs/1904.09751).
top_k	integer	50	The number of highest probability tokens to consider for generating the output. If > 0, only keep the top k tokens with highest probability (top-k filtering).

{
  "type": "object",
  "title": "Input",
  "required": [
    "message"
  ],
  "properties": {
    "top_k": {
      "type": "integer",
      "title": "Top K",
      "default": 50,
      "x-order": 5,
      "description": "The number of highest probability tokens to consider for generating the output. If > 0, only keep the top k tokens with highest probability (top-k filtering)."
    },
    "top_p": {
      "type": "number",
      "title": "Top P",
      "default": 0.9,
      "x-order": 4,
      "description": "A probability threshold for generating the output. If < 1.0, only keep the top tokens with cumulative probability >= top_p (nucleus filtering). Nucleus filtering is described in Holtzman et al. (http://arxiv.org/abs/1904.09751)."
    },
    "message": {
      "type": "string",
      "title": "Message",
      "x-order": 0
    },
    "temperature": {
      "type": "number",
      "title": "Temperature",
      "default": 0.2,
      "x-order": 3,
      "description": "The value used to modulate the next token probabilities."
    },
    "system_prompt": {
      "type": "string",
      "title": "System Prompt",
      "default": "Provide answers in Python",
      "x-order": 1,
      "description": "The system prompt to use (for chat/instruct models only)"
    },
    "max_new_tokens": {
      "type": "integer",
      "title": "Max New Tokens",
      "default": 256,
      "x-order": 2,
      "description": "The maximum number of tokens the model should generate as output."
    }
  }
}

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema

{
  "type": "array",
  "items": {
    "type": "string"
  },
  "title": "Output",
  "x-cog-array-type": "iterator",
  "x-cog-array-display": "concatenate"
}