lucataco / upstage-llama-2-70b-instruct-v2

Upstage/Llama-2-70B-instruct-v2 - GPTQ

  • Public
  • 3.3K runs
  • A100 (80GB)
  • GitHub

Input

string
Shift + Return to add a new line

Prompt to send to model

Default: "Tell me about AI"

string
Shift + Return to add a new line

System prompt that helps guide system behavior

Default: "You are a helpful, respectful and honest assistant. Always answer as helpfully as possible, while being safe. Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature. If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don't know the answer to a question, please don't share false information."

integer
(minimum: 1, maximum: 2048)

Number of new tokens

Default: 512

number
(minimum: 0, maximum: 5)

Randomness of outputs, 0 is deterministic, greater than 1 is random

Default: 0.75

number
(minimum: 0.01, maximum: 1)

When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens

Default: 0.95

number
(minimum: 0, maximum: 5)

Penalty for repeated words in generated text; 1 is no penalty, values greater than 1 discourage repetition, less than 1 encourage it

Default: 1.1

Output

AI stands for Artificial Intelligence, which refers to the development of computer systems that can perform tasks that would normally require human intelligence, such as learning, problem-solving, decision making, and language understanding. AI technology has been rapidly advancing in recent years, with applications ranging from virtual assistants like Siri and Alexa, to self-driving cars, medical diagnosis, and even creating art and music. There are various approaches to developing AI, including machine learning, deep learning, and neural networks, all aimed at enabling machines to learn from data and improve their performance over time. While AI holds great potential for improving our lives and solving complex problems, it also raises important ethical questions about its impact on society and the future of work.</s>
Generated in

Run time and cost

This model costs approximately $0.030 to run on Replicate, or 33 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 22 seconds.