lucataco / dolphin-2.1-mistral-7b

Mistral-7B-v0.1 fine tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)

  • Public
  • 13.4K runs
  • L40S
  • GitHub
  • Paper
  • License

Input

*string
Shift + Return to add a new line
integer

The maximum number of tokens the model should generate as output.

Default: 512

number

The value used to modulate the next token probabilities.

Default: 0.8

number

A probability threshold for generating the output. If < 1.0, only keep the top tokens with cumulative probability >= top_p (nucleus filtering). Nucleus filtering is described in Holtzman et al. (http://arxiv.org/abs/1904.09751).

Default: 0.95

integer

The number of highest probability tokens to consider for generating the output. If > 0, only keep the top k tokens with highest probability (top-k filtering).

Default: 50

number

Presence penalty

Default: 0

number

Frequency penalty

Default: 0

string
Shift + Return to add a new line

The template used to format the prompt. The input prompt is inserted into the template using the `{prompt}` placeholder.

Default: "<|im_start|>system\nYou are Dolphin, a helpful AI assistant.\n<|im_end|>\n<|im_start|>user\n{prompt}<|im_end|>\n<|im_start|>assistant\n"

Output

Training a dolphin to obey you involves establishing trust, communication, and a strong foundation in basic behaviors. Here's a step-by-step guide to training a dolphin: 1. Building trust: Spend time with the dolphin, observe its behavior, and let it get used to your presence. This will help in building trust, which is the cornerstone of any successful training relationship. 2. Learning the dolphin's body language: Observe the dolphin's body language and learn to interpret its signals. This will help you understand when the dolphin is relaxed, excited, or fearful, and will allow you to respond appropriately. 3. Using positive reinforcement: When teaching a behavior, start by rewarding the dolphin with food or attention as soon as it begins to respond correctly. This will help the dolphin associate the desired behavior with a positive outcome. 4. Breaking down behaviors into smaller steps: Instead of trying to teach a complex behavior all at once, break it down into smaller, more manageable steps. This makes it easier for the dolphin to learn and understand each part of the behavior. 5. Using clear and consistent signals: Use consistent hand signals, verbal commands, or other signals to communicate with the dolphin. This will help the dolphin understand what you want it to do and will make training more efficient. 6. Practicing patience: Training a dolphin requires patience, as they may not learn a behavior as quickly as you would like. Be patient and give the dolphin time to learn and practice the behavior. 7. Reinforcing learned behaviors: Once a behavior has been learned, continue to reinforce it periodically. This will help the dolphin maintain and strengthen its understanding of the behavior. 8. Encouraging creativity and problem-solving: Dolphins are intelligent creatures that enjoy problem-solving and new challenges. Encourage the dolphin to think creatively and solve problems by presenting new situations or tasks for it to complete. 9. Maintaining a strong bond: Continue to nurture the bond between you and the dolphin by spending time together, engaging in play, and communicating openly and honestly. This will help ensure a successful and lasting training relationship. 10. Adapting to the dolphin's needs:
Generated in

Run time and cost

This model costs approximately $0.049 to run on Replicate, or 20 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 51 seconds. The predict time for this model varies significantly based on the inputs.

Readme

About

Dolphin 2.1 🐬 https://erichartford.com/dolphin

Dolphin-2.1-mistral-7b’s training was sponsored by a16z.

This model is based on mistralAI, with apache-2.0 license, so it is suitable for commercial or non-commercial use.