lucataco / qwen2-57b-a14b-instruct

Qwen2 57 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

  • Public
  • 1.3K runs
  • 2x A100 (80GB)
  • Paper
  • License
  • Prediction

    lucataco/qwen2-57b-a14b-instruct:fc67fa3fa20d3d0ee59794df05548b59d285fbb43d944506203a8a2195b75c36
    ID
    0zkzt2qq41rgj0cgeg5tck6598
    Status
    Succeeded
    Source
    Web
    Hardware
    2x A100 (80GB)
    Total duration
    Created

    Input

    top_k
    50
    top_p
    0.9
    prompt
    Give me a short introduction to large language model.
    max_tokens
    512
    min_tokens
    0
    temperature
    0.6
    system_prompt
    You are a helpful assistant.
    presence_penalty
    0
    frequency_penalty
    0

    Output

    A large language model (LLM) is a type of artificial intelligence model that is trained on a massive amount of text data to generate human-like text. These models are typically trained using deep learning techniques, and they are able to generate text that is coherent and contextually appropriate, making them useful for a variety of natural language processing tasks. Some common applications of large language models include language translation, text summarization, and question answering. They are also used in chatbots and virtual assistants to enable more natural and realistic conversations with users. Large language models are often referred to as "generative models" because they are able to generate new text based on the patterns they have learned from the training data.
    Generated in

Want to make some of these yourself?

Run this model