microsoft / phi-4-multimodal-instruct

Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models.

  • Public
  • 3.1K runs
  • L40S
  • GitHub
  • Weights
  • Paper
  • License
Iterate in playground
  • Prediction

    microsoft/phi-4-multimodal-instruct:40c8f5c03ce250441855e776528bafd11cdb302c6677613acc0942c58dbd0afa
    ID
    qc7k89h1rhrm80cn9v8avyje44
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created

    Input

    text
    What is shown in this image?
    images
    output
    max_tokens
    1000
    temperature
    0.7
    system_prompt
    You are a helpful assistant.

    Output

    A stop sign in front of a building with Chinese writing on it.
    Generated in
  • Prediction

    microsoft/phi-4-multimodal-instruct:40c8f5c03ce250441855e776528bafd11cdb302c6677613acc0942c58dbd0afa
    ID
    eggqjpa671rmc0cn9v9adqzrew
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created

    Input

    text
    Transcribe the audio to text, and then translate the audio to French. Use <sep> as a separator between the original transcript and the translation.
    audio
    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    max_tokens
    1000
    temperature
    0.7
    system_prompt
    You are a helpful assistant.

    Output

    What we do as a society, we have to think about where we're moving to. I frequently talk to students about cognitive enhancing drugs, and a lot of students take them for studying and exams, but other students feel angry about this. They feel those students are cheating, and we have no long-term health and safety studies in healthy people, and we really need those before people start taking them. <sep> Ce que nous faisons en tant que société, nous devons penser à où nous allons. Je parle souvent avec des étudiants sur les médicaments cognitifs, et beaucoup d'étudiants les prennent pour étudier et les examens, mais d'autres étudiants se sentent en colère à ce sujet. Ils sentent que ces étudiants trichent, et nous n'avons pas d'études de santé et de sécurité à long terme chez des personnes saines, et nous en avons vraiment besoin avant que les gens ne commencent à les prendre.
    Generated in

Want to make some of these yourself?

Run this model