hayooucom / vision-model

This is phi-3-vision model , cost by time ,have fun~

  • Public
  • 13.6K runs
  • L40S
  • GitHub
  • Paper
  • License
  • Prediction

    hayooucom/vision-model:6afc892d
    ID
    a2kecty0xsrgj0cfmpj91qj7v0
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    top_k
    1
    top_p
    1
    prompt
    please describe this image.
    image_url
    [ "https://support.content.office.net/en-us/media/3dd2b79b-9160-403d-9967-af893d17b580.png" ]
    max_tokens
    45000
    temperature
    0.1
    image_base64
    []
    system_prompt
    You are a helpful AI assistant.
    max_new_tokens
    458
    repetition_penalty
    1.1

    Output

    The chart is a spreadsheet table with data organized into columns and rows. It includes headers labeled 'Product', 'Qtr 1', 'Qtr 2', and 'Grand Total'. There are eight products listed: Chocolade, Gummibarchen, Scottish Longbreads, Sir Rodney's Scones, Tarte au sucre, Chocolate Biscuits, and a total row at the bottom. Each product has corresponding values in the three quarters and grand total columns. For example, Chocolade has $744.60 in Qtr 1, $162.56 in Qtr 2, and a grand total of $907.16. The totals for each column are also provided at the end of the table, with Qtr 1 totaling $14,181.59, Qtr 2 totaling $8,127.78, and the Grand Total totaling $22,309.37. The table uses a green header for the first column and alternates shading between white and light grey for the rows to enhance readability.
    Generated in
  • Prediction

    hayooucom/vision-model:6afc892d
    ID
    11axq4ccj9rgg0cfmqjsamqr8g
    Status
    Succeeded
    Source
    API
    Hardware
    A40
    Total duration
    Created

    Input

    top_p
    0.9
    prompt
    <|user|> <|image_1|> 当前时间:2024年05月23日 22:40:35 星期四 这张图片有什么,请用markdown 格式展示出来。<|end|> <|assistant|>
    image_url
    [ "https://v.stylee.top/files/vision/upimg-1716475231541697.jpg" ]
    max_tokens
    400
    temperature
    0.2

    Output

    The image contains a bar chart, not a table. However, I can provide the data in a markdown table format based on the information visible in the OCR text and the image description. Here is the conversion of the bar chart data into a markdown table: ```markdown | Improvement Area | Percentage of Respondents Who Agree | |-----------------------------------------|-------------------------------------| | Having clear and pre-defined goals for meetings | 79% | | Knowing where to find the information need for a meeting | 77% | | Understanding my exact role and responsibilities when I'm invited | 76% | | Having tools to manage admin tasks like note-taking or summarization | 76% | | Having more focus time to sufficiently prepare for meetings | 75% |
    Generated in

Want to make some of these yourself?

Run this model