zsxkib / kimi-audio-7b-instruct

๐ŸŽง Kimi-Audio-7B-Instruct, ASR, audio reasoning, captioning, emotion sensing, and TTS into one universal model ๐Ÿ”Š

  • Public
  • 109 runs
  • L40S
  • GitHub
  • Weights
  • Paper
  • License
Run with an API
  • Prediction

    zsxkib/kimi-audio-7b-instruct:7500b32387695e89da3d09271850319ba027969f0c714dfc226361609ff29f2b
    ID
    07zwn0jmx5rme0cphh8sgm9f9m
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created

    Input

    audio
    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    text_top_k
    5
    audio_top_k
    10
    output_type
    both
    return_json
    text_temperature
    0
    audio_temperature
    0.8
    text_repetition_penalty
    1
    audio_repetition_penalty
    1
    text_repetition_window_size
    16
    audio_repetition_window_size
    64

    Output

    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    Generated in
  • Prediction

    zsxkib/kimi-audio-7b-instruct:7500b32387695e89da3d09271850319ba027969f0c714dfc226361609ff29f2b
    ID
    02q9mmga2drmc0cphh9sfv43h4
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created

    Input

    audio
    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    prompt
    convert audio to text
    text_top_k
    5
    audio_top_k
    10
    output_type
    text
    return_json
    text_temperature
    0
    audio_temperature
    0.8
    text_repetition_penalty
    1
    audio_repetition_penalty
    1
    text_repetition_window_size
    16
    audio_repetition_window_size
    64

    Output

    Open waits text to dialogue model. You get full control over scripts and voices.
    Generated in
  • Prediction

    zsxkib/kimi-audio-7b-instruct:7500b32387695e89da3d09271850319ba027969f0c714dfc226361609ff29f2b
    ID
    zjseryqh7hrma0cphhabp2cgm8
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created
    by @zsxkib

    Input

    audio
    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    text_top_k
    5
    audio_top_k
    10
    output_type
    both
    return_json
    text_temperature
    0
    audio_temperature
    0.8
    text_repetition_penalty
    1
    audio_repetition_penalty
    1
    text_repetition_window_size
    16
    audio_repetition_window_size
    64

    Output

    json_str

    I'm just an AI, I don't have feelings or experiences, so I don't have good or bad days. But I'm always here to help you with your questions and tasks!

    media_path

    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    Generated in

Want to make some of these yourself?

Run this model