isaacgv / vec2

  • Public
  • 105 runs
  • L40S

Input

Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x
*file

Audio file

string

Choose a Whisper model.

Default: "large-v2"

number

temperature to use for sampling

Default: 0

string

language spoken in the audio, specify None to perform language detection

Output

segments

[ { "end": 2.459, "text": " Vă face la fel și în acest caz.", "start": 0.34, "words": [ { "end": 0.804, "word": "Vă", "score": 0.166, "start": 0.34 }, { "end": 1.147, "word": "face", "score": 0.383, "start": 0.865 }, { "end": 1.309, "word": "la", "score": 0.5, "start": 1.188 }, { "end": 1.591, "word": "fel", "score": 0.361, "start": 1.369 }, { "end": 1.712, "word": "și", "score": 0.504, "start": 1.632 }, { "end": 1.834, "word": "în", "score": 0.506, "start": 1.753 }, { "end": 2.217, "word": "acest", "score": 0.365, "start": 1.874 }, { "end": 2.459, "word": "caz.", "score": 0.268, "start": 2.278 } ] } ]

detected_language

ro
Generated in

This example was created by a different version, isaacgv/vec2:ec9bcadf.

Run time and cost

This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

This model doesn't have a readme.