vm6eji6m4/whisper-chinese-pro-cert
Court-grade Traditional Chinese ASR — Mandarin/Taiwanese/Hakka, word-level alignment, speaker diarization, optional three-engine certification with confidence index. 210k-term MOE lexicon.
Run vm6eji6m4/whisper-chinese-pro-cert with an API
Use one of our client libraries to get started quickly. Clicking on a library will take you to the Playground tab where you can tweak different inputs, see the results, and copy the corresponding code to use in your own project.
Input schema
The fields you can use to run this model with an API. If you don't give a value for a field its default value will be used.
| Field | Type | Default value | Description |
|---|---|---|---|
| file |
string
|
音檔或影片(mp3/mp4/m4a/wav)。也可用 file_url。
|
|
| file_url |
string
|
|
或提供音訊 URL。
|
| language |
None
|
auto
|
語言:auto/zh/zh-tw/zh-hk/en/ja/ko
|
| num_speakers |
integer
|
0
Max: 32 |
已知說話人數(0=自動)。
|
| enable_diarization |
boolean
|
True
|
輸出 speaker 標籤。
|
| enable_certification |
boolean
|
False
|
★ 認證版:跑第二引擎做 cross-validate(disputed/agreement)。會增加 ~80% 時間。
|
| hotwords |
string
|
|
自訂熱詞(會加入內建詞庫 prompt)。
|
| hf_token |
string
|
HuggingFace token(pyannote 必需)。
|
{
"type": "object",
"title": "Input",
"required": [
"file",
"hf_token"
],
"properties": {
"file": {
"type": "string",
"title": "File",
"format": "uri",
"x-order": 0,
"description": "\u97f3\u6a94\u6216\u5f71\u7247\uff08mp3/mp4/m4a/wav\uff09\u3002\u4e5f\u53ef\u7528 file_url\u3002"
},
"file_url": {
"type": "string",
"title": "File Url",
"default": "",
"x-order": 1,
"description": "\u6216\u63d0\u4f9b\u97f3\u8a0a URL\u3002"
},
"hf_token": {
"type": "string",
"title": "Hf Token",
"format": "password",
"x-order": 7,
"writeOnly": true,
"description": "HuggingFace token\uff08pyannote \u5fc5\u9700\uff09\u3002",
"x-cog-secret": true
},
"hotwords": {
"type": "string",
"title": "Hotwords",
"default": "",
"x-order": 6,
"description": "\u81ea\u8a02\u71b1\u8a5e\uff08\u6703\u52a0\u5165\u5167\u5efa\u8a5e\u5eab prompt\uff09\u3002"
},
"language": {
"enum": [
"auto",
"zh",
"zh-tw",
"zh-hk",
"en",
"ja",
"ko"
],
"type": "string",
"title": "language",
"description": "\u8a9e\u8a00\uff1aauto/zh/zh-tw/zh-hk/en/ja/ko",
"default": "auto",
"x-order": 2
},
"num_speakers": {
"type": "integer",
"title": "Num Speakers",
"default": 0,
"maximum": 32,
"minimum": 0,
"x-order": 3,
"description": "\u5df2\u77e5\u8aaa\u8a71\u4eba\u6578\uff080=\u81ea\u52d5\uff09\u3002"
},
"enable_diarization": {
"type": "boolean",
"title": "Enable Diarization",
"default": true,
"x-order": 4,
"description": "\u8f38\u51fa speaker \u6a19\u7c64\u3002"
},
"enable_certification": {
"type": "boolean",
"title": "Enable Certification",
"default": false,
"x-order": 5,
"description": "\u2605 \u8a8d\u8b49\u7248\uff1a\u8dd1\u7b2c\u4e8c\u5f15\u64ce\u505a cross-validate\uff08disputed/agreement\uff09\u3002\u6703\u589e\u52a0 ~80% \u6642\u9593\u3002"
}
}
}
Output schema
The shape of the response you’ll get when you run this model with an API.
{
"type": "object",
"title": "Output"
}