vm6eji6m4/whisper-chinese-pro-cert

Court-grade Traditional Chinese ASR — Mandarin/Taiwanese/Hakka, word-level alignment, speaker diarization, optional three-engine certification with confidence index. 210k-term MOE lexicon.

Public

0 runs

Playground API Examples README Versions

Run vm6eji6m4/whisper-chinese-pro-cert with an API

Use one of our client libraries to get started quickly. Clicking on a library will take you to the Playground tab where you can tweak different inputs, see the results, and copy the corresponding code to use in your own project.

Input schema

The fields you can use to run this model with an API. If you don't give a value for a field its default value will be used.

Field	Type	Default value	Description
file	string		音檔或影片（mp3/mp4/m4a/wav）。也可用 file_url。
file_url	string		或提供音訊 URL。
language	None	auto	語言：auto/zh/zh-tw/zh-hk/en/ja/ko
num_speakers	integer	0 Max: 32	已知說話人數（0=自動）。
enable_diarization	boolean	True	輸出 speaker 標籤。
enable_certification	boolean	False	★ 認證版：跑第二引擎做 cross-validate（disputed/agreement）。會增加 ~80% 時間。
hotwords	string		自訂熱詞（會加入內建詞庫 prompt）。
hf_token	string		HuggingFace token（pyannote 必需）。

{
  "type": "object",
  "title": "Input",
  "required": [
    "file",
    "hf_token"
  ],
  "properties": {
    "file": {
      "type": "string",
      "title": "File",
      "format": "uri",
      "x-order": 0,
      "description": "\u97f3\u6a94\u6216\u5f71\u7247\uff08mp3/mp4/m4a/wav\uff09\u3002\u4e5f\u53ef\u7528 file_url\u3002"
    },
    "file_url": {
      "type": "string",
      "title": "File Url",
      "default": "",
      "x-order": 1,
      "description": "\u6216\u63d0\u4f9b\u97f3\u8a0a URL\u3002"
    },
    "hf_token": {
      "type": "string",
      "title": "Hf Token",
      "format": "password",
      "x-order": 7,
      "writeOnly": true,
      "description": "HuggingFace token\uff08pyannote \u5fc5\u9700\uff09\u3002",
      "x-cog-secret": true
    },
    "hotwords": {
      "type": "string",
      "title": "Hotwords",
      "default": "",
      "x-order": 6,
      "description": "\u81ea\u8a02\u71b1\u8a5e\uff08\u6703\u52a0\u5165\u5167\u5efa\u8a5e\u5eab prompt\uff09\u3002"
    },
    "language": {
      "enum": [
        "auto",
        "zh",
        "zh-tw",
        "zh-hk",
        "en",
        "ja",
        "ko"
      ],
      "type": "string",
      "title": "language",
      "description": "\u8a9e\u8a00\uff1aauto/zh/zh-tw/zh-hk/en/ja/ko",
      "default": "auto",
      "x-order": 2
    },
    "num_speakers": {
      "type": "integer",
      "title": "Num Speakers",
      "default": 0,
      "maximum": 32,
      "minimum": 0,
      "x-order": 3,
      "description": "\u5df2\u77e5\u8aaa\u8a71\u4eba\u6578\uff080=\u81ea\u52d5\uff09\u3002"
    },
    "enable_diarization": {
      "type": "boolean",
      "title": "Enable Diarization",
      "default": true,
      "x-order": 4,
      "description": "\u8f38\u51fa speaker \u6a19\u7c64\u3002"
    },
    "enable_certification": {
      "type": "boolean",
      "title": "Enable Certification",
      "default": false,
      "x-order": 5,
      "description": "\u2605 \u8a8d\u8b49\u7248\uff1a\u8dd1\u7b2c\u4e8c\u5f15\u64ce\u505a cross-validate\uff08disputed/agreement\uff09\u3002\u6703\u589e\u52a0 ~80% \u6642\u9593\u3002"
    }
  }
}

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema

{
  "type": "object",
  "title": "Output"
}