zf-kbot/manga-translator

An advanced, automated pipeline for translating manga and comics. This project integrates state-of-the-art Computer Vision and NLP models to perform detection, recognition, translation, and typesetting in a single seamless process.

Public
79 runs

Run zf-kbot/manga-translator with an API

Use one of our client libraries to get started quickly. Clicking on a library will take you to the Playground tab where you can tweak different inputs, see the results, and copy the corresponding code to use in your own project.

Input schema

The fields you can use to run this model with an API. If you don't give a value for a field its default value will be used.

Field Type Default value Description
source_lan
None
ja
Source language (Auto-selects the best OCR engine: MangaOCR for JA, Paddle for ZH/KO, EasyOCR for others)
target_lan
None
zh-CN
Target language for translation
translator
None
google
Translation Engine: 'google' (Free/Fast) or 'llm' (High Quality, requires API Key)
image
string
Input image file
openai_model
string
gpt-4o
LLM Model Name. (e.g., 'gpt-4o', 'deepseek-chat', 'moonshot-v1-8k')
openai_api_key
string
LLM API Key. Required if 'translator' is set to 'llm'. Supports OpenAI, DeepSeek, Moonshot, etc.
openai_base_url
string
https://api.openai.com/v1
LLM Base URL. Optional. Change this if using non-OpenAI models (e.g., 'https://api.deepseek.com')

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{
  "type": "string",
  "title": "Output",
  "format": "uri"
}