ditto--ai/qwen3guard-gen-4b

A 4B-parameter safety and content moderation model that classifies user prompts and assistant responses as Safe, Unsafe, or Controversial with fine-grained category labels and refusal detection. Supports 119 languages.

Public
463.4K runs

Want to make some of these yourself?

Run this model