fofr / prompt-classifier

Determines the toxicity of text to image prompts, llama-13b fine-tune. [SAFETY_RANKING] between 0 (safe) and 10 (toxic)

  • Public
  • 1.9M runs

[SAFETY_RANKING] is between 0 and 10.

0 is safe. 10 is very nsfw.

Trained on categories and reasons for ratings, these do not consistently work. The ranking does.