cjwbw / pix2struct

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

  • Public
  • 6K runs
  • GitHub
  • Paper
  1. e32d7748

    Latest