cjwbw/pix2struct
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Public
6.1K runs