cjwbw/pix2struct
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding