microsoft
/
omniparser-v2
OmniParser is a screen parsing tool to convert general GUI screen to structured elements.
Public
41.8K runs
GitHub
Weights
Paper
License
Run with an API
Playground
API
Examples
README
Versions