microsoft/omniparser-v2
OmniParser is a screen parsing tool that converts general GUI screens into structured elements.
Public
50.1K runs
Versions
3 months, 2 weeks ago
Author: @lucataco
Version: cuda12.1-python3.12-torch2.4.1-X64
Commit: 9c0dff05b38ca5aee9de7923ccce787576d142bc
49cf3d41 (Latest)