-
-
Notifications
You must be signed in to change notification settings - Fork 193
Open
Labels
enhancementNew feature or requestNew feature or request
Description
Feature request
We want to implement https://huggingface.co/microsoft/OmniParser in a ReplayStrategy
(e.g. #888)
Motivation
OmniParser is designed to be able to convert unstructured screenshot image into structured list of elements including interactable regions location and captions of icons on its potential functionality.
OmniParser is intended to be used in settings where users are already trained on responsible analytic approaches and critical reasoning is expected. OmniParser is capable of providing extracted information from the screenshot, however human judgement is needed for the output of OmniParser.
OmniParser is intended to be used on various screenshots, which includes both PC and Phone, and also on various applications.
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request