-
-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
If I let the agent perform this task, how do I convert the text information into an executable dict? And how can we convert our executable dict back into text? #92
Comments
Hi. Could you give a small code snippet for what you are trying to do? And are you using a 3rd party agent? In MiniWoB++ environments, the method |
Oh, yes. That's what I have in mind. I want to use this environment for my LLM agent experiments. My plan is to input observation data (obs) into the language model, have the language model output text-based actions, and then convert these actions into executable methods for implementation. `import time gymnasium.register_envs(miniwob) env = gymnasium.make('miniwob/click-test-2-v1', render_mode=None)
while(1):
**
` |
In this case, the The meaning of each dict entry in |
Thank you for your reply! Yes, I understand. Have there been any existing works where a similar conversion method has been used? It seems that converting the action dictionary into text can be quite challenging. |
There might be papers that use MiniWoB which have GitHub repositories. I don't know one off the top of my head. Could you explain what challenges would be involved during the conversion? You can restrict the action set (e.g. to only clicking and typing on an element with a specific ID --- CLICK_ELEMENT and FOCUS_ELEMENT_AND_TYPE_TEXT) to reduce the complexity. |
Question
If I let the agent perform this task, how do I convert the text information into an executable dict? And how can we convert our executable dict back into text?
The text was updated successfully, but these errors were encountered: