Dear MORL-Baselines Maintainers,

As part of my bachelor thesis, I am exploring the application of multi-objective reinforcement learning (MORL). To avoid the tedious work of implementing an algorithm from scratch, I searched for libraries similar to Stable-Baselines3 and came across MORL-Baselines. For no particular reason, I decided to start with PGMORL to familiarize myself with MORL. However, I have encountered a few issues:
Environment Resource Limitation:
Due to restricted licensing, I can only use one instance of the environment. Unfortunately, the constructor of the PGMORL agent creates a tmp_env object to extract environment information. This causes an issue when an existing env object is passed to the constructor, because the tmp_env creation then raises a simulation-related error (I am using a simulation program as the environment). Passing env=None instead resolves the issue.
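For reference, the two instantiation paths I tried look roughly like this (a minimal sketch; the environment id is a placeholder for my licensed simulator, the import paths may differ between MORL-Baselines versions, and I leave all other PGMORL hyperparameters at their defaults):

```python
import mo_gymnasium as mo_gym
from morl_baselines.multi_policy.pgmorl.pgmorl import PGMORL

# Variant 1: pass my single licensed environment instance directly.
# PGMORL still builds a tmp_env internally from env_id, which starts a second
# simulator instance and triggers the licensing error described above.
env = mo_gym.make("my-sim-env-v0")  # placeholder id for my custom simulation env
agent = PGMORL(env_id="my-sim-env-v0", env=env)

# Variant 2: pass env=None and let PGMORL construct the environment itself
# from env_id. Only one simulator instance is created, so this works for me.
agent = PGMORL(env_id="my-sim-env-v0", env=None)
```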
Incompatibility Between Training and Evaluation Environments:
The issue mentioned above also leads to another problem: the training and evaluation environments cannot share the same wrapper stack. Specifically, the training environment is wrapped with MOSyncVectorEnv, which causes the step function to fail during policy evaluation in the eval_mo function. I worked around this by adding the line env = env.envs[0], which, as I understand it, extracts the underlying environment still wrapped in MORecordEpisodeStatistics. It would be more practical if the same environment instance could be used seamlessly for both training and evaluation.
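Concretely, the workaround looks like this (a rough sketch; exact import paths and the eval_mo signature may differ between MORL-Baselines versions, and vec_env, policy, and weights stand for the training vector env, one policy from the population, and a weight vector):

```python
from morl_baselines.common.evaluation import eval_mo

# The training env is a MOSyncVectorEnv; eval_mo expects a plain (non-vectorized)
# environment, so I pull out the first sub-environment, which still carries the
# MORecordEpisodeStatistics wrapper.
eval_env = vec_env.envs[0]

# Evaluate one policy from the population for a given weight vector.
scalarized_return, scalarized_disc_return, vec_return, disc_vec_return = eval_mo(
    agent=policy, env=eval_env, w=weights
)
```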
Model Saving and Testing:
I am unable to test the different models at the end of training because I do not know how to save them. My understanding is that the training process produces multiple agents, each optimized for a different objective weighting. To test these agents, I need to be able to save them. However, I could not find a save_model method similar to the one in Stable-Baselines3. This might be a misunderstanding on my part, but I would greatly appreciate clarification.
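What I was hoping for is something like model.save(path) in Stable-Baselines3. The closest manual fallback I could think of is sketched below, but I do not know whether the PGMORL agent actually exposes its policies this way; the archive/individuals attribute names are pure guesses on my part, not the actual MORL-Baselines API:

```python
import torch

# Purely illustrative: assumes the trained agent exposes its Pareto archive as an
# iterable of torch.nn.Module policies ("archive" and "individuals" are guessed
# attribute names, not the real API).
for i, policy in enumerate(agent.archive.individuals):
    torch.save(policy.state_dict(), f"pgmorl_policy_{i}.pt")
```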
Thank you in advance for your response and support!
Best Regards,
hamod-kh
First of all, welcome to the community! Now to answer your points:
Ah, I see. The env and env_id kinda duplicate the information indeed. This is to stay compatible with algorithms that do not use vectorized envs, i.e., all other algos. Anyways, if you found a way it's all good. :)
There is currently no save model method for PGMORL. Implementing this can be a bit tricky as we would effectively need to store the Pareto archive of models (not just one model).
That being said, if you can only instantiate one environment, I would use a more sample-efficient algorithm than PGMORL. I assume it is a continuous problem, so I'd tend to advise GPI-LS, which has the save-model feature implemented :).