Some questions for PPO implemention #25

nihao-zhangtongxue · 2024-11-17T18:13:06Z

Hello, thank you for your experiments codes!But I have a problem with the PPO method of playing Car racing games for a long time:As your results show, why was his training so erratic that it was possible to converge to a bad result?I've run a similar experiment before, and it went badly, too.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some questions for PPO implemention #25

Some questions for PPO implemention #25

nihao-zhangtongxue commented Nov 17, 2024

Some questions for PPO implemention #25

Some questions for PPO implemention #25

Comments

nihao-zhangtongxue commented Nov 17, 2024