Hi, thanks for your great work on ANL. I'm trying to reproduce it in a scanned 3D environment (such as Gibson). Unfortunately, I've run into some problems, and I'd appreciate any help you could offer.
I built my project on the released code in this repo.
To test the environment module, I set the perceptual model to a pretrained VGG network and gave the agent a random action at each step. The success rate (the posterior map converges within 200 steps and the converged grid cell is correct) looks reasonable when enough memory images are given. A visualization of the random walk in the environment is attached.
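For concreteness, the success criterion I use can be sketched as follows. This is a minimal toy reconstruction, not the repo's code: the `posterior_converged` helper, the 4x4 grid size, and the fake likelihood are all my own illustrative assumptions.

```python
import numpy as np

def posterior_converged(posterior, threshold=0.95):
    """Hypothetical helper: the posterior counts as converged once any
    element of the posterior grid exceeds the threshold."""
    return float(posterior.max()) > threshold

# Toy posterior over a 4x4 grid of candidate locations.
posterior = np.full((4, 4), 1.0 / 16)  # uniform prior over grid cells
true_cell = (2, 1)

step = 0
for step in range(200):
    # Fake perceptual likelihood that slightly favors the true cell,
    # standing in for the VGG-based perceptual model.
    likelihood = np.full((4, 4), 0.2)
    likelihood[true_cell] = 0.3
    posterior = posterior * likelihood   # Bayesian update
    posterior /= posterior.sum()         # renormalize
    if posterior_converged(posterior):
        break

converged_cell = np.unravel_index(posterior.argmax(), posterior.shape)
# Success = converged within 200 steps AND the peak is the correct cell.
success = posterior_converged(posterior) and converged_cell == true_cell
print(step, converged_cell, success)
```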
To train the RL algorithm:
- I modified the network architecture as described in the supplementary material.
- I give the agent a large reward (100) when the posterior finally converges (any element of the posterior matrix is larger than 0.95).
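The reward scheme above can be sketched like this. Again, this is a minimal illustration under my own assumptions (the `localization_reward` name and the sparse zero-elsewhere shaping are hypothetical, not from the repo):

```python
import numpy as np

def localization_reward(posterior, threshold=0.95, terminal_reward=100.0):
    """Hypothetical reward: a large terminal reward once any element of the
    posterior matrix exceeds the threshold, and zero otherwise (sparse)."""
    done = float(posterior.max()) > threshold
    return (terminal_reward if done else 0.0), done

# Not yet converged: a nearly uniform posterior gives zero reward.
r0, done0 = localization_reward(np.full((3, 3), 1.0 / 9))

# Converged: one cell holds almost all the probability mass (0.96 > 0.95).
peaked = np.full((3, 3), 0.005)
peaked[1, 1] = 1.0 - 0.005 * 8
r1, done1 = localization_reward(peaked)
print(r0, done0, r1, done1)
```

With a reward this sparse, the agent sees almost no learning signal until the first convergence events, which may be related to the collapse described below.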
However, the policy learned by RL quickly collapses to one of the three actions, i.e., the agent always applies one specific action. The chosen action may differ across training runs.
I tried decreasing the learning rate to 5e-5 / 5e-6, but it didn't change much.
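One way I quantify the collapse (a diagnostic of my own, not from the repo) is to log the entropy of the empirical action distribution during training; it drops to near zero as the policy degenerates to a single action:

```python
import numpy as np

def action_entropy(action_counts):
    """Entropy (in nats) of the empirical action distribution.
    Near zero means the policy has collapsed onto one action."""
    p = np.asarray(action_counts, dtype=float)
    p = p / p.sum()
    p = p[p > 0]  # drop unused actions so log() is well-defined
    return float(-(p * np.log(p)).sum())

# A healthy 3-action policy vs. the collapsed behavior described above.
healthy = action_entropy([34, 33, 33])   # close to log(3) ~ 1.099 nats
collapsed = action_entropy([198, 1, 1])  # close to 0 nats
print(healthy, collapsed)
```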
anl_random_walk.mp4