The code without syncbn will collapse #8

guanfuchen · 2021-01-21T12:13:37Z

I notice a paper "Momentum2 Teacher: Momentum Teacher with Momentum Statistics for Self-Supervised Learning". It is an interesting work.

The results using all BN will not collapse.

I doubt the results may come from all view L2Norm? Should we split two views for testing?

Hzzone · 2021-10-31T09:06:33Z

Try to increase the weight decay such as 5e-4 including the bn and bias. I have also tried to include the shufflingBN from MoCo which helps a lot. The paper you have mentioned adopted the weight decay of 1e-4 without lars.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The code without syncbn will collapse #8

The code without syncbn will collapse #8

guanfuchen commented Jan 21, 2021

Hzzone commented Oct 31, 2021 •

edited

Loading

The code without syncbn will collapse #8

The code without syncbn will collapse #8

Comments

guanfuchen commented Jan 21, 2021

Hzzone commented Oct 31, 2021 • edited Loading

Hzzone commented Oct 31, 2021 •

edited

Loading