There was an error while loading. Please reload this page.
berkeley-deep-RL-pytorch-solutions/hw2/cs285/policies/MLP_policy.py
Line 115 in 47da611
Shall the loss be averaged by N? I apologize if I am wrong. Do not have much experience with RL. Thanks.