Closed
Description
In the loss_function part of the VAE example, I noticed that
```python
KLD = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
# Normalise by same number of elements as in reconstruction
KLD /= args.batch_size * 784
```
But the dimensionality of the latent variables (logvar, mu) is 20, not 784, so the KLD should either keep the torch.sum and be normalized by args.batch_size * 20, or simply use torch.mean; otherwise the BCE and KLD losses are not properly scaled against each other. Changing the normalization constant from 784 to 20 increases the test error at the end of training, but that is expected: the smaller divisor increases the scale, and hence the effective weight, of the KLD term. A sketch of the proposed fix follows.
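For concreteness, here is a minimal sketch of the proposed fix, assuming the example's loss_function(recon_x, x, mu, logvar) signature and a latent dimension of 20. The LATENT_DIM constant and the explicit batch_size parameter are illustrative names; the example itself reads args.batch_size at module level.

```python
import torch
import torch.nn.functional as F

LATENT_DIM = 20  # dimensionality of mu and logvar in the example

def loss_function(recon_x, x, mu, logvar, batch_size):
    # Reconstruction term: per-element average over batch_size * 784 pixels
    # (F.binary_cross_entropy averages by default)
    BCE = F.binary_cross_entropy(recon_x, x.view(-1, 784))
    # KL(q(z|x) || N(0, I)), summed over batch and latent dimensions
    KLD = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    # Normalise by the number of KLD elements (batch_size * 20),
    # not batch_size * 784, so both terms are per-element averages
    KLD /= batch_size * LATENT_DIM
    # Equivalent one-liner, since mu and logvar have shape (batch_size, 20):
    # KLD = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    return BCE + KLD
```

Note that torch.mean over the (batch_size, 20) tensor is identical to summing and dividing by batch_size * 20 only when the batch is full; on a smaller final batch, dividing by args.batch_size under-weights the KLD slightly.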