[D] Why is the L2 distance used for reconstruction loss in this VAE?
I was looking at the world models paper and saw that they use the L2 distance as reconstruction loss in their variational auto-encoder. All VAEs I’ve seen so far have used the cross entropy loss, so why L2?
submitted by /u/samuelknoche
[link] [comments]