VAE training yields a suboptimal model. We're expecting better performance and think about the following: - increase batch size while keeping the rel. high learning rate - add Dense layer