[D] Autoencoder to reconstruct speech input from melspectrograms
I am trying to train an autoencoder that takes a melspectrogram as input and outputs the same melspectrogram, i.e. a plain reconstruction task. However, the model's output looks like random noise rather than a reconstruction of the input. It would be great if anyone could point me to relevant GitHub repos or papers that tackle this task.
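To make the task concrete, here is a minimal sketch of the kind of reconstruction setup described above. It is not my actual model: it is a linear autoencoder written in plain NumPy, and the data is synthetic low-rank noise standing in for log-mel frames. The function names, the sizes (40 mel bins, 8-dim bottleneck), the learning rate and the step count are all assumptions chosen for illustration.

```python
import numpy as np

def make_fake_log_mel(n_frames=512, n_mels=40, rank=4, seed=0):
    """Synthetic stand-in for log-mel frames (NOT real speech): low-rank signal plus noise."""
    rng = np.random.default_rng(seed)
    latent = rng.normal(size=(n_frames, rank))
    basis = rng.normal(size=(rank, n_mels))
    return latent @ basis + 0.05 * rng.normal(size=(n_frames, n_mels))

def standardize(x, eps=1e-8):
    """Zero mean, unit variance per mel bin (computed over frames)."""
    mean = x.mean(axis=0, keepdims=True)
    std = x.std(axis=0, keepdims=True)
    return (x - mean) / (std + eps)

def train_linear_autoencoder(x, bottleneck=8, lr=0.005, epochs=1000, seed=0):
    """Full-batch gradient descent on the squared reconstruction error.

    Loss = squared error summed over mel bins, averaged over frames.
    Hyperparameters are illustrative assumptions, not tuned values.
    """
    rng = np.random.default_rng(seed)
    n_frames, n_mels = x.shape
    w_enc = 0.1 * rng.normal(size=(n_mels, bottleneck))  # encoder weights
    w_dec = 0.1 * rng.normal(size=(bottleneck, n_mels))  # decoder weights
    losses = []
    for _ in range(epochs):
        code = x @ w_enc       # (n_frames, bottleneck)
        recon = code @ w_dec   # (n_frames, n_mels)
        err = recon - x
        losses.append(float(np.sum(err ** 2) / n_frames))
        grad_recon = 2.0 * err / n_frames         # dLoss/dRecon
        grad_dec = code.T @ grad_recon            # dLoss/dW_dec
        grad_enc = x.T @ (grad_recon @ w_dec.T)   # dLoss/dW_enc
        w_dec -= lr * grad_dec
        w_enc -= lr * grad_enc
    return w_enc, w_dec, losses

if __name__ == "__main__":
    frames = standardize(make_fake_log_mel())
    _, _, history = train_linear_autoencoder(frames)
    print(f"loss at start: {history[0]:.3f}, loss at end: {history[-1]:.3f}")
```

On this toy data the reconstruction error should fall far below its starting value within a few hundred steps, so a sanity check of this kind at least separates problems in the training loop from problems in the data pipeline.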
Thank you! 🙂