[P] Voxceleb dataset trained on Mobilenet for speaker recognition and tuned for speaker verification
Thought maybe some people would be interested in this project I worked on last year. I used The voxceleb data to train MobileNet for speech recognition, the sound data is processed into a spectrogram and then the first and second order derivatives are calculated to get 3 dimensional data. After the training was done I used a siamese model technique to tune the features for verification instead of categorization. The idea of the project was to run the model on a smartphone (Hence why I used MobileNet) and use it for speaker verification.
I’m curious what others think about the techniques used and the results, let me know if you are interested in more details!
https://github.com/jpinedaa/Voice-ML (code is messy as hell, I might organize it later)