Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

[P] Using Tacotron To Make Ben Shapiro Sing

https://www.youtube.com/watch?v=Y2uKVhATv68

100% of the vocals here were generated by my model, not spoken by Ben Shapiro himself, and do not reflect Shapiro’s views. Shapiro’s voice was created with a TTS model I trained using my implementation of the papers “Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis” (https://arxiv.org/abs/1803.09017) and “Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron” (https://arxiv.org/abs/1803.09047), using only just over 2 hours of Shapiro audio (though I suppose that’s more like 3-4 hours worth of speech for the average person). After learning Shapiro’s speech patterns it’s amusing that the speech generated by this model is even faster than the average speed Eminem raps this song (only the part at 3:12 is sped up 1.5x).

submitted by /u/hanyuqn
[link] [comments]