[R] – Speech Model Pre-training for End-to-End Spoken Language Understanding
(reposting, I guess the first time I didn’t have a tag so it was removed)
Here’s a new paper and new dataset for spoken language understanding (SLU):
Paper: arxiv.org/abs/1904.03670
Code: https://github.com/lorenlugosch/pretrain_speech_model
Data: https://www.fluent.ai/research/fluent-speech-commands/
We use transfer learning (pre-train the model on LibriSpeech) to improve end-to-end SLU models, and we introduce a new speech dataset that can be used for SLU experiments, or more generally audio and sequence classification experiments.
I’m the first author; let me know if you have any questions!
submitted by /u/m_nemo_syne
[link] [comments]