[D]: Grapheme-to-Phoneme (G2P) refinement ideas.
Hi all, I am doing a project relating to G2P. I am looking at CMU seq2seq G2P project as a starting point. Basically, for the world “Hello”, G2P gives “HH EH L OW”, which is almost what I want. I want additional separation between distinct sounds, more like, “HH EH – L OW” (i.e, the additional “-” separator), or “machine” –> “M AH – SH IY N” (instead of just “M AH SH IY N”). Re-labeling the CMU training dictionary is one way, but I wonder if there is some simpler methods? Thank you!