[D] Bahdanau attention model
Hello. I am a machine learning enthusiast. Recently I got interested in NLP and I found attention model interesting.
I was reading https://arxiv.org/pdf/1409.0473.pdf
and I couldn’t find how to compute Cz, Cr, and C.
You can find them in the paper, Appendix A where they explain how to compute update gates and reset gates.
I have searched on google but seems like people don’t mention how to compute them.
- How to compute Cz, Cr, and C in Bahdanau attention model?
- Where should I ask these questions? ( I am new to Machine learning and don’t have anyone to ask in person)
- Am I focus on too much detail? Should I just use libraries which has pre-built attention models? Actually I am working on a simple chat-bot project.
submitted by /u/wyzkssm
[link] [comments]