[D] Q about “Conv Seq 2 Seq learning” paper
I’m reading this https://arxiv.org/pdf/1705.03122.pdf
On page 3, 2nd paragraph I don’t understand what W and b_w are. I get that the inputs are a (k x d) matrix, but how is the convolution performed and why is the output Y size (1 x 2d)?
submitted by /u/ME_PhD
[link] [comments]