[D] Cloze Driven Pre Training of Self Attention networks code
This model (https://arxiv.org/pdf/1903.07785.pdf) seems to be the current sota for NER. I was looking to replicate the results, but it looks like the authors have not published code. Shot it in the dark, but has anyone done this already, and have code available?
submitted by /u/csuiuc22
[link] [comments]