[D] What is the current state of the art in unsupervised document/information retrieval for NLP tasks?
Hello everybody,
Are there any good unsupervised methods for retrieving the top-k documents from a corpus given a fairly short query?
I did a bit of googling but couldn’t find anything that isn’t tf-idf based.
Maybe it would be possible to compute similarities between the docs and the query using contextual embeddings (such as those from BERT), and then use some sort of scoring function to rank the docs.
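A minimal sketch of that idea, assuming the query and each doc have already been turned into fixed-size vectors (for example by pooling BERT token embeddings, which is not shown here). Cosine similarity is just one possible scoring function, and the names `cosine`, `top_k`, `doc_vecs` and `query_vec` as well as the 3-dimensional toy vectors are made up for illustration:

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat an all-zero vector as matching nothing
    return dot / (norm_u * norm_v)


def top_k(query_vec, doc_vecs, k):
    """Return (doc_index, score) pairs for the k highest-scoring docs."""
    scored = [(cosine(query_vec, vec), idx) for idx, vec in enumerate(doc_vecs)]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(idx, score) for score, idx in scored[:k]]


# Toy vectors standing in for real embeddings (assumption: in practice
# these would come from an encoder and have hundreds of dimensions).
doc_vecs = [
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.2],
    [0.7, 0.3, 0.1],
]
query_vec = [1.0, 0.0, 0.0]

results = top_k(query_vec, doc_vecs, k=2)
for idx, score in results:
    print(f"doc {idx}: {score:.3f}")
```

This brute-force version scores every doc against the query, which is fine for a small corpus; a large corpus would probably need some kind of approximate nearest-neighbour index instead.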
Anyway, thank you in advance for your answers.
submitted by /u/Slowai