[D] Are there examples of using QA systems to determine if an answer given is feasible?
Say I was using Bert and trained it on Squad 2.0 (which I have done) and came across:
Question: What is your favorite color?
Possible (Correct) Answer given:Blue
Possible (Wrong) Answer given:Lasagna
The idea would be that a model would predict Blue on the first one and nothing on the second one (implying it was not a feasible answer).
Is there any research or ideas on how (if possible) you could train a model like a Bert to do that? I feel like it should be doable however my current results with training on Squad 2.0 were not extremely promising so I’m not sure if I’m not thinking about it problem correct or if there is some research out there on how to better approach this.