[D] Can I use named entity recognition and multitext classification to train spacy to link key value pairs from form data?
Like the title asks, if I have a string like “address 1234 home street”, can I get spacy to recognize that the key is address and the value is 1234 home street? Or I guess the better question is, if I just have a string like 1234 home street, can i get spacy to recognize that as an address without further context around the string? It’s not in a sentence since it’s derived from form data extracted via OCR. More difficult, get spacy to recognize a string that isn’t a traditional category like an address, but a custom category?
How do I start researching this?
submitted by /u/bigdbag999
[link] [comments]