[D] Why does Computer Vision still suck?
I have a pet project built with the Computer Vision service from Microsoft. Sometimes it provides very accurate annotations and descriptions, like ‘A view of a snow covered mountain’ (confidence 0.97) for an image of a mountain, but mostly it’s utter garbage, like ‘A motorcycle is parked on the side of a road’ (confidence 0.8) for a Formula 1 car.
The Vision AI service from Google does even worse.
I’m not seeing any significant improvements in this field at all. You can get a very realistic image of an older you, but no one is able to reliably annotate even a simple photo yet.
Do you think we will have truly working Computer Vision within the next few years?