Hey guys,
I’m working on a project with a very large dataset built around the term “apple” (case-insensitive). Each entry is a piece of text containing that word, and my job is to determine whether it’s talking about Apple the company or something else.
There are so many ways to do this and I can’t seem to find the best one. I guess it could be done with zero machine learning, but as a lazy data scientist I want the process to be as automated as possible (so that it generalizes to other words).
I tried some NLP techniques, like bag-of-words followed by k-means clustering, but the results were horrible.
The problem is that there is no labeled dataset.
I have some ideas, like a proper noun / common noun classifier, or using Wikipedia to build a context vocabulary for each sense of the word.
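To make the context-vocabulary idea concrete, here is a rough sketch of what I have in mind. It is only an illustration: the two word sets are hand-picked placeholders standing in for vocabularies that would really be extracted from Wikipedia, and `classify` and the labels are names I made up. It only contrasts the company with the fruit sense, so "other" is really just "fruit-like context" here.

```python
import re
from collections import Counter

# Hypothetical seed vocabularies (assumption): in practice these would be
# extracted from Wikipedia articles about the company and about the fruit.
COMPANY_VOCAB = {"iphone", "ipad", "mac", "macbook", "ios",
                 "stock", "shares", "ceo", "app", "store"}
FRUIT_VOCAB = {"fruit", "tree", "pie", "juice", "orchard",
               "cider", "ripe", "sweet", "recipe", "eat"}

TOKEN_RE = re.compile(r"[a-z0-9]+")


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return TOKEN_RE.findall(text.lower())


def classify(text):
    """Label a text by which context vocabulary it overlaps with more.

    Returns "company", "other", or "unknown" when the two counts are tied
    (including the case where neither vocabulary matches at all).
    """
    counts = Counter(tokenize(text))
    company_hits = sum(counts[w] for w in COMPANY_VOCAB)
    fruit_hits = sum(counts[w] for w in FRUIT_VOCAB)
    if company_hits > fruit_hits:
        return "company"
    if fruit_hits > company_hits:
        return "other"
    return "unknown"


if __name__ == "__main__":
    for sample in [
        "Apple shares rose after the new iPhone launch",
        "I baked an apple pie with fruit from the orchard",
        "I like apple",
    ]:
        print(classify(sample), "<-", sample)
```

Texts that come back as "unknown" would still need another signal, which is where something like the proper noun / common noun check might help.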
Any ideas? Thanks.
submitted by /u/aewin678