Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

Category: Reddit MachineLearning

[P] Comparing 50 models Neural Machine Translation using Tensorflow

Accuracy based on word position not included padding. 80% of the dataset to train, 20% of the dataset to test. Dataset is English-Vietnam, Trainset to train, validation and test set to test.

  1. basic-seq2seq, test accuracy 10.34%
  2. lstm-seq2seq, test accuracy 11.89%
  3. gru-seq2seq, test accuracy 11.50%
  4. basic-seq2seq-contrib-greedy, test accuracy 25.28%
  5. lstm-seq2seq-contrib-greedy, test accuracy 33.09%
  6. gru-seq2seq-contrib-greedy, test accuracy 31.28%
  7. basic-birnn-seq2seq, test accuracy 12.55%
  8. lstm-birnn-seq2seq, test accuracy 12.11%
  9. gru-birnn-seq2seq, test accuracy 11.98%
  10. basic-birnn-seq2seq-contrib-greedy, test accuracy 27.50%
  11. lstm-birnn-seq2seq-contrib-greedy, test accuracy 34.25%
  12. gru-birnn-seq2seq-greedy, test accuracy 32.58%
  13. basic-seq2seq-luong, test accuracy 2.40%
  14. lstm-seq2seq-luong, test accuracy 13.08%
  15. gru-seq2seq-luong, test accuracy 7.35%
  16. basic-seq2seq-bahdanau, test accuracy 13.22%
  17. lstm-seq2seq-bahdanau, test accuracy 13.38%
  18. gru-seq2seq-bahdanau, test accuracy 14.02%
  19. basic-birnn-seq2seq-bahdanau, test accuracy 13.88%
  20. lstm-birnn-seq2seq-bahdanau, test accuracy 13.16%
  21. gru-birnn-seq2seq-bahdanau, test accuracy 13.47%
  22. basic-birnn-seq2seq-luong, test accuracy 7.49%
  23. lstm-birnn-seq2seq-luong, test accuracy 13.27%
  24. gru-birnn-seq2seq-luong, test accuracy 13.76%
  25. lstm-seq2seq-contrib-greedy-luong, test accuracy 45.52%
  26. gru-seq2seq-contrib-greedy-luong, test accuracy 8.14%
  27. lstm-seq2seq-contrib-greedy-bahdanau, test accuracy 43.88%
  28. gru-seq2seq-contrib-greedy-bahdanau, test accuracy 44.13%
  29. lstm-seq2seq-contrib-beam-bahdanau, test accuracy 24.49%
  30. gru-seq2seq-contrib-beam-bahdanau, test accuracy 22.26%
  31. lstm-birnn-seq2seq-contrib-beam-luong, test accuracy 24.15%
  32. gru-birnn-seq2seq-contrib-beam-luong, test accuracy 22.32%
  33. lstm-birnn-seq2seq-contrib-luong-bahdanau-beam
  34. gru-birnn-seq2seq-contrib-luong-bahdanau-beam
  35. bytenet-greedy
  36. capsule-lstm-seq2seq-contrib-greedy
  37. capsule-gru-seq2seq-contrib-greedy
  38. dnc-seq2seq-bahdanau-greedy
  39. dnc-seq2seq-luong-greedy
  40. lstm-birnn-seq2seq-beam-luongmonotic, test accuracy 27.23%
  41. lstm-birnn-seq2seq-beam-bahdanaumonotic, test accuracy 26.34%
  42. memory-network-lstm-seq2seq-contrib, test accuracy 28.02%
  43. attention-is-all-you-need-beam, test accuracy 37.80%
  44. conv-seq2seq, test accuracy 33.73%
  45. conv-encoder-lstm-decoder, test accuracy 32.91%
  46. dilated-conv-seq2seq, test accuracy 33.17%
  47. gru-birnn-seq2seq-greedy-residual, test accuracy 34.35%
  48. google-nmt, test accuracy 33.09%
  49. bert-multilanguage-transformer-decoder-beam, test accuracy 44.69%
  50. xlnet-base-transformer-decoder-beam, test accuracy 28.83%

Link to repository, https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/neural-machine-translation

Link to dataset, https://github.com/stefan-it/nmt-en-vi#dataset

Discussion

  1. Based on 20 epochs only.
  2. Accuracy based on word positions.
  3. Some results are empty because the models are slow to train, still waiting for the results.
  4. Sort from shortest length to longest length and do bucketing from it will improve the accuracy.

submitted by /u/huseinzol05
[link] [comments]

[D] Are filters from a particular Convolutional layer for a given CNN chosen at random by random initialization of weights in that network?

In a Convolutional Layer of a given Convolutional Neural Network there is a defined input of size NxMxD. Whereas N and M stand for a dimension of an input image (can be smaller than the original size of an image due to pooling) and as I understand, D stands for a number of filters used in Convolution. My question is how network decides, what are the best filters for a given layer?

submitted by /u/rathernot000
[link] [comments]

[D]Where can I find a data set to identify emerging and growing food concepts?

I want to mine an open data source do identify emerging and growing food concepts as title says. Concepts such as ingredients, flavor, spices etc. Do you know any source that can provide me with such information?

So far I found https://www.wired.com/story/how-grubhub-analyzed-4000-dishes-to-predict-your-next-order/ this article. I did some digging but I couldn’t find the data set they used. That was the closest I could get.

submitted by /u/DoIHAVeaNIdenTItY
[link] [comments]

[D] Implications of retraining a model (with transfer learning) with the same object?

Hi,

I got some doubts about transfer learning.

I have this pre-trained model X, trained on the COCO dataset (its a dataset made of images; cars, persons, apples, etc…), and I want to retrain it using transfer learning with a new dataset made of one object that’s already part of the COCO dataset: apples. I’m doing so because I want to use my apple dataset, and not the COCO one.

Now comes my doubt. What will happen with the previous knowledge the model has about apples? Will it differentiate my apples from COCO’s? Also, what if I re-label my apples to “manzanas” (apple in Spanish), will there be some sort of conflict between the “manzana” and the already known “apple” label?

Hope the question makes sense. Thanks.

submitted by /u/pokedata
[link] [comments]

[D] Suggestion for Machine Learning-related college capstone project

Hello,

I posted in this subreddit regarding a proposal for my undergraduate capstone project and got some feedback pointing out that perhaps stock prediction isn’t very well suited for what I was studying.

I wanted to know if you could recommend some subjects or areas you’d find interesting to research and build a project about using machine learning. Especially using newer models such as CNNs and RNNs or other state-of-the-art algorithms, since that’s what my school expects the project to cover.

Thank you.

submitted by /u/BL7599
[link] [comments]

[D] Machine Learning – WAYR (What Are You Reading) – Week 70

This is a place to share machine learning research papers, journals, and articles that you’re reading this week. If it relates to what you’re researching, by all means elaborate and give us your insight, otherwise it could just be an interesting paper you’ve read.

Please try to provide some insight from your understanding and please don’t post things which are present in wiki.

Preferably you should link the arxiv page (not the PDF, you can easily access the PDF from the summary page but not the other way around) or any other pertinent links.

Previous weeks :

1-10 11-20 21-30 31-40 41-50 51-60 61-70
Week 1 Week 11 Week 21 Week 31 Week 41 Week 51 Week 61
Week 2 Week 12 Week 22 Week 32 Week 42 Week 52 Week 62
Week 3 Week 13 Week 23 Week 33 Week 43 Week 53 Week 63
Week 4 Week 14 Week 24 Week 34 Week 44 Week 54 Week 64
Week 5 Week 15 Week 25 Week 35 Week 45 Week 55 Week 65
Week 6 Week 16 Week 26 Week 36 Week 46 Week 56 Week 66
Week 7 Week 17 Week 27 Week 37 Week 47 Week 57 Week 67
Week 8 Week 18 Week 28 Week 38 Week 48 Week 58 Week 68
Week 9 Week 19 Week 29 Week 39 Week 49 Week 59 Week 69
Week 10 Week 20 Week 30 Week 40 Week 50 Week 60

Most upvoted papers two weeks ago:

/u/zephyrzilla: https://arxiv.org/abs/1908.03770

/u/Moseyic: Exploration by Disagreement

Besides that, there are no rules, have fun.

submitted by /u/ML_WAYR_bot
[link] [comments]

[R] Audio Conversion GAN with Unpaired Data

For the past month I have been working on voice conversion using unpaired data. I naively applied image conversion algorithms to audio spectrograms and after working out a few obstacles I got convincing, although not perfect, results.

Using the exact same algorithm on music genre conversion is also possible and the results, despite a fairly shallow generator with very low capacity, are pretty interesting.

Here are some examples:

https://youtu.be/3BN577LK62Y

The model is able to translate audio signals of any length and does not use any vocoder.

I cannot find papers with similar approaches, and I don’t really know what I should do with this research. Being an Engineering student and not understanding how the academic world works, maybe a simple article and a code release is the best idea.

Thank you for your attention!

submitted by /u/artika_labs
[link] [comments]

[P] Content Update in PyTorch NLP Tutorial repo

Content Update in PyTorch NLP Tutorial repo.

Text Classification, with simple annotation.

  • Dataset: HuffPost news corpus including corresponding category.
  • Pre-trained word vectors: How pre-trained word representations affect model performance (via ablation study)

The model trained on this dataset identify the category of news article based on their headlines and descriptions.

https://github.com/lyeoni/nlp-tutorial/tree/master/news-category-classifcation

submitted by /u/lyeoni
[link] [comments]

[Project] A.I. Learns the Best Way of Earning and Saving Money (Simulation)

Hey. I have a degree in Computer Science with a focus on AI and ML. I truly believe we can do more to popularize the concept of AI. I will be doing this through simulations were AI learns to deal with a simple problem and gives an answer to a specific question. I created the first one. Please take a look if you’re interested. Feedback and ideas for the next one would be great 🙂

https://www.youtube.com/watch?time_continue=1&v=l2jVMfOhTKY

submitted by /u/mrsailor23
[link] [comments]

[D] How would you define the resume parsing task?

I am thinking of segmenting based on keywords and afterwards using Named Entity Recognition finetuned for each segment to extract different fields like:

  • Experience: company name, job title etc.
  • Education: school, major etc.

However I wanted to see if there is a better approach for this problem. Any recommendations/references is welcome!

submitted by /u/nottakumasato
[link] [comments]