Category: Reddit MachineLearning

[P] Comparing 50 models Neural Machine Translation using Tensorflow

Written on September 8, 2019. Posted in Reddit MachineLearning.

Accuracy based on word position not included padding. 80% of the dataset to train, 20% of the dataset to test. Dataset is English-Vietnam, Trainset to train, validation and test set to test.

basic-seq2seq, test accuracy 10.34%
lstm-seq2seq, test accuracy 11.89%
gru-seq2seq, test accuracy 11.50%
basic-seq2seq-contrib-greedy, test accuracy 25.28%
lstm-seq2seq-contrib-greedy, test accuracy 33.09%
gru-seq2seq-contrib-greedy, test accuracy 31.28%
basic-birnn-seq2seq, test accuracy 12.55%
lstm-birnn-seq2seq, test accuracy 12.11%
gru-birnn-seq2seq, test accuracy 11.98%
basic-birnn-seq2seq-contrib-greedy, test accuracy 27.50%
lstm-birnn-seq2seq-contrib-greedy, test accuracy 34.25%
gru-birnn-seq2seq-greedy, test accuracy 32.58%
basic-seq2seq-luong, test accuracy 2.40%
lstm-seq2seq-luong, test accuracy 13.08%
gru-seq2seq-luong, test accuracy 7.35%
basic-seq2seq-bahdanau, test accuracy 13.22%
lstm-seq2seq-bahdanau, test accuracy 13.38%
gru-seq2seq-bahdanau, test accuracy 14.02%
basic-birnn-seq2seq-bahdanau, test accuracy 13.88%
lstm-birnn-seq2seq-bahdanau, test accuracy 13.16%
gru-birnn-seq2seq-bahdanau, test accuracy 13.47%
basic-birnn-seq2seq-luong, test accuracy 7.49%
lstm-birnn-seq2seq-luong, test accuracy 13.27%
gru-birnn-seq2seq-luong, test accuracy 13.76%
lstm-seq2seq-contrib-greedy-luong, test accuracy 45.52%
gru-seq2seq-contrib-greedy-luong, test accuracy 8.14%
lstm-seq2seq-contrib-greedy-bahdanau, test accuracy 43.88%
gru-seq2seq-contrib-greedy-bahdanau, test accuracy 44.13%
lstm-seq2seq-contrib-beam-bahdanau, test accuracy 24.49%
gru-seq2seq-contrib-beam-bahdanau, test accuracy 22.26%
lstm-birnn-seq2seq-contrib-beam-luong, test accuracy 24.15%
gru-birnn-seq2seq-contrib-beam-luong, test accuracy 22.32%
lstm-birnn-seq2seq-contrib-luong-bahdanau-beam
gru-birnn-seq2seq-contrib-luong-bahdanau-beam
bytenet-greedy
capsule-lstm-seq2seq-contrib-greedy
capsule-gru-seq2seq-contrib-greedy
dnc-seq2seq-bahdanau-greedy
dnc-seq2seq-luong-greedy
lstm-birnn-seq2seq-beam-luongmonotic, test accuracy 27.23%
lstm-birnn-seq2seq-beam-bahdanaumonotic, test accuracy 26.34%
memory-network-lstm-seq2seq-contrib, test accuracy 28.02%
attention-is-all-you-need-beam, test accuracy 37.80%
conv-seq2seq, test accuracy 33.73%
conv-encoder-lstm-decoder, test accuracy 32.91%
dilated-conv-seq2seq, test accuracy 33.17%
gru-birnn-seq2seq-greedy-residual, test accuracy 34.35%
google-nmt, test accuracy 33.09%
bert-multilanguage-transformer-decoder-beam, test accuracy 44.69%
xlnet-base-transformer-decoder-beam, test accuracy 28.83%

Link to repository, https://github.com/huseinzol05/NLP-Models-Tensorflow/tree/master/neural-machine-translation

Link to dataset, https://github.com/stefan-it/nmt-en-vi#dataset

Discussion

Based on 20 epochs only.
Accuracy based on word positions.
Some results are empty because the models are slow to train, still waiting for the results.
Sort from shortest length to longest length and do bucketing from it will improve the accuracy.

submitted by /u/huseinzol05
[link] [comments]

[D] Are filters from a particular Convolutional layer for a given CNN chosen at random by random initialization of weights in that network?

Written on September 8, 2019. Posted in Reddit MachineLearning.

In a Convolutional Layer of a given Convolutional Neural Network there is a defined input of size NxMxD. Whereas N and M stand for a dimension of an input image (can be smaller than the original size of an image due to pooling) and as I understand, D stands for a number of filters used in Convolution. My question is how network decides, what are the best filters for a given layer?

submitted by /u/rathernot000
[link] [comments]

[D]Where can I find a data set to identify emerging and growing food concepts?

Written on September 8, 2019. Posted in Reddit MachineLearning.

I want to mine an open data source do identify emerging and growing food concepts as title says. Concepts such as ingredients, flavor, spices etc. Do you know any source that can provide me with such information?

So far I found https://www.wired.com/story/how-grubhub-analyzed-4000-dishes-to-predict-your-next-order/ this article. I did some digging but I couldn’t find the data set they used. That was the closest I could get.

submitted by /u/DoIHAVeaNIdenTItY
[link] [comments]

[D] Implications of retraining a model (with transfer learning) with the same object?

Written on September 8, 2019. Posted in Reddit MachineLearning.

Hi,

I got some doubts about transfer learning.

I have this pre-trained model X, trained on the COCO dataset (its a dataset made of images; cars, persons, apples, etc…), and I want to retrain it using transfer learning with a new dataset made of one object that’s already part of the COCO dataset: apples. I’m doing so because I want to use my apple dataset, and not the COCO one.

Now comes my doubt. What will happen with the previous knowledge the model has about apples? Will it differentiate my apples from COCO’s? Also, what if I re-label my apples to “manzanas” (apple in Spanish), will there be some sort of conflict between the “manzana” and the already known “apple” label?

Hope the question makes sense. Thanks.

submitted by /u/pokedata
[link] [comments]

[D] Suggestion for Machine Learning-related college capstone project

Written on September 7, 2019. Posted in Reddit MachineLearning.

Hello,

I posted in this subreddit regarding a proposal for my undergraduate capstone project and got some feedback pointing out that perhaps stock prediction isn’t very well suited for what I was studying.

I wanted to know if you could recommend some subjects or areas you’d find interesting to research and build a project about using machine learning. Especially using newer models such as CNNs and RNNs or other state-of-the-art algorithms, since that’s what my school expects the project to cover.

Thank you.

submitted by /u/BL7599
[link] [comments]

[D] Machine Learning – WAYR (What Are You Reading) – Week 70

Written on September 7, 2019. Posted in Reddit MachineLearning.

This is a place to share machine learning research papers, journals, and articles that you’re reading this week. If it relates to what you’re researching, by all means elaborate and give us your insight, otherwise it could just be an interesting paper you’ve read.

Please try to provide some insight from your understanding and please don’t post things which are present in wiki.

Preferably you should link the arxiv page (not the PDF, you can easily access the PDF from the summary page but not the other way around) or any other pertinent links.

Previous weeks :

1-10	11-20	21-30	31-40	41-50	51-60	61-70
Week 1	Week 11	Week 21	Week 31	Week 41	Week 51	Week 61
Week 2	Week 12	Week 22	Week 32	Week 42	Week 52	Week 62
Week 3	Week 13	Week 23	Week 33	Week 43	Week 53	Week 63
Week 4	Week 14	Week 24	Week 34	Week 44	Week 54	Week 64
Week 5	Week 15	Week 25	Week 35	Week 45	Week 55	Week 65
Week 6	Week 16	Week 26	Week 36	Week 46	Week 56	Week 66
Week 7	Week 17	Week 27	Week 37	Week 47	Week 57	Week 67
Week 8	Week 18	Week 28	Week 38	Week 48	Week 58	Week 68
Week 9	Week 19	Week 29	Week 39	Week 49	Week 59	Week 69
Week 10	Week 20	Week 30	Week 40	Week 50	Week 60

Most upvoted papers two weeks ago:

/u/zephyrzilla: https://arxiv.org/abs/1908.03770

/u/Moseyic: Exploration by Disagreement

Besides that, there are no rules, have fun.

submitted by /u/ML_WAYR_bot
[link] [comments]

[R] Audio Conversion GAN with Unpaired Data

Written on September 7, 2019. Posted in Reddit MachineLearning.

For the past month I have been working on voice conversion using unpaired data. I naively applied image conversion algorithms to audio spectrograms and after working out a few obstacles I got convincing, although not perfect, results.

Using the exact same algorithm on music genre conversion is also possible and the results, despite a fairly shallow generator with very low capacity, are pretty interesting.

Here are some examples:

https://youtu.be/3BN577LK62Y

The model is able to translate audio signals of any length and does not use any vocoder.

I cannot find papers with similar approaches, and I don’t really know what I should do with this research. Being an Engineering student and not understanding how the academic world works, maybe a simple article and a code release is the best idea.

Thank you for your attention!

submitted by /u/artika_labs
[link] [comments]

[P] Content Update in PyTorch NLP Tutorial repo

Written on September 7, 2019. Posted in Reddit MachineLearning.

Content Update in PyTorch NLP Tutorial repo.

– Text Classification, with simple annotation.

Dataset: HuffPost news corpus including corresponding category.
Pre-trained word vectors: How pre-trained word representations affect model performance (via ablation study)

The model trained on this dataset identify the category of news article based on their headlines and descriptions.

https://github.com/lyeoni/nlp-tutorial/tree/master/news-category-classifcation

submitted by /u/lyeoni
[link] [comments]

[Project] A.I. Learns the Best Way of Earning and Saving Money (Simulation)

Written on September 7, 2019. Posted in Reddit MachineLearning.

Hey. I have a degree in Computer Science with a focus on AI and ML. I truly believe we can do more to popularize the concept of AI. I will be doing this through simulations were AI learns to deal with a simple problem and gives an answer to a specific question. I created the first one. Please take a look if you’re interested. Feedback and ideas for the next one would be great 🙂

https://www.youtube.com/watch?time_continue=1&v=l2jVMfOhTKY

submitted by /u/mrsailor23
[link] [comments]

[D] How would you define the resume parsing task?

Written on September 7, 2019. Posted in Reddit MachineLearning.

I am thinking of segmenting based on keywords and afterwards using Named Entity Recognition finetuned for each segment to extract different fields like:

Experience: company name, job title etc.
Education: school, major etc.

However I wanted to see if there is a better approach for this problem. Any recommendations/references is welcome!

submitted by /u/nottakumasato
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Category: Reddit MachineLearning

[P] Comparing 50 models Neural Machine Translation using Tensorflow

Discussion

[D] Are filters from a particular Convolutional layer for a given CNN chosen at random by random initialization of weights in that network?

[D]Where can I find a data set to identify emerging and growing food concepts?

[D] Implications of retraining a model (with transfer learning) with the same object?

[D] Suggestion for Machine Learning-related college capstone project

[D] Machine Learning – WAYR (What Are You Reading) – Week 70

[R] Audio Conversion GAN with Unpaired Data

[P] Content Update in PyTorch NLP Tutorial repo

[Project] A.I. Learns the Best Way of Earning and Saving Money (Simulation)

[D] How would you define the resume parsing task?