Category: Reddit MachineLearning

[D] Per channel or per sample Loss calculation and averaging in a batch ?

Written on August 4, 2019. Posted in Reddit MachineLearning.

Let’s say we have an N-class semantic segmentation problem. Now on each iteration (for each batch) we can calculate Dice loss in two ways: (1) calculate average loss over classes for each sample in a batch and after that get the average over batch, or (2) calculate average loss per class in a batch and then average over classes presented in a batch. Which one is better and why? Or there is no difference at all? Can it affect on how model learns to segment small or big objects? Any related articles?

submitted by /u/AdelSexy
[link] [comments]

[P] We are working on a tool to explain the predictions of machine learning models. Can you give us feedback?

Written on August 4, 2019. Posted in Reddit MachineLearning.

We are working on a tool, which is supposed to help developers test, understand and explain their machine learning models. The tool will be open-source and free to use.

We are in the prototyping stage and have a prototype ready to show. If anyone would like to have a look and give us feedback, that would be awesome!

If you have experience with testing models and frameworks like SHAP and LIME, it would be even more valuable.

submitted by /u/research-panda
[link] [comments]

[D] Multilayer hidden to hidden transformation in RNN (GRU/LSTM)

Written on August 4, 2019. Posted in Reddit MachineLearning.

Hello guys!
I’m training GRU neural network with single GRU layer (among other layers), and I tend to think that hidden to hidden transformation requires severe non-linearity to correctly “merge” memory with current timestep (and thus update hidden).
How do I approach this, what is the best practice?
Should I add more GRU layers or should I, for instance, add extra layers to hidden to hidden transformation with nonlinearity like relu?
If I take second approach I guess I should use tanh instead of relu to avoid exploding gradients, am I correct?

Thanks in advance.

submitted by /u/Pryanik88
[link] [comments]

[D] Does anyone work with sparse training? If so, what architectures ? Would you like a library for this?

Written on August 4, 2019. Posted in Reddit MachineLearning.

I am currently doing a literature search of sparse training architectures, I am thinking of making a library for sparse training and hoping to make it as universal as possible. For that, I would need to get a good idea for the various use cases.

submitted by /u/BatmantoshReturns
[link] [comments]

[D] The fit, transform and fit_transform methods. What do they do, exactly?

Written on August 4, 2019. Posted in Reddit MachineLearning.

ML newbie here. I see ALL ML classes have these methods. Can someone explain, in plain English, what do they do?

Also, what is the difference between using fit, then transform on a dataset and fit_transform on the same?

submitted by /u/radjeep
[link] [comments]

[D] Time series on GBM’s – is row/instance order important?

Written on August 4, 2019. Posted in Reddit MachineLearning.

When building a time series model using something like xgboost or lightgbm, do they only take into account the present row / instance?

Say I build a model that has in one row:

current values at timestamp
lagged values from t-1 to t-5 of the previous values

If the rows/instances are shuffled for k fold cross validation, will that affect prediction ?

submitted by /u/charityworker
[link] [comments]

[D] Web dev to ML questions

Written on August 3, 2019. Posted in Reddit MachineLearning.

I’m interested in transitioning from web development to machine learning.

My math background is mediocre (basic calc) and my only programming experience is in web dev technologies (3 years). I want to be realistic about my effort level and opportunities. I probably won’t be going back to college, but have 10 months left on a work contract and would like to spend any free time learning the basics.

What is the feel of competition in the field? With my time-frame and background, is it reasonable to think I can earn an entry-level position?
I’ve heard the field is broken up into data science and data modeling. Is data modeling more programmatic? Will my experience as a dev be more relevant there?

submitted by /u/shawn123321
[link] [comments]

[P] How we used USE and FAISS to enhance ElasticSearch results

Written on August 3, 2019. Posted in Reddit MachineLearning.

I just wrote an article (quite long) about how we’ve build a semantic similarity index alongside the ElasticSearch and used both to provide smarter search results.

Tools used:

Tensorflow and Simple Tensorflow Serving
Pre-trained Universal Sentence Encoder
Docker
Python 3
FAISS by Facebook
ElasticSearch
gRPC for the API

Link to the article: https://blog.onebar.io/building-a-semantic-search-engine-using-open-source-components-e15af5ed7885

I hope, this would be helpful as a practical “how-to” for anyone, trying to build a semantic search engine for their project.

submitted by /u/maxim_leonovich
[link] [comments]

[N] Pytorch hackathon at Facebook. Aug 8th – Aug 9th. Application only.

Written on August 3, 2019. Posted in Reddit MachineLearning.

https://www.eventbrite.com/e/pytorch-summer-hackathon-in-menlo-park-registration-63756668913

The eventbrite website will give you a welcome message, but you have to wait until your application is reviewed by Facebook staff and give you a confirmation message. Space is limited and not all applicants will be admitted into the hackathon.

Btw, to anyone who got in, the official website send via email is officially up now (it was down earlier this week), you can use it to find teammates.

submitted by /u/PlusImagination
[link] [comments]

[P] FP Growth

Written on August 3, 2019. Posted in Reddit MachineLearning.

I have the following table:

https://i.redd.it/5szak6k1mhe31.png

I need to perform fp growth so that the ‘Student’ tuple in ‘Category’ and ‘Profession’ are considered separately and I can get the pattern [10-20], [Student], [Student]. The code i am using gives me support along with the patterns and it considers ‘Student’ in all the columns as same, how can extract only the patterns from this code and also get the pattern mentioned above?

I have the following code:

dataset = [[’10-20′, ‘Student’, ‘Student’, ‘Chicago’],

[’30-40′, ‘Adult’, ‘Unemployed’, ‘Chicago’],

[’30-40′, ‘Adult’, ‘Finance’, ‘Washington’],

[’10-20′, ‘Student’, ‘Student’, ‘Boston’]]

import pandas as pd

from mlxtend.preprocessing import TransactionEncoder

te = TransactionEncoder()

te_ary = te.fit(dataset).transform(dataset)

df = pd.DataFrame(te_ary, columns=te.columns_)

print(df)

from mlxtend.frequent_patterns import fpgrowth

print(fpgrowth(df, min_support=0.2))

print(fpgrowth(df, min_support=0.2, use_colnames=True))

Current output:

support itemsets

0 0.50 (Student)

1 0.50 (Chicago)

2 0.50 (10-20)

3 0.50 (Adult)

4 0.50 (30-40)

5 0.25 (Unemployed)

6 0.25 (Washington)

7 0.25 (Finance)

8 0.25 (Boston)

9 0.25 (Chicago, Student)

10 0.50 (10-20, Student)

11 0.25 (10-20, Chicago)

12 0.25 (10-20, Chicago, Student)

13 0.25 (Chicago, Adult)

14 0.50 (30-40, Adult)

15 0.25 (30-40, Chicago)

16 0.25 (30-40, Chicago, Adult)

17 0.25 (30-40, Unemployed)

18 0.25 (Unemployed, Adult)

19 0.25 (Chicago, Unemployed)

20 0.25 (30-40, Unemployed, Adult)

21 0.25 (30-40, Chicago, Unemployed)

22 0.25 (Chicago, Unemployed, Adult)

23 0.25 (30-40, Chicago, Unemployed, Adult)

24 0.25 (Washington, 30-40)

25 0.25 (Washington, Adult)

26 0.25 (Washington, 30-40, Adult)

27 0.25 (Washington, Finance)

28 0.25 (30-40, Finance)

29 0.25 (Finance, Adult)

30 0.25 (Washington, 30-40, Finance)

31 0.25 (Washington, Finance, Adult)

32 0.25 (30-40, Finance, Adult)

33 0.25 (Washington, 30-40, Finance, Adult)

34 0.25 (10-20, Boston)

35 0.25 (Student, Boston)

36 0.25 (10-20, Student, Boston)

>>>

submitted by /u/gursi1
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Category: Reddit MachineLearning

[D] Per channel or per sample Loss calculation and averaging in a batch ?

[P] We are working on a tool to explain the predictions of machine learning models. Can you give us feedback?

[D] Multilayer hidden to hidden transformation in RNN (GRU/LSTM)

[D] Does anyone work with sparse training? If so, what architectures ? Would you like a library for this?

[D] The fit, transform and fit_transform methods. What do they do, exactly?

[D] Time series on GBM’s – is row/instance order important?

[D] Web dev to ML questions

[P] How we used USE and FAISS to enhance ElasticSearch results

[N] Pytorch hackathon at Facebook. Aug 8th – Aug 9th. Application only.

[P] FP Growth