Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

Category: Reddit MachineLearning

[D] Per channel or per sample Loss calculation and averaging in a batch ?

Let’s say we have an N-class semantic segmentation problem. Now on each iteration (for each batch) we can calculate Dice loss in two ways: (1) calculate average loss over classes for each sample in a batch and after that get the average over batch, or (2) calculate average loss per class in a batch and then average over classes presented in a batch. Which one is better and why? Or there is no difference at all? Can it affect on how model learns to segment small or big objects? Any related articles?

submitted by /u/AdelSexy
[link] [comments]

[P] We are working on a tool to explain the predictions of machine learning models. Can you give us feedback?

We are working on a tool, which is supposed to help developers test, understand and explain their machine learning models. The tool will be open-source and free to use.

We are in the prototyping stage and have a prototype ready to show. If anyone would like to have a look and give us feedback, that would be awesome!

If you have experience with testing models and frameworks like SHAP and LIME, it would be even more valuable.

submitted by /u/research-panda
[link] [comments]

[D] Multilayer hidden to hidden transformation in RNN (GRU/LSTM)

Hello guys!
I’m training GRU neural network with single GRU layer (among other layers), and I tend to think that hidden to hidden transformation requires severe non-linearity to correctly “merge” memory with current timestep (and thus update hidden).
How do I approach this, what is the best practice?
Should I add more GRU layers or should I, for instance, add extra layers to hidden to hidden transformation with nonlinearity like relu?
If I take second approach I guess I should use tanh instead of relu to avoid exploding gradients, am I correct?

Thanks in advance.

submitted by /u/Pryanik88
[link] [comments]

[D] Time series on GBM’s – is row/instance order important?

When building a time series model using something like xgboost or lightgbm, do they only take into account the present row / instance?

Say I build a model that has in one row:

  • current values at timestamp

  • lagged values from t-1 to t-5 of the previous values

If the rows/instances are shuffled for k fold cross validation, will that affect prediction ?

submitted by /u/charityworker
[link] [comments]

[D] Web dev to ML questions

I’m interested in transitioning from web development to machine learning.

My math background is mediocre (basic calc) and my only programming experience is in web dev technologies (3 years). I want to be realistic about my effort level and opportunities. I probably won’t be going back to college, but have 10 months left on a work contract and would like to spend any free time learning the basics.

  1. What is the feel of competition in the field? With my time-frame and background, is it reasonable to think I can earn an entry-level position?
  2. I’ve heard the field is broken up into data science and data modeling. Is data modeling more programmatic? Will my experience as a dev be more relevant there?

submitted by /u/shawn123321
[link] [comments]

[P] How we used USE and FAISS to enhance ElasticSearch results

I just wrote an article (quite long) about how we’ve build a semantic similarity index alongside the ElasticSearch and used both to provide smarter search results.

Tools used:

Link to the article: https://blog.onebar.io/building-a-semantic-search-engine-using-open-source-components-e15af5ed7885

I hope, this would be helpful as a practical “how-to” for anyone, trying to build a semantic search engine for their project.

submitted by /u/maxim_leonovich
[link] [comments]

[N] Pytorch hackathon at Facebook. Aug 8th – Aug 9th. Application only.

https://www.eventbrite.com/e/pytorch-summer-hackathon-in-menlo-park-registration-63756668913

The eventbrite website will give you a welcome message, but you have to wait until your application is reviewed by Facebook staff and give you a confirmation message. Space is limited and not all applicants will be admitted into the hackathon.

Btw, to anyone who got in, the official website send via email is officially up now (it was down earlier this week), you can use it to find teammates.

submitted by /u/PlusImagination
[link] [comments]

[P] FP Growth

[P] FP Growth

I have the following table:

https://i.redd.it/5szak6k1mhe31.png

I need to perform fp growth so that the ‘Student’ tuple in ‘Category’ and ‘Profession’ are considered separately and I can get the pattern [10-20], [Student], [Student]. The code i am using gives me support along with the patterns and it considers ‘Student’ in all the columns as same, how can extract only the patterns from this code and also get the pattern mentioned above?

I have the following code:

dataset = [[’10-20′, ‘Student’, ‘Student’, ‘Chicago’],

[’30-40′, ‘Adult’, ‘Unemployed’, ‘Chicago’],

[’30-40′, ‘Adult’, ‘Finance’, ‘Washington’],

[’10-20′, ‘Student’, ‘Student’, ‘Boston’]]

import pandas as pd

from mlxtend.preprocessing import TransactionEncoder

te = TransactionEncoder()

te_ary = te.fit(dataset).transform(dataset)

df = pd.DataFrame(te_ary, columns=te.columns_)

df

print(df)

from mlxtend.frequent_patterns import fpgrowth

print(fpgrowth(df, min_support=0.2))

print(fpgrowth(df, min_support=0.2, use_colnames=True))

Current output:

support itemsets

0 0.50 (Student)

1 0.50 (Chicago)

2 0.50 (10-20)

3 0.50 (Adult)

4 0.50 (30-40)

5 0.25 (Unemployed)

6 0.25 (Washington)

7 0.25 (Finance)

8 0.25 (Boston)

9 0.25 (Chicago, Student)

10 0.50 (10-20, Student)

11 0.25 (10-20, Chicago)

12 0.25 (10-20, Chicago, Student)

13 0.25 (Chicago, Adult)

14 0.50 (30-40, Adult)

15 0.25 (30-40, Chicago)

16 0.25 (30-40, Chicago, Adult)

17 0.25 (30-40, Unemployed)

18 0.25 (Unemployed, Adult)

19 0.25 (Chicago, Unemployed)

20 0.25 (30-40, Unemployed, Adult)

21 0.25 (30-40, Chicago, Unemployed)

22 0.25 (Chicago, Unemployed, Adult)

23 0.25 (30-40, Chicago, Unemployed, Adult)

24 0.25 (Washington, 30-40)

25 0.25 (Washington, Adult)

26 0.25 (Washington, 30-40, Adult)

27 0.25 (Washington, Finance)

28 0.25 (30-40, Finance)

29 0.25 (Finance, Adult)

30 0.25 (Washington, 30-40, Finance)

31 0.25 (Washington, Finance, Adult)

32 0.25 (30-40, Finance, Adult)

33 0.25 (Washington, 30-40, Finance, Adult)

34 0.25 (10-20, Boston)

35 0.25 (Student, Boston)

36 0.25 (10-20, Student, Boston)

>>>

submitted by /u/gursi1
[link] [comments]