Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

Category: Reddit MachineLearning

[Discussion] Google patent “GENERATING AUDIO USING NEURAL NETWORKS”

This specification describes how a system implemented as computer programs on one or more computers in one or more locations can generate a sequence of audio data that includes a respective audio sample at each of multiple time steps . For example , the sequence of audio data can represent speech in a particular natural language or a piece of music.

https://patentimages.storage.googleapis.com/1b/d3/ac/60abc0081cdf1b/US20190251987A1.pdf

It seems that this patent covers Wavenet. It has the iconic diagram demonstrating dilated convolutions and the equations for the layers.

submitted by /u/RickMcCoy
[link] [comments]

[D] Solutions for Entity Matching with textual data (i.e. names, addresses)

Currently looking for either open source projects or paid solution that can be used for an Entity Matching problem.

Example:

Entity 1, name: aad, address: aacb, phone : 123

Entity 2, name: aad, address: aacb 1, phone: +4 123

Should be matched as same entity.

So far I have only found this open source solution: https://github.com/anhaidgroup/deepmatcher

Are there more open sourced solutions?

Also, are there any paid solutions for this problem?

I know how I could approach the problem myself, but before I do I want to see what is current SOTA.

Thank you.

submitted by /u/DemiourgosD
[link] [comments]

[D] How to efficiently implement local attention?

I’d like to implement a simple dot-product attention mechanism such that the output at each timestep is computed by attending to the preceding L elements. This is similar to the standard setup for autoregressive attention, but differing in that only a fixed window is attended to at each timestep.

Suppose we are training on sequences of length N and want to compute attention over windows of L elements. The options that I can think of are:

  1. Compute all N2 elements of the attention matrix and apply a mask so that only the N*L elements of interest are used. This is inefficient for L<<N and often impractical for large N due to memory constraints.
  2. Manually window the inputs into overlapping sequences of length L, then apply attention to each window. This only requires N*L dot products, but involves tiling/repeating the inputs (attention keys/values) L times which is impractical for large L.
  3. Manually loop over N and L and individually compute each of the N*L dot products. This is efficient in an algorithmic sense but practically will be terrible if implemented using a high-level DL library.

My question is whether or not this operation can be efficiently computed with high-level DL libraries.

submitted by /u/attention-question
[link] [comments]

[Discussion] Roadblock in building a classification model

I’ve been working on a dataset, something that I’ve never worked on before.

I am in the process of building a classification model which can separate 2 classes using 2 continuous features & around 10-15 multiclass categorical features. The classes are heavily imbalanced (3:1 ratio) & I have over 500k observations.

I’ve tried a few methods like downsampling, class balancing along with a few algorithms like Logistic Regression, KNN, Random Forest, a few Gradient Boosting algorithms etc.. All these models are giving me poor results.

I am working locally & don’t have access to a cloud service, hence I’m not keen on using NNs or SVMs which tend to be more computationally expensive.

What else can I do?

Thanks

submitted by /u/Rockflagandeeeagle
[link] [comments]

[D] Is accurately estimating the contents of a voxel outside of a real 3D model possible?

Imagine we have a training set of real 3D models (by this I mean, for example, 3D models of a rooms in buildings), and with this training set, we train a neural network to understand the relationship between a voxel and the 3D model in which it resides (I hypothesize that something like light would be a factor). Then, we use our neural network to predict one voxel outside of the captured (by a camera) 3D model. Is this possible? Would the desired results be achieved?

submitted by /u/Mjjjokes
[link] [comments]

[D] Medium/TowardsDataScience Noise

Lately in my work and personal research, I have noticed that whenever I google anything related to ML/AI/DS there is a whole page of medium/tds articles. Some of these are actually useful but more often than not they’re garbage and doing a really bad job of presenting a topic of interest.

Do other people have a way of weeding through these? I think Medium should introduce some sort of usefulness tagging mechanism to actually tell if an article does a good job in explaining a concept or not. These days every ‘data scientist’ with little to no communication skills is writing Medium posts.

submitted by /u/humanager
[link] [comments]