Category: Reddit MachineLearning

[P] Character-based OCR model?

Written on August 19, 2019. Posted in Reddit MachineLearning.

Has anybody achieved good results with character-based OCR? I’ve been struggling to train a CNN to recognize common fonts – seems like the best accuracy I can squeeze out is around 75-80%.

This gist shows the Keras model I’ve been working with (and some variations). Fast inference is important for this application, so I’m trying to keep it as lightweight as possible. Character boxes are scaled down to 28×28.

We only need to recognize onscreen text (web pages, documents, etc) – pretty much ideal circumstances for OCR. Some mistakes are expected at the character level (e.g. I vs l vs 1), but what we’re seeing is significantly worse and sometimes pure gibberish. Training data is synthetic but virtually identical to real-world text.

Is it realistic to achieve decent accuracy from a character-based net? Adding recurrent layers and a dictionary would be pretty heavy for this application, so we’re hoping to avoid it. If anybody can provide recommendations I’d love to hear it.

submitted by /u/hundley10
[link] [comments]

[D] Should I include my weights inside my docker container?

Written on August 19, 2019. Posted in Reddit MachineLearning.

I am running my ML inference inside a docker container. Should I include my weights in the image, or should I download them from S3 when the container starts up? From what I can see, the benefits are as follows:

Pros for including: faster startup times since I don’t need to download after startup. Less dependencies since everything is included in the container image

Pros for downloading: separation of weights and code. Easier weight tweaking since I won’t need to redeploy image when changing weights

Thoughts?

submitted by /u/aaronjl33
[link] [comments]

[P] My first ‘real’ Machine Learning project

Written on August 19, 2019. Posted in Reddit MachineLearning.

This is a quick showcase of my first ‘real’ Machine Learning project: ‘Jumping over obstacles’:

https://youtu.be/73ekxXGMpMc

Everything was done from scratch using the Unity3D engine (C#).

Any Feedback or Questions will be appreciated!

submitted by /u/iMilchshake
[link] [comments]

[D] Why is KL Divergence so popular?

Written on August 19, 2019. Posted in Reddit MachineLearning.

In most objective functions comparing a learned and source probability distribution, KL divergence is used to measure their dissimilarity. What advantages does KL divergence have over true metrics like Wasserstein (earth mover’s distance), and Bhattacharyya? Is its asymmetry actually a desired property because the fixed source distribution should be treated differently compared to a learned distribution?

submitted by /u/LemonByte
[link] [comments]

[D] Reflinks vs. Symlinks vs. Hard Links: How They Can Help ML Projects

Written on August 19, 2019. Posted in Reddit MachineLearning.

In ML projects hard links and symbolic links can help us, when setting up new experiments, to rearrange data files quickly and efficiently. However, with traditional links, we run the risk of polluting the data files with erroneous edits.

The article explain details of using links, some cool new stuff in modern file systems (reflinks), and an example of how DVC (Data Version Control) tool leverages this for managing ML project datasets and workflow: Reflinks vs symlinks vs hard links, and how they can help machine learning projects

submitted by /u/cmstrump
[link] [comments]

[D] Node Embedding and GNN for Graphs with both Continuous and Categorical Atrributes

Written on August 19, 2019. Posted in Reddit MachineLearning.

My problem consists of classifying nodes of graphs (number of vertices <500) , nodes are attributed with both Continuous (some numerical features) and Categorical (CountVectorizer) features. I’ve looked in to various articles both about node embedding and graph neural networks but I’m still not sure what algorithms will be the best fit for my task.

What is your recommended approach for my problem?

Sorry for my short explanation, please ask me if you need more details.

submitted by /u/suddenintent
[link] [comments]

[D] SELUs don’t actually solve the dying ReLU problem

Written on August 19, 2019. Posted in Reddit MachineLearning.

One frequently mentioned problem with ReLUs is that they can get stuck outputting nothing but 0s when their input shifts such that every value is negative. SELUs [1] claim to solve this problem.

However, there is another way that activation functions can stop being useful to the network: when they degenerate to a linear function. This can happen with ReLUs, SELUs and some other activation functions when their input shifts such that every value is positive. To demonstrate this I made a simple toy network.

The task is to approximate the ReLU function itself with the function f(x * a + b) * c + d, where x is the input, a, b, c and d are learned scalar values and f is an activation function. Values for x are uniformly chosen from the range [-0.5, 0.5].

If we start with a = 1 and b = 0.5 then all inputs to f will be positive. For many starting points of c and d this will still converge when ReLU is used for f. But for c = 1 and d = -0.5 all common piecewise activation functions will fail, including SELU and ELU.

However, there is a potential activation function that does not exhibit that problem, that I don’t see being talked about a lot: Softplus, defined as log(exp(x) + 1). Its derivative is strictly monotonically increasing and therefor non-linear in every sub range. Using softplus in place of f in the toy example allows it to converge from any starting point. [proof pending]

In the following images you can see the learned function at different numbers of iterations. The starting point a = 1, b = 0.5, c = 1 and d = -0.5 was used. All use Adam optimizer with a learning rate of 0.1 and default values for alpha and beta. The mean absolute difference is minimized. Tensorflow 1.14.0 was used.

ReLU

SELU

Softplus

In practice the inputs to activation functions may follow a long tail distribution making this very unlikely when the loss function is fixed. But for some problems, like adversarial networks, where the loss function itself is learned, this might not be the case.

There are even situations where SELU fails to converge whereas ReLU and ELU do. The following images use the starting point a = 1, b = 0.5, c = 1 and d = 0. Again, all initial inputs to the activation function are positive. As we can see this does not necessarily mean that it is stuck.

ReLU. The slight curve at 0 is a result of under-sampling the function.

SELU. Note how the initial increase in gradient below 0 creates an insurmountable wall of increased loss that gradient descent can’t overcome.

ELU. By having a monotonic gradient it does not have the same problem as SELU.

Softplus

Alternative title for this post: SELU considered harmful. (I mean no offense to the authors, their paper is truly insightful and you should definitely read it!)

[1] https://arxiv.org/abs/1706.02515

submitted by /u/relgukxilef
[link] [comments]

[P] Tensorflow live video generation

Written on August 19, 2019. Posted in Reddit MachineLearning.

I am currently working a project to generate live video clips using biggan

I started by using the default code from the deepmind colab but i am now facing a optimization problem

I need to have as many fps but doing a sess.run() for every frame is very heavy, i tried looking on maybe converting the model to a TF Lite model, but i didn’t find any information on how to do it from a tensorflow hub module

I’m pretty new to tensorflow and i didn’t find any other idea to improve my code, so i would apreciate any help

submitted by /u/Remideza
[link] [comments]

[P] Train CIFAR10 to 94% in 26 SECONDS on a single-GPU

Written on August 19, 2019. Posted in Reddit MachineLearning.

In this blog post, the author introduces a bag of standard and not-so-standard tricks to reduce training time to 34s of a Resnet model on CIFAR10 dataset, or 26s with test-time augmentation.

Blog post: https://myrtle.ai/how-to-train-your-resnet-8-bag-of-tricks/

Colab notebook: https://colab.research.google.com/github/davidcpage/cifar10-fast/blob/master/bag_of_tricks.ipynb

Author: David Page

Original tweet: https://twitter.com/dcpage3/status/1163563850442182657

submitted by /u/youali
[link] [comments]

[D] Uncover new, more meaningful KPIs with Machine Learning

Written on August 19, 2019. Posted in Reddit MachineLearning.

It is well known that machine learning is already helping companies achieve their performance goals by optimizing existing performance metrics. By leveraging the growing volume of data on customer behavior, pricing, competitive action, and operational statistics, it can deliver critical insights in a variety of ways. Machine learning offers many benefits from optimizing marketing or pricing to improving customer service and operational efficiency. However, a recent article in the MIT Sloan Management Review shows that companies are increasingly using machine learning to identify entirely new KPIs to correlate with overall performance.

submitted by /u/seemingly_omniscient
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Category: Reddit MachineLearning

[P] Character-based OCR model?

[D] Should I include my weights inside my docker container?

[P] My first ‘real’ Machine Learning project

[D] Why is KL Divergence so popular?

[D] Reflinks vs. Symlinks vs. Hard Links: How They Can Help ML Projects

[D] Node Embedding and GNN for Graphs with both Continuous and Categorical Atrributes

[D] SELUs don’t actually solve the dying ReLU problem

[P] Tensorflow live video generation

[P] Train CIFAR10 to 94% in 26 SECONDS on a single-GPU

[D] Uncover new, more meaningful KPIs with Machine Learning