Category: Reddit MachineLearning

[D] Any developments on randomly wired neural networks?

Written on September 15, 2019. Posted in Reddit MachineLearning.

I recently read this paper which randomly wires image classification networks using classical random graph generators.

I’m interested in the subject but haven’t been able to find anything on it since. Has anyone heard of any new developments on the subject?

submitted by /u/MartialLemur
[link] [comments]

Literally the best Introduction to Artificial Neural Networks (ANNs)

Written on September 15, 2019. Posted in Reddit MachineLearning.

submitted by /u/theadxy
[link] [comments]

[R] Neural Oblivious Decision Ensembles

Written on September 15, 2019. Posted in Reddit MachineLearning.

TL;DR: authors propose a DenseNet-like ensemble of decision trees, trained end-to-end by backpropagation and beats both xgboost and neural networks on heterogeneous (“tabular”) data.

(IMHO) unlike all other “neural decision tree” methods this one worked out of the box for production scale problems without heavy wizardry.

Differentiable decision tree (figure 1 from arxiv paper)

ArXiv: https://arxiv.org/abs/1909.06312

Source code: https://github.com/Qwicen/node

Abstract:

Nowadays, deep neural networks (DNNs) have become the main instrument for machine learning tasks within a wide range of domains, including vision, NLP, and speech. Meanwhile, in an important case of heterogenous tabular data, the advantage of DNNs over shallow counterparts remains questionable. In particular, there is no sufficient evidence that deep learning machinery allows constructing methods that outperform gradient boosting decision trees (GBDT), which are often the top choice for tabular problems. In this paper, we introduce Neural Oblivious Decision Ensembles (NODE), a new deep learning architecture, designed to work with any tabular data. In a nutshell, the proposed NODE architecture generalizes ensembles of oblivious decision trees, but benefits from both end-to-end gradient-based optimization and the power of multi-layer hierarchical representation learning. With an extensive experimental comparison to the leading GBDT packages on a large number of tabular datasets, we demonstrate the advantage of the proposed NODE architecture, which outperforms the competitors on most of the tasks. We open-source the PyTorch implementation of NODE and believe that it will become a universal framework for machine learning on tabular data.

submitted by /u/justheuristic
[link] [comments]

[P] Set up the CTRL text-generating model on Google Compute Engine with just a few console commands.

Written on September 15, 2019. Posted in Reddit MachineLearning.

Over the weekend I posted a Twitter thread of my experience with CTRL, which received a lot of attention. Today, I’m releasing a script + guides to set up the CTRL text-generating model on Google Compute Engine with just a few console commands: https://github.com/minimaxir/ctrl-gce

I also added a few more generation examples + usability guides.

Let me know how it works for you!

submitted by /u/minimaxir
[link] [comments]

[D] Why are Kaggle prizes so low?

Written on September 15, 2019. Posted in Reddit MachineLearning.

E.g., the new challenge from Lyft has a top prize of $12,000.

A model that outperforms their current implementation would presumably bring in a huge amount of value, so why are they and the other companies who sponsor these challenges so stingy on the payouts?

submitted by /u/mystikaldanger
[link] [comments]

[R] GMLS-Nets for unstructured data

Written on September 15, 2019. Posted in Reddit MachineLearning.

Paper link: https://arxiv.org/pdf/1909.05371.pdf

Just come across this submitted paper. Reading it gives me a different vibe since it must have been written by mathematicians rather than ML researcher. Nevertheless, I am still fresh in this area so I can’t tell if this is a good article or not. Any comments or critics?

submitted by /u/QueueTee314
[link] [comments]

[D] Proximal Policy Optimization in keras (Actor-Critic Method)

Written on September 15, 2019. Posted in Reddit MachineLearning.

This article is written by Chintan Trivedi. Proximal Policy Optimization aka PPO was released by OpenAI in 2017. It is considered as the state-of-the-art algorithm in reinforcement learning. The USP of this article is its simplistic explanations and coding of PPO as well as the accompanying videos. The author also released the code in his github page.

https://towardsdatascience.com/proximal-policy-optimization-tutorial-part-1-actor-critic-method-d53f9afffbf6

submitted by /u/begooboi
[link] [comments]

[D] Convolutional Neural Network with Numpy input Help G

Written on September 15, 2019. Posted in Reddit MachineLearning.

Good evening good sir

I am a newbie with ML, i found this code from fellow redditor parasdahal.

I can make it run with mnist datasets.

Have few questions, i’d really really really appreciate it if someone can help me

https://github.com/iqbaalmuhmd/CNNnumpy

Is it possible to run only test, with our own image input. Or just random image from mnist dataset
What should i pickle to save the parameters?

submitted by /u/youshouldknowsz
[link] [comments]

[P] SpeedTorch. 4x faster pinned CPU -> GPU data transfer than Pytorch pinned CPU tensors, and 110x faster GPU -> CPU transfer. Augment parameter size by hosting on CPU. Use non sparse optimizers (Adadelta, Adamax, RMSprop, Rprop, etc.) for sparse training (word2vec, node2vec, GloVe, NCF, etc.).

Written on September 14, 2019. Posted in Reddit MachineLearning.

https://i.imgur.com/wr4VaUV.png

https://github.com/Santosh-Gupta/SpeedTorch

This is library I made for Pytorch, for fast transfer between pinned CPU tensors and GPU pytorch variables. The inspiration came from needed to train large number of embeddings, which don’t all fit on GPU ram at a desired embedding size, so I needed a faster CPU <-> GPU transfer method. This also allows using any optimizer for sparse training, since every embedding contained in the Pytorch embedding variable receives an update, previously only Pytorch’s SGD, Adagrad, and SparseAdam were suitable for such training.

In addition to augmenting parameter sizes, you can use to increase the speed of which data on your CPU is transferred to Pytorch Cuda variables.

Also, SpeedTorch’s GPU tensors are also overall faster then Pytorch cuda tensors, when taking into account both transferring two and from (overall 2.6x faster). For just transfering to a Pytorch Cuda, Pytorch is still faster, but significantly slower when transfering from a Pytorch Cuda variable.

I have personally used this to nearly double the embedding size of embeddings in two other projects, by holding half the parameters on CPU. The training speed is decent thanks to the fast CPU<->GPU exchange.

https://github.com/Santosh-Gupta/Research2Vec2

https://github.com/Santosh-Gupta/lit2vec2

There’s a bit of a learning curve for the very first time getting started with it, so as soon as you run into any sort of friction, feel free to ask a question on the project gitter

https://gitter.im/SpeedTorch

And I’ll answer them.

https://i.imgur.com/6o8C1BP.gif

submitted by /u/BatmantoshReturns
[link] [comments]

[P] Reinforcement Learning with Raw Actions and Observations in PySC2

Written on September 14, 2019. Posted in Reddit MachineLearning.

This tutorial is the follow up of the previous one, which introduced a recent addition to PySC2 known as raw observations and raw actions. Now we can take that knowledge and attempt to teach our bot how to play using reinforcement learning:

https://medium.com/@skjb/reinforcement-learning-with-raw-actions-and-observations-in-pysc2-af0b6fd8391f?source=friends_link&sk=202c65fce90d72d8c17250992b70bd96

submitted by /u/kiarash-irandoust
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Category: Reddit MachineLearning

[D] Any developments on randomly wired neural networks?

Literally the best Introduction to Artificial Neural Networks (ANNs)

[R] Neural Oblivious Decision Ensembles

[P] Set up the CTRL text-generating model on Google Compute Engine with just a few console commands.

[D] Why are Kaggle prizes so low?

[R] GMLS-Nets for unstructured data

[D] Proximal Policy Optimization in keras (Actor-Critic Method)

[D] Convolutional Neural Network with Numpy input Help G

[P] SpeedTorch. 4x faster pinned CPU -> GPU data transfer than Pytorch pinned CPU tensors, and 110x faster GPU -> CPU transfer. Augment parameter size by hosting on CPU. Use non sparse optimizers (Adadelta, Adamax, RMSprop, Rprop, etc.) for sparse training (word2vec, node2vec, GloVe, NCF, etc.).

[P] Reinforcement Learning with Raw Actions and Observations in PySC2