Category: Reddit MachineLearning

[P] Comparing 7 Deep Dependency parsing models using Tensorflow

Written on September 29, 2019. Posted in Reddit MachineLearning.

Trained on CONLL English Dependency, https://github.com/UniversalDependencies/UD_English-EWT. Train set to train, dev and test sets to test.

Stackpointer and Biaffine-attention originally from https://github.com/XuezheMax/NeuroNLP2 written in Pytorch.

Accuracy based on arc, types and root accuracies after 15 epochs only.

Bidirectional RNN + CRF + Biaffine, arc accuracy 70.48%, types accuracy 65.18%, root accuracy 66.4%
Bidirectional RNN + Bahdanau + CRF + Biaffine, arc accuracy 70.82%, types accuracy 65.33%, root accuracy 66.77%
Bidirectional RNN + Luong + CRF + Biaffine, arc accuracy 71.22%, types accuracy 65.73%, root accuracy 67.23%
BERT Base + CRF + Biaffine, arc accuracy 64.30%, types accuracy 62.89%, root accuracy 74.19%
Bidirectional RNN + Biaffine Attention + Cross Entropy, arc accuracy 72.42%, types accuracy 63.53%, root accuracy 68.51%
BERT Base + Biaffine Attention + Cross Entropy, arc accuracy 72.85%, types accuracy 67.11%, root accuracy 73.93%
Bidirectional RNN + Stackpointer, arc accuracy 61.88%, types accuracy 48.20%, root accuracy 89.39%

Link to repository, https://github.com/huseinzol05/NLP-Models-Tensorflow#dependency-parser

Discussion

Based on 15 epochs only.
No dropout here, feel free to do it.
BERT cannot implemented in Stackpointer model, stack pointer model required each decoder step.

submitted by /u/huseinzol05
[link] [comments]

[D] The problem with anthropomorphizing AI

Written on September 29, 2019. Posted in Reddit MachineLearning.

When you start to humanize (current) AI technologies and describe them in ways you would talk about persons, you can draw all the wrong conclusions. This happens often when we see AI algorithms perform tasks that were previously thought to be off-limits for computers, such as playing Go or detecting cancer or converting text to speech.

Without a reality check on the capabilities and limits of current AI, we tend to have trumped up expectations of what AI can do for us, and become disenchanted when those expectations aren’t met.

https://bdtechtalks.com/2019/01/02/humanizing-ai-deep-learning-alphazero/

submitted by /u/bendee983
[link] [comments]

[R] Class Feature Pyramids for Video Explanation

Written on September 29, 2019. Posted in Reddit MachineLearning.

Hello everyone,

We have recently made available both the code and paper for our project on visual explanations for (spatio-temporal) 3D-CNNs named Class Feature Pyramids for Video Explanation. Through our method we aim at highlighting the specific time and space locations in videos that different layers and neurons in the network consider informative given a specific class by the discovery of a hierarchical feature association.

ArXiv preprint link Github repo link

This work will also be presented in the ICCV 2019 Workshop on Interpreting and Explaining Visual Artificial Intelligence Models

Abstract

Deep convolutional networks are widely used in video action recognition. 3D convolutions are one prominent approach to deal with the additional time dimension. While 3D convolutions typically lead to higher accuracies, the inner workings of the trained models are more difficult to interpret. We focus on creating human-understandable visual explanations that represent the hierarchical parts of spatio-temporal networks. We introduce Class Feature Pyramids, a method that traverses the entire network structure and incrementally discovers kernels at different network depths that are informative for a specific class. Our method does not depend on the network’s architecture or the type of 3D convolutions, supporting grouped and depth-wise convolutions, convolutions in fibers, and convolutions in branches. We demonstrate the method on six state-of-the-art 3D convolution neural networks (CNNs) on three action recognition (Kinetics-400, UCF-101, and HMDB-51) and two egocentric action recognition datasets (EPIC-Kitchens and EGTEA Gaze+).

submitted by /u/Alex_Stergiou
[link] [comments]

[R] Neural Ordinary Differential Equations: the current landscape

Written on September 29, 2019. Posted in Reddit MachineLearning.

After NeurIPS 2018 and the “Neural Ordinary Differential Equations” paper deep learning research has opened up tremendously in this direction and as a result it has been rather difficult to keep up with the latest advancements. To this end I have been collecting relevant papers in the following github repository. Feel free to contribute or suggest changes if you feel something is missing. Hope this is useful to those of you working in this area!

submitted by /u/Zymieth
[link] [comments]

[N] UC Berkeley’s CS 285: Deep Reinforcement Learning

Written on September 29, 2019. Posted in Reddit MachineLearning.

http://rail.eecs.berkeley.edu/deeprlcourse/

Lectures are recorded and live streamed

Material which will be covered:

From supervised learning to decision making

Model-free algorithms: Q-learning, policy gradients, actor-critic

Advanced model learning and prediction

Transfer and multi-task learning, meta-learning

Exploration

Open problems, research talks, invited lectures

There’s a subreddit for this course: r/berkeleydeeprlcourse

submitted by /u/Bayequentist
[link] [comments]

[D] First time to NeurIPS

Written on September 29, 2019. Posted in Reddit MachineLearning.

What are the best venues to go to (tutorials, conference, workshops, workshop dinner, expo, conference reception)?

submitted by /u/ch3njust1n
[link] [comments]

[R] On Model Stability as a Function of Random Seed

Written on September 28, 2019. Posted in Reddit MachineLearning.

submitted by /u/kakoosman
[link] [comments]

[P] Using Machine Learning to detect handguns being held by a person, not in holster – SmarterEveryDay [video 12:55]

Written on September 28, 2019. Posted in Reddit MachineLearning.

Destin posted an video on using Machine Learning to analyze (surveillance) video for guns being held by a person.

Very interesting discussion on training the system, and the challenge of reducing false-positives as people hold cellphones in a similar way to how someone would hold a gun. Also on privacy, regarding the decision to not look for concealed carry.

We built a gun detector using machine learning that works with existing surveillance cameras.

There are many companies working on this type of technology at the moment. We made ours in 2018, and Chad visited my house in January 2019 to film most of this. We didn’t post it for various reasons, but decided now might be a good time. The fidelity of the inference is a function of how you train it…. so obviously the more data points you provide, the better it will perform. This particular type of machine learning is known as object detection, with the computation being performed by a deep neural network “on the edge”. The video stream is processed locally and no internet is needed to find the object in real time. If you’re working on something similar or could use this technology in your systems and would like to collaborate…. (or would simply like to support what we’re doing) please reach out by using this link: https://www.lantern.systems/gun-detector

https://www.youtube.com/watch?v=Lh0x54GC1sw

submitted by /u/misconfig_exe
[link] [comments]

[Project] Implementing Improvements to Hypergradient optimizers

Written on September 28, 2019. Posted in Reddit MachineLearning.

We try improvements to the existing ‘Hypergradient’ based optimizers proposed in the paper Online Learning Rate Adaptation with Hypergradient Descent.

We expect that the hypergradient based learning rate update could be more accurate and aim to exploit the gains much better by boosting the learning rate updates with momentum and adaptive gradients, experimenting with

Hypergradient descent with momentum, and
Adam with Hypergradient,

alongside the model optimizers SGD, SGD with Nesterov(SGDN) and Adam.

The new optimizers are compared with their resepective hypergradient-descent baselines and provide advantages such as better generalization and faster convergence for the loss function. The code and the results of our experiments are available at https://github.com/harshalmittal4/Hypergradient_variants.

submitted by /u/harshalmittal4
[link] [comments]

[P] I scraped career statistics of Tour de France starters and used an LSTM network to predict their end of career

Written on September 28, 2019. Posted in Reddit MachineLearning.

The notebooks can be found here: https://github.com/quickcoffee/TDF-Starters-RUL

Let me know what you think, it’s my first project with web scraping and neural networks 🙂

submitted by /u/Gebo_vending
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Category: Reddit MachineLearning

[P] Comparing 7 Deep Dependency parsing models using Tensorflow

Discussion

[D] The problem with anthropomorphizing AI

[R] Class Feature Pyramids for Video Explanation

Abstract

[R] Neural Ordinary Differential Equations: the current landscape

[N] UC Berkeley’s CS 285: Deep Reinforcement Learning

[D] First time to NeurIPS

[R] On Model Stability as a Function of Random Seed

[P] Using Machine Learning to detect handguns being held by a person, not in holster – SmarterEveryDay [video 12:55]

We built a gun detector using machine learning that works with existing surveillance cameras.

[Project] Implementing Improvements to Hypergradient optimizers

[P] I scraped career statistics of Tour de France starters and used an LSTM network to predict their end of career