Category: Reddit MachineLearning

[D] consequences of converting .tiff to some tf.data supported format in terms of information loss?

Written on September 4, 2019. Posted in Reddit MachineLearning.

Hi, I’m currently working with a dataset of .tiff files and want to feed those to a model using a tf Dataset for performance reasons. However, tf currently does not support loading .tiff files with Dataset.

Now I’m curious how to assess the loss in information if I convert a tiff to e.g. a png. Currently theses tiffs only hold a single image which for me should not make much a difference.

What would be a good approach to assess this?

submitted by /u/turnmanh
[link] [comments]

[D] I just found the earliest description of the GAN idea – in the context of genetic algorithms

Written on September 4, 2019. Posted in Reddit MachineLearning.

Danny Hillis “Co-evolving parasites improve simulated evolution as an optimization procedure “, 1990.

https://archive.org/details/06Kahle001316/page/n3

…there are two independent gene pools, each evolving according to the selection/mutation/recombination sequence outlined above. One population, the “hosts”, represents sorting networks, while the other population, the “parasites”, represents test cases. (These two populations might also be considered as “prey” and “predator”, since their evolution rates are comparable.) Both populations evolve on the same grid, and their interaction is through their fitness functions. The sorting networks are scored according to the test cases provided by the parasites at the same grid location. The parasites are scored according to how well they find flaws in sorting networks. Specifically, the phenotype of each parasite is a group of 10 to 20 test cases, and its score is the number of these tests that the corresponding sorting network fails to pass. The fitness functions of the host sorting networks and the parasitic sets of test patterns are complementary in the sense that a success of the sorting network represents a failure of the test pattern and vice versa.

submitted by /u/p1esk
[link] [comments]

[R] STEGASURAS: STEGanography via Arithmetic coding and Strong neURAl modelS

Written on September 4, 2019. Posted in Reddit MachineLearning.

Online demo

arXiv link

Code

We recently released our demo for our EMNLP paper “Neural Linguistic Steganography”, hiding secret messages in natural language via arithmetic coding and GPT-2. Arithmetic coding is a powerful entropy coding technique that is optimal for random sequences. Using arithmetic coding in reverse enables extremely efficient steganography, and when combined with modern language models like GPT-2 it allows for convincing cover text generations that encode information.

submitted by /u/kcazyz
[link] [comments]

[P] Comparing 11 Speech-to-Text models using Tensorflow

Written on September 4, 2019. Posted in Reddit MachineLearning.

Here I compare 11 Speech-to-Text models using Tensorflow, 100% jupyter notebook and simplify. Accuracy based on character position.

80% of the dataset to train, 20% of the dataset to test.

Tacotron, test accuracy 77.09%
BiRNN LSTM, test accuracy 84.66%
BiRNN Seq2Seq + Luong Attention + Cross Entropy, test accuracy 87.86%
BiRNN Seq2Seq + Bahdanau Attention + Cross Entropy, test accuracy 89.28%
BiRNN Seq2Seq + Bahdanau Attention + CTC, test accuracy 86.35%
BiRNN Seq2Seq + Luong Attention + CTC, test accuracy 80.30%
CNN RNN + Bahdanau Attention, test accuracy 80.23%
Dilated CNN RNN, test accuracy 31.60%
Wavenet, test accuracy 75.11%
Deep Speech 2, test accuracy 81.40%
Wav2Vec Transfer learning BiRNN LSTM, test accuracy 83.24%

Link to repository, https://github.com/huseinzol05/NLP-Models-Tensorflow#speech-to-text

Link to dataset, https://tspace.library.utoronto.ca/handle/1807/24487, also included a notebook how to download the dataset.

Discussion

Dataset is not that really big, only 286MB.
Transfer learning Wav2Vec accuracy is not that high, maybe need more dataset.
I use my own hyperparameters for Wav2Vec, use original hyperparameters caused my GPU sync problem, sequence is too long.
I need to use bigger dataset.

submitted by /u/huseinzol05
[link] [comments]

[Discussion] Category-theoretic approach to machine learning

Written on September 4, 2019. Posted in Reddit MachineLearning.

I’d like to start a thread about a small surge of recent papers studying machine learning from the perspective of functional programming/category theory. Plenty of interesting things are happening that most people don’t seem to be aware of, unless they are in these tight circles!

Category theory is a very general and rigorous mathematical theory of compositionality that seems have become a powerful unifying force in all the mathematics and very recently sciences. Its main concerns are those alike to in deep learning: finding compositional structure in data, such that the created abstractions are non-leaky and as general as possible.

Alongside the many of the papers I linked below, the Symposium on Compositional Structures that’s happening this week has two talks on abstract mathematical generalizations of machine learning.

Note: unlike most ML papers which are focused on experiments, almost all of these are biased heavily on theory and disentangling of some of the existing structure, rather than providing new ad-hoc design choices in neural network architectures. They don’t have a SOTA result or any immediate benefit you can implement right now, but are more focused on the long term understanding of the relevant structures underlying neural networks.

I’ve compiled a list of these papers below. To me all these things are cool and I thought it might be useful for people to see these new approaches, as they might show us a shape of things to come.

Backprop As Functor

The simple essence of automatic differentiation

Lenses and Learners

Compositional Deep Learning

Generalized convolution and efficient language recognition

Towards formalizing and extending differential programming using tangent categories

Learning as change propagation with delta lenses

From open learners to open games

EDIT: Disclaimer: I am the author of the fourth paper “Compositional deep learning”

submitted by /u/totallynotAGI
[link] [comments]

[D] Deep Learning Meme Review

Written on September 4, 2019. Posted in Reddit MachineLearning.

Just a bit of poking fun at ML culture.

https://youtu.be/MIEA8azwu1k

submitted by /u/ykilcher
[link] [comments]

[P] Agent Learns to Park a Car using Unity ML-Agents / Deep Reinforcement Learning (PPO)

Written on September 4, 2019. Posted in Reddit MachineLearning.

Video available here (YouTube): https://youtu.be/VMp6pq6_QjI

Context:

An AI learns to park a car in a parking lot in a 3D physics simulation. The simulation was implemented using Unity’s ML-Agents framework (https://unity3d.com/machine-learning). The AI consists of a deep Neural Network with 3 hidden layers of 128 neurons each. It is trained with the Proximal Policy Optimization (PPO) algorithm.

The input of the Neural Network are the readings of eight depth sensors, the cars current speed and position, as well as its relative position to the target. The outputs of the Neural Network are interpreted as engine force, braking force and turning force (continuous values). These outputs can be seen at the top right corner of the zoomed out camera shots.

The AI starts off with random behaviour, i.e. the Neural Network is initialized with random weights. It then gradually learns to solve the task by reacting to environment feedback accordingly.

The AI is rewarded with small positive signals for getting closer to the parking spot, which is outlined in red, and gets a larger reward when it actually reaches the parking spot and stops there. The final reward for reaching the parking spot is dependent on how parallel the car stops in relation to the actual parking position. If the car stops in a 90° angle to the actual parking direction for instance, the AI will only be rewarded a very small amount, relative to the amount it would get for stopping completely parallel to the actual direction. The AI is penalized with a negative reward signal, when it either drives further away from the parking spot or if it crashes into any obstacles.

https://i.redd.it/cul3s0nanrk31.png

submitted by /u/SamuelArzt
[link] [comments]

[N] Awesome Artificial Intelligence Research and Projects on Computer Vision News (with codes!) September 2019

Written on September 4, 2019. Posted in Reddit MachineLearning.

The September issue of Computer Vision News: 34 pages about AI and Deep Learning through both research and practical applications.

HTML5 version (recommended)

PDF version

Technical articles on pages 4-7 and 16-21. Subscribe for free on page 34.

https://i.redd.it/bu2i2gc1krk31.jpg

submitted by /u/Gletta
[link] [comments]

[P] Content Update in NLP Tutorial repo : Text Classification on HuffPost news article

Written on September 4, 2019. Posted in Reddit MachineLearning.

Content Update in PyTorch NLP Tutorial repo.

– Text Classification, with simple annotation.

Dataset: HuffPost news corpus including corresponding category.
Pre-trained word vectors: How pre-trained word representations affect model performance (via ablation study)

The model trained on this dataset identify the category of news article based on their headlines and descriptions.

link : https://github.com/lyeoni/nlp-tutorial/tree/master/news-category-classifcation

submitted by /u/lyeoni
[link] [comments]

[D] What does your ML pipeline look like?

Written on September 4, 2019. Posted in Reddit MachineLearning.

Hi everyone! I found this post on HN and would like to know your opinion/professional experience.

“Experienced machine learning professionals – How do you create scalable, deployable and reproducible data/ML pipelines at your work?”

submitted by /u/pirate7777777
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Category: Reddit MachineLearning

[D] consequences of converting .tiff to some tf.data supported format in terms of information loss?

[D] I just found the earliest description of the GAN idea – in the context of genetic algorithms

[R] STEGASURAS: STEGanography via Arithmetic coding and Strong neURAl modelS

[P] Comparing 11 Speech-to-Text models using Tensorflow

Discussion

[Discussion] Category-theoretic approach to machine learning

[D] Deep Learning Meme Review

[P] Agent Learns to Park a Car using Unity ML-Agents / Deep Reinforcement Learning (PPO)

[N] Awesome Artificial Intelligence Research and Projects on Computer Vision News (with codes!) September 2019

[P] Content Update in NLP Tutorial repo : Text Classification on HuffPost news article

[D] What does your ML pipeline look like?