Author: torontoai

[P] pytorch-fuzzdom: Write browser tests without DOM specifics

Written on December 19, 2019. Posted in Reddit MachineLearning.

The goal of the project is to let developers write automated browser tests (e.g. with Selenium) that need no specific DOM knowledge of the target application. The idea is to reduce the need to rewrite these tests when the underlying DOM implementation changes. To accomplish this, an RL agent is trained on the Miniwob++ dataset to map user-specified actions to a series of DOM events. The agent is exposed through a familiar `ActionChains` interface that queues up actions to be performed in a browser.

One notable difference from prior approaches is the utilization of the DOM as graph data. This uses pytorch-geometric to represent both the state space and the available action space. This allows the agent to work with a variable number of nodes and actions.
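To make the "DOM as graph data" idea concrete, here is a minimal sketch using only the Python standard library. It is not the project's code: `DomGraphBuilder`, `dom_to_graph`, and `INTERACTIVE_TAGS` are hypothetical names, and the choice of which tags count as actions is an assumption. The edge list uses the two-row source/target layout that graph libraries such as pytorch-geometric commonly expect.

```python
from html.parser import HTMLParser

# Void elements have no closing tag, so they must never stay on the open-element stack.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}
# Assumption for illustration: elements of these tags are the candidate actions.
INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}


class DomGraphBuilder(HTMLParser):
    """Collects DOM elements as nodes and parent->child links as edges."""

    def __init__(self):
        super().__init__()
        self.nodes = []             # one feature dict per element
        self.edge_index = [[], []]  # [source ids, target ids]
        self._stack = []            # ids of currently open elements

    def handle_starttag(self, tag, attrs):
        node_id = len(self.nodes)
        self.nodes.append({"tag": tag, "attrs": dict(attrs)})
        if self._stack:
            self.edge_index[0].append(self._stack[-1])
            self.edge_index[1].append(node_id)
        if tag not in VOID_TAGS:
            self._stack.append(node_id)

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self._stack:
            self._stack.pop()


def dom_to_graph(html):
    """Returns (nodes, edge_index) for an HTML fragment."""
    builder = DomGraphBuilder()
    builder.feed(html)
    builder.close()
    return builder.nodes, builder.edge_index


nodes, edge_index = dom_to_graph('<div><input type="text"><button>Go</button></div>')
# The action space is just a subset of node ids, so it grows and shrinks with the page.
candidate_actions = [i for i, n in enumerate(nodes) if n["tag"] in INTERACTIVE_TAGS]
print(edge_index)         # [[0, 0], [1, 2]]
print(candidate_actions)  # [1, 2]
```

Because both the nodes and the candidate actions are plain variable-length lists, nothing in this representation fixes the page size in advance, which is the property the post highlights.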

Github: https://github.com/zbyte64/pytorch-fuzzdom
Current status: The model after a day of training should converge on ~10 of the 16 tasks. I am about to have less free time to work on this so I am making it public now.

submitted by /u/zbyte64

Software Engineer, Machine Learning – Geotab – Oakville, ON

Written on December 19, 2019. Posted in Toronto Job Postings.

Geotab is seeking a Software Engineer, Machine Learning, who can help us build production machine learning pipelines. Are you ready to join us?
From Geotab – Fri, 20 Dec 2019 19:51:55 GMT

An introduction to Azure FarmBeats at Microsoft Ignite 2019

Written on December 19, 2019. Posted in Microsoft.

The post An introduction to Azure FarmBeats at Microsoft Ignite 2019 appeared first on The AI Blog.

[D] Deep learning with Python and Keras – course is different from the book!!

Written on December 19, 2019. Posted in Reddit MachineLearning.

I did a course on Udemy called ‘deep learning with python and keras’ thinking it was a course version of the similarly named book by François Chollet (author of Keras). But it’s not!

It was stupid of me not to realize this earlier. The course is made by two other guys. Having finished the course, I think it’s average at best.

The worst part is that the examples they use in the course hardly support the theory. For example, they make a claim about the influence of batch size on accuracy and rate of learning. But when you run their example, it does not prove their point.
Some of the data they use is so low in volume and quality that minor changes in the test/train split or hyperparameters give huge improvements.
Another strange thing I noticed was that the test error would be lower than the training error in a lot of cases.

Overall I would strongly urge people to learn theory from one of the many other strong courses (like Andrew Ng’s) and, for Keras, buy the book by the author.

This course was made by two guys, both of whom have no ML experience other than teaching it through these online courses and bootcamps. Neither has an academic background or industry experience in ML. That's not to say that an academic background or industry experience is necessary for being an expert in ML, but if someone has neither, I’d feel a little skeptical.

submitted by /u/Correct-Mortgage

[P] vedaseg: A semantic segmentation toolbox in PyTorch

Written on December 19, 2019. Posted in Reddit MachineLearning.

Introduction

vedaseg is an open source semantic segmentation toolbox based on PyTorch.

Features

Modular Design
We decompose the semantic segmentation framework into different components. The flexible and extensible design makes it easy to implement a customized semantic segmentation project by combining different modules, like building with Lego.
Support of several popular frameworks
The toolbox supports several popular semantic segmentation frameworks out of the box, e.g. DeepLabv3+, DeepLabv3, UNet, PSPNet, FPN, etc.

submitted by /u/jackson_ditred

[D] ML on resource-constrained devices (MCUs)

Written on December 19, 2019. Posted in Reddit MachineLearning.

Do you think SVMs on 8-bit microcontrollers can be of help for ML on the edge? I can’t understand why everyone talks about ANNs on microcontrollers and nobody has thought about SVMs / Decision Trees / Random Forests… which should be much smaller in size. I wrote a couple of posts on the topic and would really appreciate any suggestions or comments on the subject.

https://eloquentarduino.github.io/2019/11/you-can-run-machine-learning-on-arduino/
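As a rough illustration of the size argument, the sketch below compares the storage of a linear-kernel SVM with that of a small fully connected network, assuming one byte per stored value (for example int8 quantization). The function names and the example layer sizes are hypothetical and are not taken from the linked posts. Note that this holds for the linear kernel only: a kernel SVM has to store its support vectors and can be much larger.

```python
def linear_svm_predict(weights, bias, features):
    """Returns +1 or -1 from the sign of w.x + b (linear-kernel SVM)."""
    score = bias
    for w, x in zip(weights, features):
        score += w * x
    return 1 if score >= 0 else -1


def linear_svm_bytes(n_features, bytes_per_value=1):
    """Storage for one weight per feature plus the bias."""
    return (n_features + 1) * bytes_per_value


def dense_net_bytes(layer_sizes, bytes_per_value=1):
    """Storage for a fully connected net: weights plus biases of every layer."""
    total = 0
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        total += (n_in * n_out + n_out) * bytes_per_value
    return total


# Inference is a single multiply-accumulate loop, which suits an 8-bit MCU.
label = linear_svm_predict([2, -1, 3], -4, [2, 1, 1])
svm_size = linear_svm_bytes(64)
net_size = dense_net_bytes([64, 16, 2])
print(label, svm_size, net_size)  # 1 65 1074
```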

submitted by /u/EloquentArduino

[D] Methods to handle streaming/real-time data storage, wrangling and prediction?

Written on December 19, 2019. Posted in Reddit MachineLearning.

Say that there is data being streamed into Python (Kafka, Kinesis, etc.) every 10 seconds that I would like to wrangle and predict on. What is the best way to store this streaming data in order to do this? In the past, I have used online learning methods for this. I am curious how to do it with a batch learning method.

I was thinking we could iteratively populate a DataFrame with this data until the stream stops, preprocess the entire DataFrame, predict, then clear/delete the DataFrame. One caveat of this method that I can think of is the scenario in which preprocessing and predicting take longer than 10 seconds.

What are some ways to handle this?
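One possible way to handle the caveat above is to decouple ingestion from prediction: the consumer loop only fills a buffer and hands full batches to a worker thread through a queue, so a slow prediction never blocks the stream. The sketch below uses only the standard library; `run_micro_batches` and `predict_fn` are hypothetical names, a plain list stands in for the DataFrame, and an in-memory iterable stands in for the Kafka/Kinesis consumer.

```python
import queue
import threading


def run_micro_batches(records, batch_size, predict_fn):
    """Buffers incoming records and predicts on batches in a worker thread."""
    batches = queue.Queue()
    results = []

    def worker():
        while True:
            batch = batches.get()
            if batch is None:      # sentinel: the stream has ended
                break
            results.append(predict_fn(batch))

    thread = threading.Thread(target=worker)
    thread.start()

    buffer = []
    for record in records:         # stands in for the stream consumer loop
        buffer.append(record)
        if len(buffer) == batch_size:
            batches.put(buffer)    # hand off; ingestion continues immediately
            buffer = []            # fresh buffer instead of clearing in place
    if buffer:
        batches.put(buffer)        # flush the final partial batch
    batches.put(None)
    thread.join()
    return results


# Toy run: "prediction" is just the sum of each batch.
batch_results = run_micro_batches(range(7), 3, sum)
print(batch_results)  # [3, 12, 6]
```

If prediction is consistently slower than the arrival rate, the queue will keep growing, so a real deployment would also need a bounded queue, more workers, or larger and less frequent batches.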

submitted by /u/Straighteight424

Can I use a Transformer for named-entity recognition? [Discussion]

Written on December 19, 2019. Posted in Reddit MachineLearning.

I want to use a Transformer for named-entity recognition.

The training source is a word sequence like “Lily want to company”.

The training target is a tag sequence like “Person O O Location”.

Can a Transformer network do this NER task?

NOTE: I want to use just a Transformer neural network, NOT BERT.
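Since the target has exactly one tag per source word, this is a sequence labeling problem, so a Transformer encoder with a per-token classification layer on top is a natural fit and no decoder is required. The sketch below covers only the data preparation for that setup, under assumptions: `build_index`, `encode_example`, the `<pad>` token, and the padding length are all hypothetical choices, and the model itself is left out.

```python
def build_index(sequences, specials=()):
    """Maps every distinct token (or tag) to an integer id, in first-seen order."""
    index = {s: i for i, s in enumerate(specials)}
    for seq in sequences:
        for item in seq:
            index.setdefault(item, len(index))
    return index


def encode_example(words, tags, word_index, tag_index, max_len):
    """Turns one sentence into padded id lists plus a mask marking real tokens."""
    if len(words) != len(tags):
        raise ValueError("NER needs exactly one tag per word")
    pad = max_len - len(words)
    word_ids = [word_index[w] for w in words] + [word_index["<pad>"]] * pad
    tag_ids = [tag_index[t] for t in tags] + [tag_index["<pad>"]] * pad
    mask = [1] * len(words) + [0] * pad  # the loss should ignore padded positions
    return word_ids, tag_ids, mask


words = "Lily want to company".split()
tags = "Person O O Location".split()
word_index = build_index([words], specials=("<pad>",))
tag_index = build_index([tags], specials=("<pad>",))
word_ids, tag_ids, mask = encode_example(words, tags, word_index, tag_index, max_len=6)
print(word_ids)  # [1, 2, 3, 4, 0, 0]
print(tag_ids)   # [1, 2, 2, 3, 0, 0]
print(mask)      # [1, 1, 1, 1, 0, 0]
```

An encoder would then map `word_ids` to one vector per position, and a linear layer over those vectors would predict `tag_ids`, with the mask excluding padding from the loss.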

submitted by /u/Hopeful-Aerie

[P] PySS3: A Python package implementing a novel text classifier with visualization tools for Explainable AI

Written on December 19, 2019. Posted in Reddit MachineLearning.

A recently created Python package that may be really helpful for those working on NLP or Text Mining problems. Enjoy!

Github: https://github.com/sergioburdisso/pyss3

Documentation: https://pyss3.readthedocs.io/en/latest/

Paper preprint: https://arxiv.org/abs/1912.09322

submitted by /u/sergbur

[D] A potential sneaky strategy to get your ICLR paper accepted [Not Recommended!]

Written on December 19, 2019. Posted in Reddit MachineLearning.

I went through some papers on ICLR OpenReview and noticed a possibly sneaky strategy. It looks like if a paper gets a heated discussion with harsh attacks from a pseudonymous commenter, the chair usually sides with the author and accepts the paper, even against the original reviewers’ scores. Wondering if any author has employed such a strategy? Quite sneaky!

submitted by /u/thntk
