Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

Author: torontoai

Tick Identification to Combat Lyme Disease

Photo credit: Jim Gathany

By Ian Gormely

Toronto – Today, the Vector Institute, an independent, not-for-profit research institute focused on leading-edge machine learning, announced the third of its series of Pathfinder Projects to implement artificial intelligence (AI) in the health sector.

The third Pathfinder Project, performed in partnership with Public Health Ontario (PHO), will classify tick species using computer vision. Blacklegged ticks are the only ticks in Ontario known to carry B. burgdorferi, the bacteria that causes Lyme disease. Not all blacklegged ticks carry B. burgdorferi, but a bite from one is of more concern than a bite from a dog tick or another tick species that doesn’t carry the bacteria. For this project, Vector’s technical AI staff scientist Dr. Elham Dolatabadi, Dr. Vanessa Allen, Chief of Microbiology, PHO and Dr. Samir Patel, clinical microbiologist at PHO, will develop a method to automatically identify tick species using computer vision.

The first deliverable will be an AI algorithm that professionals at PHO will use to identify whether or not a tick is a blacklegged tick. The long-term goal is to create an app that anyone can use to simply take a photo of a tick. Once the app identifies the species, it will provide advice.

“The app we want to build would empower the public,” says Dr. Patel. PHO receives around 10,000 ticks each year for identification. Currently, the PHO laboratory has to identify each individual tick that is submitted. “Manually identifying and reporting each tick back to the submitter can take up to three weeks,” he says. The process can be automated using machine learning approaches so it is faster at PHO in the short term. “Once the app is developed the process will be even faster because the app can tell you right away whether or not it is a blacklegged tick and infer the risk of contracting Lyme disease.” The rapid identification of the blacklegged ticks will allow individuals to determine whether or not they should seek medical attention within the recommended 72 hours of tick removal.

Pathfinder Projects are small-scale efforts designed to produce results in 12 to 18 months that guide future research and technology adoption. With technical and resource support from the Vector Institute, the projects each bring together a multidisciplinary research team to tackle an important health care problem or opportunity using machine learning and AI more broadly. Each project was chosen for its potential to help identify a “path” through which world-class machine learning research can be translated into widespread benefits for patients.

About the Vector Institute

The Vector Institute is an independent, not-for-profit corporation dedicated to advancing artificial intelligence, excelling in machine and deep learning. The Vector Institute’s vision is to drive excellence and leadership in Canada’s knowledge, creation, and use of AI to foster economic growth and improve the lives of Canadians.

The Vector Institute is funded by the Province of Ontario, the Government of Canada through the Pan-Canadian AI Strategy administered by CIFAR, and industry sponsors from across the Canadian economy.

Tick Identification

Ticks, and the threat of Lyme disease, have become a regular feature of venturing outdoors in the summer months. For many Canadians, a thorough check for the tiny insects, which feed on our blood, is par for the course when returning from a hike or camping trip. Yet, only certain tick species actually carry the bacteria that causes Lyme disease. The challenge for most Ontarians is correctly identifying the type of tick that has decided to make you its lunch.

“Half of the ticks in Ontario are dog ticks,” explains Dr. Samir Patel, clinical microbiologist with Public Health Ontario (PHO). “They don’t carry the bacteria that causes Lyme disease.” However, blacklegged ticks are capable of carrying and transmitting that bacteria, with the risk of infection higher in certain parts of the province than others. Anyone who finds one on their body should consult a doctor.

PHO receives over 10,000 tick submissions every year — ticks sent to their laboratory site in Sault Ste. Marie — from Ontarians looking for guidance around a potential tick bite. Currently the laboratory has to manually identify each bug, a process that can take up to three weeks.

To ensure rapid and streamlined medical assessment of high risk tick bites, as well as reduce individuals’ anxiety about the potential Lyme disease after a tick bite, PHO is developing a mobile app to rapidly and accurately identify tick species and provide next-steps medical guidance. “There’s currently a gap in care,” admits Dr. Vanessa Allen, Chief of Medical Microbiology at PHO, “and this is one way to close that gap and improve the care and the delivery of services for Lyme disease in Ontario and beyond.”

Along with Dr. Allen and Vector’s technical AI staff scientist, Dr. Elham Dolatabadi, Dr. Patel is currently developing a computer vision model to differentiate between the two common tick species normally found in Ontario. “In the short-term we look forward to using computer vision for blacklegged tick identification at PHO.” he says. “Once the app is developed, it will empower the public. If you find a tick on your body, the app will be able to tell you right away whether it is a blacklegged tick or not.”

Tick populations have been increasing in recent years, as has awareness around tick bites and the threat of Lyme disease, says Dr. Allen. But both she and Dr. Patel caution that a tick bite, even from a blacklegged tick, doesn’t automatically mean that a person will contract Lyme disease and where appropriate, a single dose of prophylaxis should mitigate the chances of infection.

PHO plans to have the app available to the public by the end of next year. They also hope to use data from the user-submitted photos to help track tick populations in the province and better understand where ticks are moving, which can help inform future strategies, says Dr. Allen. “It’s not a magic bullet, but it’s a tool to speed up the process of both patient care and our understanding of Lyme disease.”

[R] Neural Oblivious Decision Ensembles

[R] Neural Oblivious Decision Ensembles

TL;DR: authors propose a DenseNet-like ensemble of decision trees, trained end-to-end by backpropagation and beats both xgboost and neural networks on heterogeneous (“tabular”) data.

(IMHO) unlike all other “neural decision tree” methods this one worked out of the box for production scale problems without heavy wizardry.

Differentiable decision tree (figure 1 from arxiv paper)

ArXiv: https://arxiv.org/abs/1909.06312

Source code: https://github.com/Qwicen/node

Abstract:

Nowadays, deep neural networks (DNNs) have become the main instrument for machine learning tasks within a wide range of domains, including vision, NLP, and speech. Meanwhile, in an important case of heterogenous tabular data, the advantage of DNNs over shallow counterparts remains questionable. In particular, there is no sufficient evidence that deep learning machinery allows constructing methods that outperform gradient boosting decision trees (GBDT), which are often the top choice for tabular problems. In this paper, we introduce Neural Oblivious Decision Ensembles (NODE), a new deep learning architecture, designed to work with any tabular data. In a nutshell, the proposed NODE architecture generalizes ensembles of oblivious decision trees, but benefits from both end-to-end gradient-based optimization and the power of multi-layer hierarchical representation learning. With an extensive experimental comparison to the leading GBDT packages on a large number of tabular datasets, we demonstrate the advantage of the proposed NODE architecture, which outperforms the competitors on most of the tasks. We open-source the PyTorch implementation of NODE and believe that it will become a universal framework for machine learning on tabular data.

submitted by /u/justheuristic
[link] [comments]

[P] Set up the CTRL text-generating model on Google Compute Engine with just a few console commands.

Over the weekend I posted a Twitter thread of my experience with CTRL, which received a lot of attention. Today, I’m releasing a script + guides to set up the CTRL text-generating model on Google Compute Engine with just a few console commands: https://github.com/minimaxir/ctrl-gce

I also added a few more generation examples + usability guides.

Let me know how it works for you!

submitted by /u/minimaxir
[link] [comments]

[D] Proximal Policy Optimization in keras (Actor-Critic Method)

This article is written by Chintan Trivedi. Proximal Policy Optimization aka PPO was released by OpenAI in 2017. It is considered as the state-of-the-art algorithm in reinforcement learning. The USP of this article is its simplistic explanations and coding of PPO as well as the accompanying videos. The author also released the code in his github page.

https://towardsdatascience.com/proximal-policy-optimization-tutorial-part-1-actor-critic-method-d53f9afffbf6

submitted by /u/begooboi
[link] [comments]

[D] Convolutional Neural Network with Numpy input Help G

Good evening good sir

I am a newbie with ML, i found this code from fellow redditor parasdahal.

I can make it run with mnist datasets.

Have few questions, i’d really really really appreciate it if someone can help me

https://github.com/iqbaalmuhmd/CNNnumpy

  1. Is it possible to run only test, with our own image input. Or just random image from mnist dataset
  2. What should i pickle to save the parameters?

submitted by /u/youshouldknowsz
[link] [comments]

Google at Interspeech 2019

This week, Graz, Austria hosts the 20th Annual Conference of the International Speech Communication Association (Interspeech 2019), one of the world‘s most extensive conferences on the research and engineering for spoken language processing. Over 2,000 experts in speech-related research fields gather to take part in oral presentations and poster sessions and to collaborate with streamed events across the globe.

As a Gold Sponsor of Interspeech 2019, we are excited to present 30 research publications, and demonstrate some of the impact speech technology has made in our products, from accessible, automatic video captioning to a more robust, reliable Google Assistant. If you’re attending Interspeech 2019, we hope that you’ll stop by the Google booth to meet our researchers and discuss projects and opportunities at Google that go into solving interesting problems for billions of people. Our researchers will also be on hand to discuss Google Cloud Text-to-Speech and Speech-to-text, demo Parrotron, and more. You can also learn more about the Google research being presented at Interspeech 2019 below (Google affiliations in blue).

Organizing Committee includes:
Michiel Bacchiani

Technical Program Committee includes:
Tara Sainath

Tutorials
Neural Machine Translation
Organizers include: Wolfgang Macherey, Yuan Cao

Accepted Publications
Building Large-Vocabulary ASR Systems for Languages Without Any Audio Training Data (link to appear soon)
Manasa Prasad, Daan van Esch, Sandy Ritchie, Jonas Fromseier Mortensen

Multi-Microphone Adaptive Noise Cancellation for Robust Hotword Detection (link to appear soon)
Yiteng Huang, Turaj Shabestary, Alexander Gruenstein, Li Wan

Direct Speech-to-Speech Translation with a Sequence-to-Sequence Model
Ye Jia, Ron Weiss, Fadi Biadsy, Wolfgang Macherey, Melvin Johnson, Zhifeng Chen, Yonghui Wu

Improving Keyword Spotting and Language Identification via Neural Architecture Search at Scale (link to appear soon)
Hanna Mazzawi, Javier Gonzalvo, Aleks Kracun, Prashant Sridhar, Niranjan Subrahmanya, Ignacio Lopez Moreno, Hyun Jin Park, Patrick Violette

Shallow-Fusion End-to-End Contextual Biasing (link to appear soon)
Ding Zhao, Tara Sainath, David Rybach, Pat Rondon, Deepti Bhatia, Bo Li, Ruoming Pang

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Quan Wang, Hannah Muckenhirn, Kevin Wilson, Prashant Sridhar, Zelin Wu, John Hershey, Rif Saurous, Ron Weiss, Ye Jia, Ignacio Lopez Moreno

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel Park, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin Dogus Cubuk, Quoc Le

Two-Pass End-to-End Speech Recognition
Ruoming Pang, Tara Sainath, David Rybach, Yanzhang He, Rohit Prabhavalkar, Mirko Visontai, Qiao Liang, Trevor Strohman, Yonghui Wu, Ian McGraw, Chung-Cheng Chiu

On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
Kazuki Irie, Rohit Prabhavalkar, Anjuli Kannan, Antoine Bruguier, David Rybach, Patrick Nguyen

Contextual Recovery of Out-of-Lattice Named Entities in Automatic Speech Recognition (link to appear soon)
Jack Serrino, Leonid Velikovich, Petar Aleksic, Cyril Allauzen

Joint Speech Recognition and Speaker Diarization via Sequence Transduction
Laurent El Shafey, Hagen Soltau, Izhak Shafran

Personalizing ASR for Dysarthric and Accented Speech with Limited Data
Joel Shor, Dotan Emanuel, Oran Lang, Omry Tuval, Michael Brenner, Julie Cattiau, Fernando Vieira, Maeve McNally, Taylor Charbonneau, Melissa Nollstadt, Avinatan Hassidim, Yossi Matias

An Investigation Into On-Device Personalization of End-to-End Automatic Speech Recognition Models (link to appear soon)
Khe Chai Sim, Petr Zadrazil, Francoise Beaufays

Salient Speech Representations Based on Cloned Networks
Bastiaan Kleijn, Felicia Lim, Michael Chinen, Jan Skoglund

Cross-Lingual Consistency of Phonological Features: An Empirical Study (link to appear soon)
Cibu Johny, Alexander Gutkin, Martin Jansche

LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
Heiga Zen, Viet Dang, Robert Clark, Yu Zhang, Ron Weiss, Ye Jia, Zhifeng Chen, Yonghui Wu

Improving Performance of End-to-End ASR on Numeric Sequences
Cal Peyser, Hao Zhang, Tara Sainath, Zelin Wu

Developing Pronunciation Models in New Languages Faster by Exploiting Common Grapheme-to-Phoneme Correspondences Across Languages (link to appear soon)
Harry Bleyan, Sandy Ritchie, Jonas Fromseier Mortensen, Daan van Esch

Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Ke Hu, Antoine Bruguier, Tara Sainath, Rohit Prabhavalkar, Golan Pundak

Fréchet Audio Distance: A Reference-free Metric for Evaluating Music Enhancement Algorithms
Kevin Kilgour, Mauricio Zuluaga, Dominik Roblek, Matthew Sharifi

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Yu Zhang, Ron Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, RJ Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran

Sampling from Stochastic Finite Automata with Applications to CTC Decoding
Martin Jansche, Alexander Gutkin

Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model (link to appear soon)
Anjuli Kannan, Arindrima Datta, Tara Sainath, Eugene Weinstein, Bhuvana Ramabhadran, Yonghui Wu, Ankur Bapna, Zhifeng Chen, SeungJi Lee

A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet
Jean-Marc Valin, Jan Skoglund

Low-Dimensional Bottleneck Features for On-Device Continuous Speech Recognition
David Ramsay, Kevin Kilgour, Dominik Roblek, Matthew Sharif

Unified Verbalization for Speech Recognition & Synthesis Across Languages (link to appear soon)
Sandy Ritchie, Richard Sproat, Kyle Gorman, Daan van Esch, Christian Schallhart, Nikos Bampounis, Benoit Brard, Jonas Mortensen, Amelia Holt, Eoin Mahon

Better Morphology Prediction for Better Speech Systems (link to appear soon)
Dravyansh Sharma, Melissa Wilson, Antoine Bruguier

Dual Encoder Classifier Models as Constraints in Neural Text Normalization
Ajda Gokcen, Hao Zhang, Richard Sproat

Large-Scale Visual Speech Recognition
Brendan Shillingford, Yannis Assael, Matthew Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, Hasim Sak, Kanishka Rao, Lorrayne Bennett, Marie Mulville, Ben Coppin, Ben Laurie, Andrew Senior, Nando de Freitas

Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
Fadi Biadsy, Ron Weiss, Pedro Moreno, Dimitri Kanevsky, Ye Jia

[P] SpeedTorch. 4x faster pinned CPU -> GPU data transfer than Pytorch pinned CPU tensors, and 110x faster GPU -> CPU transfer. Augment parameter size by hosting on CPU. Use non sparse optimizers (Adadelta, Adamax, RMSprop, Rprop, etc.) for sparse training (word2vec, node2vec, GloVe, NCF, etc.).

https://i.imgur.com/wr4VaUV.png

https://github.com/Santosh-Gupta/SpeedTorch

This is library I made for Pytorch, for fast transfer between pinned CPU tensors and GPU pytorch variables. The inspiration came from needed to train large number of embeddings, which don’t all fit on GPU ram at a desired embedding size, so I needed a faster CPU <-> GPU transfer method. This also allows using any optimizer for sparse training, since every embedding contained in the Pytorch embedding variable receives an update, previously only Pytorch’s SGD, Adagrad, and SparseAdam were suitable for such training.

In addition to augmenting parameter sizes, you can use to increase the speed of which data on your CPU is transferred to Pytorch Cuda variables.

Also, SpeedTorch’s GPU tensors are also overall faster then Pytorch cuda tensors, when taking into account both transferring two and from (overall 2.6x faster). For just transfering to a Pytorch Cuda, Pytorch is still faster, but significantly slower when transfering from a Pytorch Cuda variable.

I have personally used this to nearly double the embedding size of embeddings in two other projects, by holding half the parameters on CPU. The training speed is decent thanks to the fast CPU<->GPU exchange.

https://github.com/Santosh-Gupta/Research2Vec2

https://github.com/Santosh-Gupta/lit2vec2

There’s a bit of a learning curve for the very first time getting started with it, so as soon as you run into any sort of friction, feel free to ask a question on the project gitter

https://gitter.im/SpeedTorch

And I’ll answer them.

https://i.imgur.com/6o8C1BP.gif

submitted by /u/BatmantoshReturns
[link] [comments]