Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

Author: torontoai

[D] Model to predict site outage on Telecommunication Networks

Context: Telecommunication Networks are complex multi-vendor/technology (Huawei, Ericsson, Nokia, …) which NOC (Network Operation Center) continually monitor and respond to faults/failures on equipment & sites usually concentrating all alarms from different vendors.

Challenge: Use historical data to predict if a site will go down on the next hour (or so) based on the events their NMS (Network Management Software) are collecting from all elements – this is a time window-based analysis that chances all the time.

Anyone on the community that have faced this (or a similar problem) would like to share any thoughts / papers / links to solution/model/approach ?

submitted by /u/macacochimpa
[link] [comments]

[Research] Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)

Hi, I am one of the authors of this EMNLP 2019 paper.

We create Universal Adversarial Triggers:

Phrases that cause a specific model prediction when concatenated to 𝘢𝘯𝘺 input.

Triggers cause:

– GPT-2 to turn racist

– SQuAD models to predict “to kill american people” for 72% of “why” questions

– Text classifier accuracy 90%->1%.

Paper: https://arxiv.org/abs/1908.07125

Twitter: https://twitter.com/Eric_Wallace_/status/1168907518623571974

Blog: http://ericswallace.com/triggers

submitted by /u/Eric_WallaceUMD
[link] [comments]

[D] Classifying high dimensional sequences using traditional machine learning methods

I’m a beginner in machine learning and I want to classify high dimensional sequences without using an LSTM or other deep learning methods, but using something simpler, to learn more about feature extraction.

I’m trying gesture recognition using OpenPose, with a couple of gestures (wave, swipe left, swipe right, circle) of varying length. Because of OpenPose, I have 67 features (body and hands) times 3 (x, y, c). The sequences go from 4 to 43 frames, so there is a lot of variation there.

First of all, how do I google this? Time series classification is all about long, usually one dimensional sequences, while these are shorter and highly multi dimensional sequences. I don’t know if this has been done before without deep learning?

Second of all, here is what I tried:

  • Padding and clipping the sequences to the same length, then using an RBF SVC from sklearn.
  • Extracting mean, median, std, max, min, and other statistics to obtain a fixed representation
  • Performing Fourier transformations per keypoint/feature pair, so 67*3 transforms

All three of these overfit a lot (100 to 50 train/test accuracy), the Fourier one the most because of the high amount of features. PCA does not help a lot. Are Fourier transformations even useful here because there is no real periodicity in the gestures and the transformations look weird (normally you have these peaks in frequencies, but for me the Fourier transform just looks like a choppy sine wave for most cases)

I looked at DTW but I have way too much features???

How do I classify OpenPose sequences using machine learning and not deep learning? I can’t find a lot of info on google

submitted by /u/RaptorDotCpp
[link] [comments]

[D] When Is It Better To Keep The Algorithm To Yourself?

Crosspost r/datascience

Suppose you’re working on a machine learning/coding contest and through your own research come up with a technique that is say, 4% better than the best thing anyone has tried (just pulling numbers out of the air here).

At what point is it better to not claim the prize on said contest and just keep the method secret? At what point should you publish it? Does it ever make sense to just use it in your own capacity analyzing data for companies as a 3rd party?

I mean I’m sure it’s all dependent on the money involved but one has to wonder where the breaking point is. What can you make as an independent 3rd party willing to do analysis with proprietary software you aren’t releasing?

In such a case, how could you ever provide confidence enough that the methods work? Also, how would you bill if essentially the time spent is mainly runtime? I’d love to hear any speculation, stories, industry standard behavior or history around this sort of thing.

submitted by /u/mystikaldanger
[link] [comments]

[D] Machine learning, alchemy or real science?

A while ago I’ve seen a video of a talk by Ali Rahimi. It’s worth the 20 minutes for people who haven’t seen it!

I recently rewatched it and tried to look around if I could find papers which go into the direction of focusing more onto smaller ML experiments to build fundamental knowledge. But I’ve not found much aside from stuff on interpretability.

As a side note, I’m not an ML researcher but a physicist who uses ML in various aspects for work. I’d not consider myself an expert on any ML specific field but have an interest and try to keep up to date on things. From my personal experience, I’ve certainly felt (not sure how to describe it but let’s go with annoyed/demotivated) to use ML when it’s not clear to me why certain things work.

So a lot of times it seems more like Alchemy, in that I mix and match certain things until I get a result I like.

But why does it work? No clue.

Will it work for the next problem that is sufficiently different? No clue!

I’d be very interested to hear what other people think or how they feel about this topic.

Also if you know of any paper seeking to build a more fundamental understanding, I’d appreciate a link 🙂

submitted by /u/oucp
[link] [comments]

Frontera’s New Frontier: Fastest Academic Supercomputer Wields NVIDIA GPUs for Science Research

Everything’s bigger in Texas — supercomputers included.

The Texas Advanced Computing Center today launched Frontera, the most powerful academic supercomputer in the world, now featuring two subsystems powered by some 800 NVIDIA GPUs.

Frontera will leverage the AI, high performance computing and data analytics capabilities of NVIDIA Tensor Core GPUs to enable powerful scientific simulation and accelerate research areas including drug discovery, astrophysics and natural hazards modeling.

Housed at The University of Texas at Austin, Frontera ranked fifth on the most recent TOP500 list of fastest supercomputers, achieving 23.5 petaflops on the High-Performance Linpack benchmark and 38.75 petaflops of peak double-precision performance. The new GPU subsystems add a further 11 petaflops of peak single-precision performance for researchers.

NVIDIA GPUs power more than 100 systems on the TOP500 list, including half the top 10 incuding Summit, the world’s fastest supercomputer.

“With Frontera, the key is time to solution. That’s what we’re here for — to solve the biggest problems in science and engineering,” said Niall Gaffney, the center’s director of data-intensive computing.

One of the new subsystems features a cluster of 360 NVIDIA Tensor Core GPUs, liquid-cooled in racks developed by GRC, which specializes in immersion cooling for data centers. Another, built by IBM and named Longhorn, consists of 448 NVIDIA Tensor Core GPUs. Purpose-built with mixed-precision capabilities, these powerful GPUs provide scientists the flexibility to accelerate a variety of AI, simulation and data analysis workloads.

More than three dozen research teams have been using Frontera since the system began supporting science applications in June. The supercomputer was funded by a $60 million award from the National Science Foundation.

Over its lifetime, Frontera and its GPU subsystems will be used for hundreds of applications by thousands of researchers from academic institutions around the world.

From Molecular to Supermassive, Accelerating Science Research

High performance computing systems help researchers rapidly analyze data and run experiments and simulations. GPU acceleration enables faster iteration, cutting down the time it takes for scientists to achieve breakthroughs that can improve human health, broaden our understanding of the universe, and inform how we use materials and energy resources.

“Techniques like machine learning and AI are becoming more and more important for researchers doing large-scale compute,” Gaffney said. “GPU environments allow scientists to take advantage of acceleration for a wide array of applications.”

Initial projects benefiting from the powerful NVIDIA GPU-accelerated Frontera subsystems include:

  • Astronomy insights: In the field of astrophysics, researchers often work with datasets 100 terabytes in size or more. GPU acceleration and AI enables them to separate signal from noise in these massive datasets, run large-scale simulations of the universe and better understand phenomena like neutron star collision.
  • Medical breakthroughs: Deep learning tools are used in the field of medical imaging to help doctors more quickly identify diseases and abnormalities, like spotting glioblastoma tumors from brain scans. With supercomputing resources, developers can create more complex models to improve the accuracy of cancer diagnosis.
  • Drug discovery: Identifying promising molecular compounds for drug candidates is computationally demanding, time-consuming and expensive. Researchers can leverage GPU-accelerated systems for faster simulations of protein folding, helping narrow down candidates to test in a wet laboratory.
  • Smart city planning: Cities collect vast quantities of data that can be analyzed for smarter urban planning. With an AI model that can analyze visual data from traffic pole cameras, cities can identify congested areas and better address safety concerns like dangerous intersections.
  • Understanding Earth: In weather modeling and in energy research, scientists depend on high-fidelity simulations to analyze the interaction of complex natural systems. Researchers can use AI to better predict weather events and earthquakes, inform precision agriculture projects and explore potential energy sources such as nuclear fusion.

Learn more about how NVIDIA GPUs power the world’s top supercomputers.

The post Frontera’s New Frontier: Fastest Academic Supercomputer Wields NVIDIA GPUs for Science Research appeared first on The Official NVIDIA Blog.