Author: torontoai

[D] Model to predict site outage on Telecommunication Networks

Written on September 2, 2019. Posted in Reddit MachineLearning.

Context: Telecommunication Networks are complex multi-vendor/technology (Huawei, Ericsson, Nokia, …) which NOC (Network Operation Center) continually monitor and respond to faults/failures on equipment & sites usually concentrating all alarms from different vendors.

Challenge: Use historical data to predict if a site will go down on the next hour (or so) based on the events their NMS (Network Management Software) are collecting from all elements – this is a time window-based analysis that chances all the time.

Anyone on the community that have faced this (or a similar problem) would like to share any thoughts / papers / links to solution/model/approach ?

submitted by /u/macacochimpa
[link] [comments]

Data Science Solutions SME – DataRobot – Toronto, ON

Written on September 2, 2019. Posted in Toronto Job Postings.

Able to articulate high value use cases and value proposition for AI and machine learning in FinTech. Interact with the customers in the FinTech industry to…
From DataRobot – Tue, 03 Sep 2019 16:55:25 GMT – View all Toronto, ON jobs

[Research] Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)

Written on September 2, 2019. Posted in Reddit MachineLearning.

Hi, I am one of the authors of this EMNLP 2019 paper.

We create Universal Adversarial Triggers:

Phrases that cause a specific model prediction when concatenated to 𝘢𝘯𝘺 input.

Triggers cause:

– GPT-2 to turn racist

– SQuAD models to predict “to kill american people” for 72% of “why” questions

– Text classifier accuracy 90%->1%.

Paper: https://arxiv.org/abs/1908.07125

Twitter: https://twitter.com/Eric_Wallace_/status/1168907518623571974

Blog: http://ericswallace.com/triggers

submitted by /u/Eric_WallaceUMD
[link] [comments]

[D] NeurIPS 2019 decisions are out

Written on September 2, 2019. Posted in Reddit MachineLearning.

In CMT

submitted by /u/adamb90
[link] [comments]

[D] Classifying high dimensional sequences using traditional machine learning methods

Written on September 2, 2019. Posted in Reddit MachineLearning.

I’m a beginner in machine learning and I want to classify high dimensional sequences without using an LSTM or other deep learning methods, but using something simpler, to learn more about feature extraction.

I’m trying gesture recognition using OpenPose, with a couple of gestures (wave, swipe left, swipe right, circle) of varying length. Because of OpenPose, I have 67 features (body and hands) times 3 (x, y, c). The sequences go from 4 to 43 frames, so there is a lot of variation there.

First of all, how do I google this? Time series classification is all about long, usually one dimensional sequences, while these are shorter and highly multi dimensional sequences. I don’t know if this has been done before without deep learning?

Second of all, here is what I tried:

Padding and clipping the sequences to the same length, then using an RBF SVC from sklearn.
Extracting mean, median, std, max, min, and other statistics to obtain a fixed representation
Performing Fourier transformations per keypoint/feature pair, so 67*3 transforms

All three of these overfit a lot (100 to 50 train/test accuracy), the Fourier one the most because of the high amount of features. PCA does not help a lot. Are Fourier transformations even useful here because there is no real periodicity in the gestures and the transformations look weird (normally you have these peaks in frequencies, but for me the Fourier transform just looks like a choppy sine wave for most cases)

I looked at DTW but I have way too much features???

How do I classify OpenPose sequences using machine learning and not deep learning? I can’t find a lot of info on google

submitted by /u/RaptorDotCpp
[link] [comments]

[D] What annotation tool do you use for your image/video datasets?

Written on September 2, 2019. Posted in Reddit MachineLearning.

I have just written an article about a benchmark I made of labeling tools for computer vision.

https://blog.sicara.com/best-open-source-annotation-tools-in-computer-vision-4b9f6a18f911

Do you know other tools which I should talk about?

submitted by /u/lolo_sicara
[link] [comments]

[D] When Is It Better To Keep The Algorithm To Yourself?

Written on September 2, 2019. Posted in Reddit MachineLearning.

Crosspost r/datascience

Suppose you’re working on a machine learning/coding contest and through your own research come up with a technique that is say, 4% better than the best thing anyone has tried (just pulling numbers out of the air here).

At what point is it better to not claim the prize on said contest and just keep the method secret? At what point should you publish it? Does it ever make sense to just use it in your own capacity analyzing data for companies as a 3rd party?

I mean I’m sure it’s all dependent on the money involved but one has to wonder where the breaking point is. What can you make as an independent 3rd party willing to do analysis with proprietary software you aren’t releasing?

In such a case, how could you ever provide confidence enough that the methods work? Also, how would you bill if essentially the time spent is mainly runtime? I’d love to hear any speculation, stories, industry standard behavior or history around this sort of thing.

submitted by /u/mystikaldanger
[link] [comments]

[D] Machine learning, alchemy or real science?

Written on September 2, 2019. Posted in Reddit MachineLearning.

A while ago I’ve seen a video of a talk by Ali Rahimi. It’s worth the 20 minutes for people who haven’t seen it!

I recently rewatched it and tried to look around if I could find papers which go into the direction of focusing more onto smaller ML experiments to build fundamental knowledge. But I’ve not found much aside from stuff on interpretability.

As a side note, I’m not an ML researcher but a physicist who uses ML in various aspects for work. I’d not consider myself an expert on any ML specific field but have an interest and try to keep up to date on things. From my personal experience, I’ve certainly felt (not sure how to describe it but let’s go with annoyed/demotivated) to use ML when it’s not clear to me why certain things work.

So a lot of times it seems more like Alchemy, in that I mix and match certain things until I get a result I like.

But why does it work? No clue.

Will it work for the next problem that is sufficiently different? No clue!

I’d be very interested to hear what other people think or how they feel about this topic.

Also if you know of any paper seeking to build a more fundamental understanding, I’d appreciate a link 🙂

submitted by /u/oucp
[link] [comments]

[D] Library to extract sentiment from business news?

Written on September 2, 2019. Posted in Reddit MachineLearning.

I was looking to extract if any news related to a company was positive or negative from news using python.

Which libraries would work good for such a project?

submitted by /u/firewolf333
[link] [comments]

Frontera’s New Frontier: Fastest Academic Supercomputer Wields NVIDIA GPUs for Science Research

Written on September 2, 2019. Posted in NVIDIA.

Everything’s bigger in Texas — supercomputers included.

The Texas Advanced Computing Center today launched Frontera, the most powerful academic supercomputer in the world, now featuring two subsystems powered by some 800 NVIDIA GPUs.

Frontera will leverage the AI, high performance computing and data analytics capabilities of NVIDIA Tensor Core GPUs to enable powerful scientific simulation and accelerate research areas including drug discovery, astrophysics and natural hazards modeling.

Housed at The University of Texas at Austin, Frontera ranked fifth on the most recent TOP500 list of fastest supercomputers, achieving 23.5 petaflops on the High-Performance Linpack benchmark and 38.75 petaflops of peak double-precision performance. The new GPU subsystems add a further 11 petaflops of peak single-precision performance for researchers.

NVIDIA GPUs power more than 100 systems on the TOP500 list, including half the top 10 incuding Summit, the world’s fastest supercomputer.

“With Frontera, the key is time to solution. That’s what we’re here for — to solve the biggest problems in science and engineering,” said Niall Gaffney, the center’s director of data-intensive computing.

One of the new subsystems features a cluster of 360 NVIDIA Tensor Core GPUs, liquid-cooled in racks developed by GRC, which specializes in immersion cooling for data centers. Another, built by IBM and named Longhorn, consists of 448 NVIDIA Tensor Core GPUs. Purpose-built with mixed-precision capabilities, these powerful GPUs provide scientists the flexibility to accelerate a variety of AI, simulation and data analysis workloads.

More than three dozen research teams have been using Frontera since the system began supporting science applications in June. The supercomputer was funded by a $60 million award from the National Science Foundation.

Over its lifetime, Frontera and its GPU subsystems will be used for hundreds of applications by thousands of researchers from academic institutions around the world.

From Molecular to Supermassive, Accelerating Science Research

High performance computing systems help researchers rapidly analyze data and run experiments and simulations. GPU acceleration enables faster iteration, cutting down the time it takes for scientists to achieve breakthroughs that can improve human health, broaden our understanding of the universe, and inform how we use materials and energy resources.

“Techniques like machine learning and AI are becoming more and more important for researchers doing large-scale compute,” Gaffney said. “GPU environments allow scientists to take advantage of acceleration for a wide array of applications.”

Initial projects benefiting from the powerful NVIDIA GPU-accelerated Frontera subsystems include:

Astronomy insights: In the field of astrophysics, researchers often work with datasets 100 terabytes in size or more. GPU acceleration and AI enables them to separate signal from noise in these massive datasets, run large-scale simulations of the universe and better understand phenomena like neutron star collision.
Medical breakthroughs: Deep learning tools are used in the field of medical imaging to help doctors more quickly identify diseases and abnormalities, like spotting glioblastoma tumors from brain scans. With supercomputing resources, developers can create more complex models to improve the accuracy of cancer diagnosis.
Drug discovery: Identifying promising molecular compounds for drug candidates is computationally demanding, time-consuming and expensive. Researchers can leverage GPU-accelerated systems for faster simulations of protein folding, helping narrow down candidates to test in a wet laboratory.
Smart city planning: Cities collect vast quantities of data that can be analyzed for smarter urban planning. With an AI model that can analyze visual data from traffic pole cameras, cities can identify congested areas and better address safety concerns like dangerous intersections.
Understanding Earth: In weather modeling and in energy research, scientists depend on high-fidelity simulations to analyze the interaction of complex natural systems. Researchers can use AI to better predict weather events and earthquakes, inform precision agriculture projects and explore potential energy sources such as nuclear fusion.

Learn more about how NVIDIA GPUs power the world’s top supercomputers.

The post Frontera’s New Frontier: Fastest Academic Supercomputer Wields NVIDIA GPUs for Science Research appeared first on The Official NVIDIA Blog.

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Author: torontoai

[D] Model to predict site outage on Telecommunication Networks

Data Science Solutions SME – DataRobot – Toronto, ON

[Research] Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)

[D] NeurIPS 2019 decisions are out

[D] Classifying high dimensional sequences using traditional machine learning methods

[D] What annotation tool do you use for your image/video datasets?

[D] When Is It Better To Keep The Algorithm To Yourself?

[D] Machine learning, alchemy or real science?

[D] Library to extract sentiment from business news?

Frontera’s New Frontier: Fastest Academic Supercomputer Wields NVIDIA GPUs for Science Research

From Molecular to Supermassive, Accelerating Science Research