Author: torontoai

[D] Does Neural Program Synthesis be improved with x100 scaling of samples/compute/labels/curriculum ?!

Written on September 4, 2019. Posted in Reddit MachineLearning.

Looking at some recent papers on program synthesis

Neural (Meta) Program Synthesis, Singh {GB}

AlphaNPI twitted that acceptance to NIPS2019 with spotlight

I am wondering if field is still working out good architectures, representations, etc

existing SOTA techniques can be applied if we have x100 more compute, or a massive dataset of input-output pairs, or maybe a long detailed curated curriculum of specs and solutions, etc

submitted by /u/so_tiredso_tired
[link] [comments]

[R] Reusing Convolutional Activations from Frame to Frame to Speed up Learning and Inference

Written on September 4, 2019. Posted in Reddit MachineLearning.

Abstract: When processing similar frames in succession, we can take advantage of the locality of the convolution operation to reevaluate only portions of the image that changed from the previous frame. By saving the output of a layer of convolutions and calculating the change from frame to frame, we can reuse previous activations and save computational resources that would otherwise be wasted recalculating convolutions whose outputs we have already observed. This technique can be applied to many domains, such as processing videos from stationary video cameras, studying the effects of occluding or distorting sections of images, applying convolution to multiple frames of audio or time series data, or playing Atari games. Furthermore, this technique can be applied to speed up both training and inference.

Summary of results: Reusing convolutional activations with CPUs is a good way to save computation for both training and inference, and can serve as a viable alternative to training or doing inference on GPUs in some scenarios. It is likely cheaper, sometimes faster, and it will likely have access to more memory. Unfortunately, there is currently not as much incentive to use this method on GPUs, other than possibly saving power. There are many possible application domains for this technique, and there are likely many ways to improve upon it.

Code and result figures: https://github.com/arnokha/reusing_convolutions

Paper link: [Submitted, coming soon]

submitted by /u/arnokha
[link] [comments]

[R] Public ML Companies

Written on September 4, 2019. Posted in Reddit MachineLearning.

Hi all!

I’m currently working on a research project, looking to profile some publicly traded ML companies.

Problem is, most ML companies get bought out by IBM, Microsoft, Google, or Amazon before going public.

I’ve got a small list of ones I’ve found, but if you know of any that are doing cool things, I’d love to hear about them!

Thanks in advance 🙂

submitted by /u/BitMadHere
[link] [comments]

[R] Adversarial Training for Free!

Written on September 4, 2019. Posted in Reddit MachineLearning.

submitted by /u/downtownslim
[link] [comments]

[P] Write With Transformer: A web app to compare generative NLP transformer-based models by Hugging Face

Written on September 4, 2019. Posted in Reddit MachineLearning.

Sharing with you a project we’ve been working on at Hugging Face: Write With Transformer. It is a web app that hosts most state-of-the-art transformer-based NLP generative models like GPT-2, GPT or XLNet.

You can write a context and trigger completions from the generative model you choose, in a Google Doc-like interface. It also includes one of our fine-tuned models, using GPT-2 as a pretrained model and fine-tuning it on Arxiv papers to get NLP/Deep Learning completions.

It’s built on top of our library pytorch-transformers. Let us know what you think!

submitted by /u/jikkii
[link] [comments]

[R] DeepMind: Making Efficient Use of Demonstrations to Solve Hard Exploration Problems

Written on September 4, 2019. Posted in Reddit MachineLearning.

This paper introduces R2D3, an agent that makes efficient use of demonstrations to solve hard exploration problems in partially observable environments with highly variable initial conditions. We also introduce a suite of eight tasks that combine these three properties, and show that R2D3 can solve several of the tasks where other state of the art methods (both with and without demonstrations) fail to see even a single successful trajectory after tens of billions of steps of exploration.

Blog post: https://deepmind.com/research/publications/Making-Efficient-Use-of-Demonstrations-to-Solve-Hard-Exploration-Problems

Paper: https://arxiv.org/pdf/1909.01387.pdf

submitted by /u/modeless
[link] [comments]

Dial A for AI: Charter Boosts Customer Service with AI

Written on September 4, 2019. Posted in NVIDIA.

Charter Communications is working to make customer service smarter even before an operator picks up the phone.

Senior Director of Wireless Engineering Jared Ritter took a break from his presentations at GTC in Santa Clara to talk to AI Podcast host Noah Kravitz about Charter’s perspective on customer relations.

Multi-service operators — operators that run multiple cable television systems — revolve around client relations. And when it comes to customer service, “cable companies don’t have the best reputations,” Ritter admits.

Charter Communications, also known as Spectrum, is using AI to improve their customer service and process data more intelligently.

The most common basis for customer service at a standard telecommunications company is called interactive voice response. This automated voice lists a menu to route customers to the correct line.

But this often takes too long or routes customers incorrectly. Kravitz admits that when he hears the automated voice, “I just start yelling ‘representative’ at it until someone answers or my wife takes the phone away.”

Charter wants to make it easier for clients to call and get help. “You want your customers to talk to you,” Ritter says. “And no matter how good your network is, you’re never gonna have a day where you don’t receive calls or questions from customers.”

The other aspect of customer service is called agency, which is what the company’s AI can do. Charter wants to move past the traditional use of AI to route customers to yet another menu.

To do so, Charter is challenging the data lake model. Ritter explains that, in this traditional setup, networks generate a large amount of data that pours into a lake and stays in its native format until it’s needed. It’s then more challenging to recognize and access important data.

“We’ve flipped the script on that, and we’ve got the antithesis of a data lake, where we’ve got the AI looking through all that data before we ever store it,” Ritter explains. Their AI is trained to look for key issues or trends, allowing customer service representatives to be better informed to help clients.

Their reps can then preemptively predict customer problems, rather than learning about network outages or other issues after the fact.

When asked what else AI can make possible for Charter’s customer service, Ritter reflects, “I can’t even think about what it’ll look like in five years, because every week something new happens.”

To find out more about what Charter is making possible, visit their newsroom.

Help Make the AI Podcast Better

Have a few minutes to spare? Fill out this short listener survey. Your answers will help us make a better podcast.

How to Tune in to the AI Podcast

Get the AI Podcast through iTunes, Castbox, DoggCatcher, Overcast, PlayerFM, Podbay, PodBean, Pocket Casts, PodCruncher, PodKicker, Stitcher, Soundcloud and TuneIn. Your favorite not listed here? Email us at aipodcast [at] nvidia [dot] com.

The post Dial A for AI: Charter Boosts Customer Service with AI appeared first on The Official NVIDIA Blog.

[D] Shouldn’t we be doing more than complaining about patents and hoping they won’t affect our research and development activities?

Written on September 4, 2019. Posted in Reddit MachineLearning.

Check out this newly issued patent (filed in Sept. 2014) that appears to shut down the entire discipline of ML as it applies to classifying different machine actions with AI.

https://patentimages.storage.googleapis.com/17/f1/6d/15a1e6f88983c7/US10032117.pdf

I’m paraphrasing / simplifying so it’s probably not completely accurate, but it looks like this tries to patent:

1. Receiving training data from channels on-board (not external to) a mobile machine;

2. Determining training features/data and corresponding labels from the training data;

3. Where the labels/data relate to different machine actions (as opposed to simply active or inactive) each occurring over different time periods;

4. Building the classifier by feeding training features/data and labels into a ML algorithm.

Surely this was known prior to the Sept. 2014 filing date of this patent. Is anyone else looking for a green light to continue to innovate in this space? Would be curious if anyone is aware of, or can find, an earlier public document that overlaps with the main claims of this patent. Thanks!! PS – yes, I’ve googled and found some relevant documents, but have not yet found a single document that expressly includes all 4 items listed above.

submitted by /u/throwaway_mlengineer
[link] [comments]

[D] optimizing clipping functions

Written on September 4, 2019. Posted in Reddit MachineLearning.

In Reinforcement Learning I have noticed a trend in some(1, 2) papers that involve optimizing surrogate clipped functions. Has anyone seen any work that digs deeper into the effects of this? For example, in this paper they dig deeper into the relationship between clipped surrogate functions and trust regions. The above references I gave were from clipped surrogate objectives, but this doesn’t have to be the case(ex: drop the max and only optimize the clipped objective).

submitted by /u/CartPole
[link] [comments]

[D] consequences of converting .tiff to some tf.data supported format in terms of information loss?

Written on September 4, 2019. Posted in Reddit MachineLearning.

Hi, I’m currently working with a dataset of .tiff files and want to feed those to a model using a tf Dataset for performance reasons. However, tf currently does not support loading .tiff files with Dataset.

Now I’m curious how to assess the loss in information if I convert a tiff to e.g. a png. Currently theses tiffs only hold a single image which for me should not make much a difference.

What would be a good approach to assess this?

submitted by /u/turnmanh
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Author: torontoai

[D] Does Neural Program Synthesis be improved with x100 scaling of samples/compute/labels/curriculum ?!

[R] Reusing Convolutional Activations from Frame to Frame to Speed up Learning and Inference

[R] Public ML Companies

[R] Adversarial Training for Free!

[P] Write With Transformer: A web app to compare generative NLP transformer-based models by Hugging Face

[R] DeepMind: Making Efficient Use of Demonstrations to Solve Hard Exploration Problems

Dial A for AI: Charter Boosts Customer Service with AI

Help Make the AI Podcast Better

How to Tune in to the AI Podcast

[D] Shouldn’t we be doing more than complaining about patents and hoping they won’t affect our research and development activities?

[D] optimizing clipping functions

[D] consequences of converting .tiff to some tf.data supported format in terms of information loss?