Author: torontoai

[D] How do you build and train a model with a million output classes?

Written on December 4, 2019. Posted in Reddit MachineLearning.

Most networks in tutorials only show examples with 10-1000 output classes. What techniques can be used to build and train networks with a million or more output classes?

It seems to me that using the traditional approach would be horribly computationally expensive. Intuitively, I would expect that some kind of hierarchical approach should be taken. Any links to blogs or papers that show these techniques would be appreciated.

submitted by /u/mateja
[link] [comments]

[1912.01412] Deep Learning for Symbolic Mathematics

Written on December 4, 2019. Posted in Reddit MachineLearning.

submitted by /u/avturchin
[link] [comments]

Pod Squad: Descript Uses AI to Make Managing Podcasts Quicker, Easier

Written on December 4, 2019. Posted in NVIDIA.

You can’t have an AI podcast and not interview someone using AI to make podcasts better.

That’s why we reached out to serial entrepreneur Andrew Mason to talk to him about what he’s doing now. His company, Descript Podcast Studio, uses AI, natural language processing and automatic speech synthesis to make podcast editing easier and more collaborative.

Mason, Descript’s CEO and perhaps best known as Groupon’s founder, spoke with AI Podcast host Noah Kravitz about his company and the newest beta service it offers, called Overdub.

Key Points From This Episode

Descript works like a collaborative word processor. Users record audio, which Descript converts to text. They can then edit and rearrange text, and the program will change the audio.
Overdub, created in collaboration with Descript’s AI research division, eliminates the need to re-record audio. Type in new text, and Overdub creates audio in the user’s voice.
Descript 3.0 launched in November, adding new features such as a detector that can identify and remove vocalized pauses like “um” and “uh” as well as silence.

Tweetables

“We’re trying to use AI to automate the technical heavy lifting components of learning to use editors — as opposed to automating the craft — and we leave space for the user to display and refine their craft” — Andrew Mason [07:10]

“What’s really unique to us is a kind of tonal or prosodic connecting of the dots, where we’ll analyze the audio before and after whatever you’re splicing in with Overdub, and make sure that it sounds continuous in a natural transition” — Andrew Mason [10:30]

The Next Hans Zimmer? How AI May Create Music for Video Games, Exercise Routines

Imagine Wolfgang Amadeus Mozart as an algorithm or the next Hans Zimmer as a computer. Pierre Barreau and his startup, Aiva Technologies, are using deep learning to compose music. Their algorithm can create a theme in four minutes flat.

How Deep Learning Can Translate American Sign Language

Rochester Institute of Technology computer engineering major Syed Ahmed, a research assistant at the National Technical Institute for the Deaf, uses AI to translate between American sign language and English. Ahmed trained his algorithm on 1,700 sign language videos.

Tune in to the AI Podcast

Get the AI Podcast through iTunes, Google Podcasts, Google Play, Castbox, DoggCatcher, Overcast, PlayerFM, Pocket Casts, Podbay, PodBean, PodCruncher, PodKicker, Soundcloud, Spotify, Stitcher and TuneIn.

Make Our Podcast Better

Have a few minutes to spare? Fill out this short listener survey. Your answers will help us make a better podcast.

The post Pod Squad: Descript Uses AI to Make Managing Podcasts Quicker, Easier appeared first on The Official NVIDIA Blog.

[R] NeurIPS 2019 Livestream

Written on December 4, 2019. Posted in Reddit MachineLearning.

aideeptalk will livestream the expo and posters at NeurIPS 2019 on Twitch at twitch.tv/aideeptalk

To receive a notification when we go live, please follow us and enable notifications on our Twitch channel.

Please pass this on to those who can’t make it to NeurIPS

For more details see our website aideeptalk.com

submitted by /u/aideeptalk
[link] [comments]

[D] Confused about generating a translation using Transformer

Written on December 4, 2019. Posted in Reddit MachineLearning.

I’m reading the Attention Is All You Need paper and it doesn’t seem to explain how exactly the Transformer is used to generate a translation. Here’s how I understand it so far (please correct if I’m wrong):

A sequence of k tokens comes in as one-hot vectors of length v – the vocab size. This is a (k x v) token matrix.
The tokens are embedded in d_m (model size, e.g. 512) dimensional space via multiplication by an Embedding matrix E of dim (v x d_m), yielding a (k x d_m).
Positional Encodings added, dim is still (k x d_m).

Encoding:

Encoder block takes in the (k x d_m) matrix and outputs another (k x d_m) matrix.
Repeat N times to get a final (k x d_m) matrix, i.e. the encoder output.

Now for decoding:

The decoder takes in a (p x d_m) matrix and adds positional encodings.
The (non-masked) multi-head attention function inside the decoder receives encoder’s (k x d_m) output as key K, and value V, and a (p x d_m) matrix as the query Q, yielding a (p x d_m) output.
The final output of the decoder is therefore (p x d_m).

Final output:

The (p x d_m) decoder output is mapped to (p x v) by a matrix multiply (Question: they say it’s “tied” to the embedding matrix E, so is this just E^T?).
Select the max of each value in the p rows (softmax), so you get p tokens out.

Suppose I want to translate the sequence “This attention paper is super confusing !” into German. Here k = 7, so my encoder outputs a (7 x 512) matrix. From here, can someone walk me through the steps of generating the translation?

Thanks for looking at my question and have an awesome day!

submitted by /u/ME_PhD
[link] [comments]

Microsoft using ‘Game of Drones’ AI racing challenge to improve trustable autonomy systems

Written on December 4, 2019. Posted in Microsoft.

The post Microsoft using ‘Game of Drones’ AI racing challenge to improve trustable autonomy systems appeared first on The AI Blog.

[R] Piecewise Strong Convexity of Neural Networks

Written on December 4, 2019. Posted in Reddit MachineLearning.

Paper: https://arxiv.org/abs/1810.12805

Video summary: https://www.youtube.com/watch?v=z89BTMQGVn

Earlier related work: https://arxiv.org/abs/1607.04917 (piecewise convexity)

I am not the author. This paper will be presented at NeurIPS this month and exposes some convexity results about piece-wise linear nns under the least squares loss – namely piecewise strong-convexity & the non-existance of differentiable local maxima. The approach is a spectral analysis of the Hessian and weights of the nn. The result is a relatively attractive convergence estimate for sgd.

I guess this provides some more motivation for studying techniques like ADMM which have convergence properties for some classes of piece-wise functions and can exploit lipschitz cts gradients. Nice work!

submitted by /u/i-heart-turtles
[link] [comments]

[D] Best network for battle game agents (neuroevolution)

Written on December 4, 2019. Posted in Reddit MachineLearning.

Hi,

I’m working on an indie game where you evolve teams of agents that each have a neural network, and then battle them against other players team (looks like this: https://youtu.be/EPekL1JMXEY).

I already have an implementation of sparse lstm-ish networks (1), but I’d like to optimize this further and wanted to see what people here have to suggest. Since it’s an evolution based game I don’t use backprop. It also needs to be fairly simple as it all runs on the GPU (which is why I can have simulations of thousands of agents on a single machine). And since it all runs on the GPU I’d prefer something that is fixed size, which is why I’ve stayed away from NEAT so far.

So my question is; what would be the best network for something like this?

(1) My current network works as follows; Each node has: a state (1 float), 12 indices, 12 weights, 2 bias values. Each index decides which other nodes it “reads” from, so a node can be connected to any other node (layers are therefore less important, they only decide the order of updates). 8 of the inputs are used for the next value of the state. 4 of the inputs are used as a write gate (-1 keep state -> 1 update state). There are some more details but that’s roughly it.

submitted by /u/FredrikNoren
[link] [comments]

[D] Figure Eight and alternatives

Written on December 4, 2019. Posted in Reddit MachineLearning.

Hey,

We are considering to use Figure Eight in our company.

Has someone tried to use Figure Eight as an annotation plartform? What are the pros and cons? How is it compared to Amazon in terms of price, annotation quality etc?

Would love to hear your opinions.

submitted by /u/guzguzit
[link] [comments]

[D] Evolutionary Algorithms researchers, do you feel like a new library is needed?

Written on December 4, 2019. Posted in Reddit MachineLearning.

Researchers who work with Evolutionary Algorithms, do you feel there are current libraries that satisfy your needs? Do you feel there is a need for a new library? What features do you feel are missing/you need?

submitted by /u/ghost_shaba7
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Author: torontoai

[D] How do you build and train a model with a million output classes?

[1912.01412] Deep Learning for Symbolic Mathematics

Pod Squad: Descript Uses AI to Make Managing Podcasts Quicker, Easier

Key Points From This Episode

Tweetables

You Might Also Like

The Next Hans Zimmer? How AI May Create Music for Video Games, Exercise Routines

How Deep Learning Can Translate American Sign Language

Tune in to the AI Podcast

Make Our Podcast Better

[R] NeurIPS 2019 Livestream

[D] Confused about generating a translation using Transformer

Microsoft using ‘Game of Drones’ AI racing challenge to improve trustable autonomy systems

[R] Piecewise Strong Convexity of Neural Networks

[D] Best network for battle game agents (neuroevolution)

[D] Figure Eight and alternatives

[D] Evolutionary Algorithms researchers, do you feel like a new library is needed?