Author: torontoai

[P] How can I build this simple text-based ML tool?

Written on November 24, 2019. Posted in Reddit MachineLearning.

Hello everyone!

I work with spreadsheets a lot, doing tasks manually that are just a bit too complex for rules, but I believe they certainly fall into what ML can handle. In a nutshell, I spend 2+ hours a day going through company names, removing legal terms like “LLC” or “Limited”, and humanizing them.

For instance, I have a spreadsheet with company names and emails.

Company Name	Email Address
Concur Recruitment Limited – 02476 668 204	sconvery@concurengineering.co.uk
Confluent Technology Group	mark.anderson@confluentgroup.com
Construction Maintenance and Allied Workers	donmelanson@cmaw.ca

These would become (currently by hand):

Company Name	Email Address
Concur Engineering	sconvery@concurengineering.co.uk
Confluent	mark.anderson@confluentgroup.com
CMAW	donmelanson@cmaw.ca

What we’re doing here is:

Shorting names to their essence
Removing legal terms and words
Looking at domain names (in email addresses) as a clue for the “most human name”

Now, I very well believe this is something Google Cloud has capabilities for. Given the lack of programming involved with Google Cloud ML (and its potential integration with Google Sheets), I’d imagine it’s the best vehicle for this tool.

Some questions before I embark upon this journey:

Would your recommend I use Google Cloud ML or another tool?
How much data would you imagine would be necessary to train this tool? (uncleaned spreadsheets and cleaned spreadsheets)
Am I critically misunderstanding something here? This is pretty much my first time practically applying ML.

Thank you very much for all your help!

submitted by /u/ventura__highway
[link] [comments]

[D] Use AI to turn low poly world into photorealistic scenarios

Written on November 24, 2019. Posted in Reddit MachineLearning.

Hi,

I wonder why we are still trying to mimic photorealistic world by counting every reflection, polygon, tracing every ray and so on. Shouldn’t it be done in such manner that AI is just doing the job basing on photos and low polygon input like here https://assetstore.unity.com/packages/3d/characters/animals/poly-art-forest-set-128568 Also all other games like Zelda BOTW, Team Fortress 2 or even Fortnite could be easily turned by AI into photorealistic env. Shouldn’t we start thinking about doing AI accelerators (like first 3dfx cards) for enriching low polygonic world’s generated easily by most commodity hardware? I guess even ray tracing could be made by ML. I believed that future belongs to generating world by AI not by tricky mathematic graphics algorithms. Especially that in future it is easier to go from such trained networks into environment where instead of heaving an output on display, the output would be “drawn” directly in human brain through neural-connectivity. Also AI is able to properly handle cases where object is moving fast or turning around.

Cheers, Alexa

submitted by /u/AlexaPomata
[link] [comments]

[1911.09723] Fast Sparse ConvNets

Written on November 24, 2019. Posted in Reddit MachineLearning.

submitted by /u/ekelsen
[link] [comments]

[D] Image synthesis before GANs

Written on November 24, 2019. Posted in Reddit MachineLearning.

I’m writing my master’s thesis on image generation and was wondering: which other methods were/are used to synthesize images, aside from GANs? Any time I look up “image synthesis” I only find GAN tutorials, and excluding GANs seems to bring up photon scanning.

submitted by /u/WoahCanyonero
[link] [comments]

[R] DeepFovea: Neural Reconstruction for Foveated Rendering and Video Compression using Learned Statistics of Natural Videos (SIGGRAPH Asia 2019)

Written on November 24, 2019. Posted in Reddit MachineLearning.

I thought this project from FB Research is a really cool work:

Abstract

In order to provide an immersive visual experience, modern displays require head mounting, high image resolution, low latency, as well as high refresh rate. This poses a challenging computational problem. On the other hand, the human visual system can consume only a tiny fraction of this video stream due to the drastic acuity loss in the peripheral vision. Foveated rendering and compression can save computations by reducing the image quality in the peripheral vision. However, this can cause noticeable artifacts in the periphery, or, if done conservatively, would provide only modest savings. In this work, we explore a novel foveated reconstruction method that employs the recent advances in generative adversarial neural networks. We reconstruct a plausible peripheral video from a small fraction of pixels provided every frame. The reconstruction is done by finding the closest matching video to this sparse input stream of pixels on the learned manifold of natural videos. Our method is more efficient than the state-of-the-art foveated rendering, while providing the visual experience with no noticeable quality degradation. We conducted a user study to validate our reconstruction method and compare it against existing foveated rendering and video compression techniques. Our method is fast enough to drive gaze-contingent head-mounted displays in real time on modern hardware. We plan to publish the trained network to establish a new quality bar for foveated rendering and compression as well as encourage follow-up research.

Project page / paper / video / code: https://research.fb.com/publications/deepfovea-neural-reconstruction-for-foveated-rendering-and-video-compression-using-learned-statistics-of-natural-videos/

submitted by /u/hardmaru
[link] [comments]

[D] ImageNet classification training full-resolution, no crop no resize.

Written on November 24, 2019. Posted in Reddit MachineLearning.

Are there papers that do so? ImageNet images have various resolutions. Since we already have adaptive pooling techniques such as global average pooling, attention-based pooling, etc. that support variable resolution, should it be possible?

Random fixed size crop and resize seems hacky.

submitted by /u/tsauri
[link] [comments]

[P] NBoost: Boost Elasticsearch Search Relevance by 80% with BERT

Written on November 24, 2019. Posted in Reddit MachineLearning.

Hi Everyone!

New to reddit, but I’d like to share a project I’ve been working on called NBoost. It’s essentially a proxy for search APIs (e.g. Elasticsearch) that reranks search results using finetuned models (e.g. BERT).

Check out our medium article or github to learn more!

It’s main features include: – Super easy to set up (you can just pip install nboost) – Easy, non-invasive integration with Elasticsearch and potentially other search APIs. – Finetuned models are plugins (you can swap them in and out). – Fast and scaleable (written at the lowest level possible)

submitted by /u/colethienes
[link] [comments]

[D] anybody know an implementation of fchollet’s 2016 paper on label embeddings?

Written on November 23, 2019. Posted in Reddit MachineLearning.

This is the paper: Information-theoretical label embeddings for large-scale image classification

https://arxiv.org/abs/1607.05691

submitted by /u/daanvdn
[link] [comments]

[R] A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.

Written on November 23, 2019. Posted in Reddit MachineLearning.

In the paper: MUSE: Parallel Multi-Scale Attention for Sequence to Sequence Learning，

They delve into three questions in sequence to sequence learning:

Is attention alone good enough？
Is parallel representation learning applicable to sequence data and tasks?
How to design a module that combines both inductive bias of convolution and self-attention？

They find that there are shortcomings in stand-alone self-attention, and present a new module that maps the input to the hidden space and performs the three operations of self-attention, convolution and nonlinearity in parallel, simply stacking this module outperforms all previous models including Transformer (Vasvani et al., 2017) on main NMT tasks under standard setting.

Key features:

First successfully combine convolution and self-attention in one module for sequence tasks by the proposed shared projection，
SOTA on three main translation datasets, including WMT14 En-Fr, WMT14 En-De and IWSLT14 De-En，
Parallel learn sequence representations and thus have potential for acceleration。

Quick links:

Arxiv : pdf;

Github : Code, pretrained models, instructions for training are all available.

Main results:

https://preview.redd.it/2qkne32qwo041.png?width=554&format=png&auto=webp&s=f7ed2973ebdd434f2982a2ec546f65ca635ff529

https://preview.redd.it/cdx3sc5vwo041.png?width=554&format=png&auto=webp&s=611d277629129f5734119321b4466f1bd4781f03

Abstract:

In sequence to sequence learning, the self-attention mechanism proves to be highly effective, and achieves significant improvements in many tasks. However, the self-attention mechanism is not without its own flaws. Although self-attention can model extremely long dependencies, the attention in deep layers tends to overconcentrate on a single token, leading to insufficient use of local information and difficultly in representing long sequences. In this work, we explore parallel multi-scale representation learning on sequence data, striving to capture both long-range and short-range language structures. To this end, we propose the Parallel MUlti-Scale attEntion (MUSE) and MUSE-simple. MUSE-simple contains the basic idea of parallel multi-scale sequence representation learning, and it encodes the sequence in parallel, in terms of different scales with the help from self-attention, and pointwise transformation. MUSE builds on MUSE-simple and explores combining convolution and self-attention for learning sequence representations from more different scales. We focus on machine translation and the proposed approach achieves substantial performance improvements over Transformer, especially on long sequences. More importantly, we find that although conceptually simple, its success in practice requires intricate considerations, and the multi-scale attention must build on unified semantic space. Under common setting, the proposed model achieves substantial performance and outperforms all previous models on three main machine translation tasks. In addition, MUSE has potential for accelerating inference due to its parallelism.

submitted by /u/stopwind
[link] [comments]

[D] What do you see as the most promising directions for reducing sample inefficiency in reinforcement learning?

Written on November 23, 2019. Posted in Reddit MachineLearning.

I often read from ML researchers, but more from computational cognitive scientists, that humans are able to generalize patterns from only a few data points or use “rich, informative priors” even as children, and how that is very important for us as cognitive beings that sets us apart from the current neural network approaches to RL used today.

I’m also not entirely convinced that the current neural net paradigm with the McCulloch–Pitts-esque neurons is ever going to become sample efficient enough for real-world reinforcement learning tasks. It seems like despite our best efforts to increase sample efficiency in NN techniques, the most impressive results still use hundreds of thousands or more simulations/data points that could be infeasible to implement for any sufficiently complex real-world environments.

That being said, what approaches are you most excited for in reducing sample efficiency in reinforcement learning or in neural network techniques in general?

submitted by /u/p6m_lattice
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Author: torontoai

[P] How can I build this simple text-based ML tool?

[D] Use AI to turn low poly world into photorealistic scenarios

[1911.09723] Fast Sparse ConvNets

[D] Image synthesis before GANs

[R] DeepFovea: Neural Reconstruction for Foveated Rendering and Video Compression using Learned Statistics of Natural Videos (SIGGRAPH Asia 2019)

[D] ImageNet classification training full-resolution, no crop no resize.

[P] NBoost: Boost Elasticsearch Search Relevance by 80% with BERT

[D] anybody know an implementation of fchollet’s 2016 paper on label embeddings?

[R] A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.

[D] What do you see as the most promising directions for reducing sample inefficiency in reinforcement learning?