Category: Reddit MachineLearning

[D] Leveraging Learning in Robotics: RSS 2019 Highlights

Written on August 25, 2019. Posted in Reddit MachineLearning.

https://thegradient.pub/leveraging-learning-in-robotics-rss-2019-highlights/

I recently wrote a blog post, summarizing interesting works presented at the Robotics Science and Systems Conference. Would love your feedback!

submitted by /u/aseembits93
[link] [comments]

[P] Introducing Deepkit – the first collaborative desktop app for deep learning experiments. Experiment tracking, model debugging, infrastructure management.

Written on August 25, 2019. Posted in Reddit MachineLearning.

https://deepkit.ai

Hi guys, I’m the founder of Deepkit. An app that helps you visualize, debug, track, and run ML/DL experiments, directly on your workstation or on your own servers, in your LAN or in the cloud. Deepkit will be free for individual users and available in all app stores. You can use the app alone or use the real-time collaborative features within a team using the Deepkit team server.

We’re are looking for alpha users that want to help us building a better, cheaper and more efficient way of doing ML/DL experiments. If you’re interested, please register at the website directly or use this link. We currently only support MacOS, but Windows & Linux will follow. Follow us on Twitter @deepkitAI to get notified once we release the public version.

If you got any questions, I’m happy to answer in the comments.

submitted by /u/marcjschmidt
[link] [comments]

[R] AdvHat: Real-world adversarial attack on ArcFace Face ID system

Written on August 25, 2019. Posted in Reddit MachineLearning.

Hi! We have done some interesting research on breaking the current best public Face ID system – ArcFace – using the adversarial attack technique. It’s quite ordinary but what we succeed is to do it in the real world (i.e. made it not in digital domain only): someone can print the color sticker and stick it to a hat, and after that the similarity with the ground truth drops significantly. Even some sort of attack transferability to other top Face ID models from insightface exists.

Paper: https://arxiv.org/abs/1908.08705

Code: https://github.com/papermsucode/advhat

Video demonstration: https://www.youtube.com/watch?v=a4iNg0wWBsQ

Any comments are welcome!

submitted by /u/AleksanderPet
[link] [comments]

[D] How can I reduce the difference between real and predicted stock price?

Written on August 25, 2019. Posted in Reddit MachineLearning.

I’m using LSTM in deep learning to predict indexes.

I used MinMaxSacler(0~1) and The lowest MSE obtained through the model is 0.000003

However, there is a big difference between the predicted price and the real price.

I wonder if it is possible to bridge this gap.

If I can, I wonder which method I should use.

Your valuable opinions and thoughts will be very much appreciated.

submitted by /u/GoBacksIn
[link] [comments]

[D] Classification of irregular time series

Written on August 24, 2019. Posted in Reddit MachineLearning.

I have been working on classification of variable stars using light curves. However, the curves have different number of data points, from 40 data points up to 100. I have been training my network by randomly removing points until having about 50 points per star, and also augmented the data with different combination of eliminated points, but it seems to introduce a lot of loss.

I am interested in different approaches or ideas on how to handle the irregularity.

submitted by /u/CuzImLonelyWannaDie
[link] [comments]

[N] [CFP] 3rd edition of Emergent Communication workshop at NeurIPS’19 (EmeCom)

Written on August 24, 2019. Posted in Reddit MachineLearning.

Hi everyone,

We are pleased to announce the Call for Papers for the 3rd Workshop on Emergent Communication to be held at NeurIPS 2019 on Dec 13 or 14 at Vancouver, Canada.

We invite submissions from researchers both inside and outside the machine learning community in the following areas:

– multi-agent communication
– grounding emergent protocols to natural language
– compositionality in emergent/natural languages
– linguistic generalization
– learning cognitive skills through language
– language evolution
– deep multi-agent learning
– any other area related to the subject of the workshop

Submission Format:
The submitted work should be an extended abstract not exceeding 4 pages (excluding references and supplementary material). The submission should be in pdf format and should follow the style guidelines for NeurIPS 2019. The review process is double-blind. The submissions should not have been previously published in any ML conference nor have appeared in the NeurIPS main conference. We do however appreciate submitting published work from other non-ML conferences. Work currently under submission to another conference is also welcome. We discourage submitting the same work to other NeurIPS workshops. There will be no formal publication of workshop proceedings. However, the accepted papers will be made available online on the workshop website as non-archival reports to allow submissions to future conferences/journals.

Workshop Website: https://sites.google.com/view/emecom2019/
Submissions Link: https://cmt3.research.microsoft.com/emecom2019/

Important Dates:
Submissions Open: Aug 15
Submissions Deadline: Sep 15
Accepted papers notification: Sep 30
Camera ready Deadline: Nov 15
Upload poster/video (optional): Dec 7
Workshop Date: Dec 13 or 14

(All deadlines expire at 11:59pm AoE on the respective dates)

For any queries, reach out to us at [emecomworkshop@gmail.com](mailto:emecomworkshop@gmail.com). We look forward to receiving your submissions!

On behalf of all organizers, Cheers!

Abhinav Gupta (Mila)
Michael Noukhovitch (Mila)
Cinjon Resnick (NYU)
Natasha Jaques (MIT)
Angelos Filos (Oxford)
Marie Ossenkopf (Uni Kassel)
Jakob Foerster (FAIR)
Angeliki Lazaridou (DeepMind)
Ryan Lowe (Mila)
Douwe Kiela (FAIR)
Kyunghyun Cho (NYU/FAIR)

submitted by /u/dbg99
[link] [comments]

[R] [1906.03671] Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds

Written on August 24, 2019. Posted in Reddit MachineLearning.

submitted by /u/wordbag
[link] [comments]

[D] Machine Learning – WAYR (What Are You Reading) – Week 69

Written on August 24, 2019. Posted in Reddit MachineLearning.

This is a place to share machine learning research papers, journals, and articles that you’re reading this week. If it relates to what you’re researching, by all means elaborate and give us your insight, otherwise it could just be an interesting paper you’ve read.

Please try to provide some insight from your understanding and please don’t post things which are present in wiki.

Preferably you should link the arxiv page (not the PDF, you can easily access the PDF from the summary page but not the other way around) or any other pertinent links.

Previous weeks :

1-10	11-20	21-30	31-40	41-50	51-60	61-70
Week 1	Week 11	Week 21	Week 31	Week 41	Week 51	Week 61
Week 2	Week 12	Week 22	Week 32	Week 42	Week 52	Week 62
Week 3	Week 13	Week 23	Week 33	Week 43	Week 53	Week 63
Week 4	Week 14	Week 24	Week 34	Week 44	Week 54	Week 64
Week 5	Week 15	Week 25	Week 35	Week 45	Week 55	Week 65
Week 6	Week 16	Week 26	Week 36	Week 46	Week 56	Week 66
Week 7	Week 17	Week 27	Week 37	Week 47	Week 57	Week 67
Week 8	Week 18	Week 28	Week 38	Week 48	Week 58	Week 68
Week 9	Week 19	Week 29	Week 39	Week 49	Week 59
Week 10	Week 20	Week 30	Week 40	Week 50	Week 60

Most upvoted papers two weeks ago:

/u/Cantrill1758: Anomaly Detection Using Autoencoders with Nonlinear Dimensionality Reduction

/u/lysecret: https://arxiv.org/abs/1904.01681

Besides that, there are no rules, have fun.

submitted by /u/ML_WAYR_bot
[link] [comments]

[D] Why is PyTorch as fast as (and sometimes faster than) TensorFlow?

Written on August 24, 2019. Posted in Reddit MachineLearning.

Since both libraries use cuDNN under the hood, I would expect the individual operations to be similar in speed. However, TensorFlow (in graph mode) compiles a graph so when you run the actual train loop, you have no python overhead outside of the session.run call. In PyTorch, you are in Python a lot due to the dynamic graph, so I would expect that to add some overhead. Not to mention the fact that having a static graph means you can graph optimizations like node pruning and ordering operations. But in many benchmarks I see online, PyTorch has no problems keeping up with TensorFlow on GPUs.

A specific example is the Adam implementations in both libraries:

https://github.com/pytorch/pytorch/blob/master/torch/optim/adam.py

https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/training/adam.py

PyTorch has all the ops as you would expect. For TensorFlow in the {_resource}_apply_dense case (which is the common case, AFAIK), TensorFlow has a dedicated C++ implementation. So here, TensorFlow does not spend extra time in Python AND it has an optimized implementation in C++. In this case, why isn’t the TensorFlow version straight up faster?

I’ve heard that PyTorch is better optimized on the cuDNN level. Can anyone provide more details about this? What’s preventing TensorFlow from doing the same thing? The only optimization I know of is that PyTorch uses the NCHW format (which is better optimized for cuDNN) whereas TensorFlow by default uses NHWC.

I saw these two discussions but did not see a satisfactory answer:

https://www.reddit.com/r/MachineLearning/comments/7ujc6y/d_can_someone_give_a_technical_explanation_as_to/

https://www.reddit.com/r/MachineLearning/comments/8iguaw/d_why_is_tensorflow_so_slow/

submitted by /u/student_at_uw
[link] [comments]

[R] FacebookAI releases Adaptive attention span and All-attention layer to reduce decrease computation time / memory footprint

Written on August 23, 2019. Posted in Reddit MachineLearning.

https://video.twimg.com/tweet_video/ECqxU2AU0AAkBwc.mp4

To enable wider use of this powerful deep learning architecture, we propose two new methods. The first, adaptive attention span is a way to make Transformer networks more efficient for longer sentences. With this method, we were able to increase the attention span of a Transformer to over 8,000 tokens without significantly increasing computation time or memory footprint. The second, all-attention layer is a way to simplify the model architecture of Transformer networks. Even with a much simpler architecture, our all-attention network matched the state-of-the-art performance of Transformer networks.

https://ai.facebook.com/blog/making-transformer-networks-simpler-and-more-efficient/

submitted by /u/BatmantoshReturns
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Category: Reddit MachineLearning

[D] Leveraging Learning in Robotics: RSS 2019 Highlights

[P] Introducing Deepkit – the first collaborative desktop app for deep learning experiments. Experiment tracking, model debugging, infrastructure management.

[R] AdvHat: Real-world adversarial attack on ArcFace Face ID system

[D] How can I reduce the difference between real and predicted stock price?

[D] Classification of irregular time series

[N] [CFP] 3rd edition of Emergent Communication workshop at NeurIPS’19 (EmeCom)

[R] [1906.03671] Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds

[D] Machine Learning – WAYR (What Are You Reading) – Week 69

[D] Why is PyTorch as fast as (and sometimes faster than) TensorFlow?

[R] FacebookAI releases Adaptive attention span and All-attention layer to reduce decrease computation time / memory footprint