Category: Reddit MachineLearning

[P] r/datascienceproject is live

Written on August 22, 2019. Posted in Reddit MachineLearning.

Post your data science project to r/datascienceproject without restrictions. It’s a way to notify the public of what you are working on .

submitted by /u/OppositeMidnight
[link] [comments]

[R] A collection of notes on 150 papers in neural dialog modeling and related areas

Written on August 22, 2019. Posted in Reddit MachineLearning.

https://github.com/ricsinaruto/Seq2seqChatbots/wiki/Chatbot-and-Related-Research-Paper-Notes-with-Images

submitted by /u/ricsinaruto
[link] [comments]

[D] How do you handle the high uncertainty of your timeline/deadline for delivering a ml/dl product?

Written on August 22, 2019. Posted in Reddit MachineLearning.

Hi everyone! I found this interesting post on LinkedIn and I would like to know your opinion about.

Here’s the post for those of you who don’t have a LinkedIn account or the author of the post in your connections.

“The most challenging problem data scientists are facing today is having a highly uncertain timeline/deadline for delivering a product. I find it challenging to predict the due delivery date for a data science-related product. Because you need much experiment to understand the problem and then propose the solution, before that, it’s impossible to determine the timeline or even the accuracy that can be achieved. Also, sometimes after the EDA you may face many difficulties that may change the deadline. How can we design a data science project in a more deterministic way like regular software? e.g., the first-month design database, then in the second design the login page, etc. I would love to know how do you deal with it.” – Mundher Al-Shabi – Data Scientist at CADS

submitted by /u/pirate7777777
[link] [comments]

[D] Attention layer yields inconsistent results

Written on August 22, 2019. Posted in Reddit MachineLearning.

Hi reddit,

I am currently working on a problem that involves Recurrent Neural Networks. More precisely, I am dealing with sequences of inputs and try to make a prediction at each time step. As such, I decided to try to include an attention layer that is to look on the left context only.

The problem with this is that the results vary depending on the validation data, and by this, I mean a difference of 10-15% accuracy ! I suspect that something is not going so well. I even wonder if the attention layer does not look at both the left and right contexts, which would partly explain why sometimes performances are so good (but I set history_only=True ).

Has it ever happened to you, and what would you do in such situations ? Thank you 🙂

submitted by /u/lazywiing
[link] [comments]

Automatic Data Augmentation for NLP task

Written on August 22, 2019. Posted in Reddit MachineLearning.

submitted by /u/I_ai_AI
[link] [comments]

[D] Where can I find CNN baseline architectures for MNIST, CIFAR-10?

Written on August 22, 2019. Posted in Reddit MachineLearning.

For my research in semi-supervised learning, I am looking for good CNN architectures with their respective optimization hyperparameters and pre-processing schemes to be used as baselines for my work on semi-supervised learning.

submitted by /u/adelredjimi
[link] [comments]

[P] OpenGPT-2: We Replicated GPT-2 Because You Can Too

Written on August 22, 2019. Posted in Reddit MachineLearning.

The author trained a 1.5 billion param GPT-2 model on a similar sized text dataset called OpenWebTextCorpus and they reported perplexity results that can be compared with the original model.

Recently, large language models like BERT¹, XLNet², GPT-2³, and Grover⁴ have demonstrated impressive results in generating text and on multiple NLP tasks. Since Open-AI has not released their largest model at this time (but has released their 774M param model), we seek to replicate their 1.5B model to allow others to build on our pretrained model and further improve it.

https://medium.com/@vanya_cohen/opengpt-2-we-replicated-gpt-2-because-you-can-too-45e34e6d36dc

submitted by /u/baylearn
[link] [comments]

[D] Can you use LBFGS to solve Huber linear regression problems?

Written on August 22, 2019. Posted in Reddit MachineLearning.

That is, is it guarantee to converge to a minimum even though the Huber loss isn’t second-differentiable?

submitted by /u/sbarratt
[link] [comments]

[N] Announcing the xView 2 Prize Challenge: Assess Building Damage

Written on August 22, 2019. Posted in Reddit MachineLearning.

https://i.redd.it/6h9ydxj644i31.jpg

When disaster strikes, speed is critical. The time it takes to properly assess damage in the wake of a major event can be the difference between life and death. However, emergency responders must often navigate disruptions to local communication and transportation infrastructure, making accurate assessments dangerous, difficult, and slow. While satellite and aerial imagery offer a less risky alternative that covers more ground, analysts must still conduct manual, time-intensive assessments of images.

The Defense Innovation Unit’s (DIU’s) xView2 Challenge (Challenge) seeks to automate post-disaster damage assessment. DIU is challenging machine learning experts to develop computer vision algorithms that will speed up analysis of satellite and aerial imagery by localizing and categorizing various types of building damage caused by natural disasters. The xView2 Challenge is DIU’s second prize competition focused on furthering innovation in computer vision for humanitarian assistance and disaster relief (HADR) efforts. This year’s competition builds upon the xView1 Challenge, which sought out computer vision algorithms to locate and identify distinct objects on the ground useful to first responders.

“DIU’s goal in hosting this Challenge is to enlist the global community of machine learning experts to tackle a critically hard problem: detecting key objects in overhead imagery in context and assessing damage in a disaster situation,” said Mike Kaul, DIU AI Portfolio Director.

“We are always looking for ways to improve rapid damage assessment to ensure we and our partners deliver the right resources to the right places at the right time, and we are confident the DIU Challenge can contribute to that goal,” said FEMA Regional Administrator Robert Fenton, a partner in the Challenge.

DIU led a team of experts from academia and industry to create a new dataset, xBD, to enable both localization and damage assessment before and after disasters. The dataset will provide the foundation for the Challenge. While several open datasets for object detection from satellite imagery already exist (e.g. SpaceNet, xView 1), each represent only a single snapshot in time and lack information about the type and severity of damage following a disaster. xBD is currently the largest and most diverse annotated building damage dataset, allowing ML/AI practitioners to generate and test models to help automate building damage assessment. The open source electro-optical imagery (0.3 m resolution) xBD dataset will encompass ~544,556 building annotations across ~19,520 square kilometers of freely available imagery from multiple countries*. Six disaster types are included: wildfire, landslides, volcanic eruptions, earthquakes/tsunamis, and wind and flooding damage (*more data are being added to xBD as the labeling becomes available).

There are three competition prize tracks for the xView2 Challenge:

Open source. Teams compete for leaderboard position and awards for top scores. By releasing their models publicly under a permissive open source license, teams also become eligible for an additional open source award.
Non-exclusive Government purpose rights. Teams grant government purpose rights to become eligible for awards or top scores on the leaderboard. Solutions can be used to help future disaster recovery efforts.
Evaluation Only. Teams retain IP and only grant rights to benchmark their solution and compete for leaderboard position. Top teams in this category will still be eligible for a special monetary prize pool for their submissions.

The best solutions for all three categories will be eligible for a share of a $150,000 prize purse. Top solvers will also be invited to present their work at the December NeurIPS 2019 Workshop on AI for HADR. Winners of any cash prize will be considered eligible to be awarded follow-on work with the Department of Defense. The competition will start in September 2019 and run through November 2019.

Findings will be applied in both operational and academic use cases that include, but are not limited to: obstructed roads, rerouting across obstructed roads, force of nature identification, resource allocation decision-making, object recognition, and object identification . Baseline models, developed collaboratively between DIU and Carnegie Mellon’s Software Engineering Institute, will be publicly available as a starting point for the Challenge. In addition to advancing the state of the art in damage assessment, it is envisioned that the xBD dataset will provide researchers, companies, and other groups with the means and motive to develop algorithms that bring humanitarian assistance and disaster response into the age of AI.

The Challenge’s partners represent a first-of-its-kind coalition between the artificial intelligence and disaster response communities including NASA Earth Science Disasters Program, Federal Emergency Management Agency’s Region 9, California Governor’s Office of Emergency Services (CAL OES), California Department of Forestry and Fire Protection (CAL FIRE), California Guard, DoD’s Joint Artificial Intelligence Center, Carnegie Mellon ‘s Software Engineering Institute, the United States Geological Service, National Geospatial-Intelligence Agency, and the National Security Innovation Network (NSIN).

Additional details about the dataset can be found at (CVPR 2019 xBD Paper) which was first discussed at the IEEE CVPR in June . Terms and rules of the Challenge can be found on the xView2 website, xview2.org/terms.

Visit xview2.org to register.

submitted by /u/nirav_diu
[link] [comments]

[R] Adversarial point perturbations on 3D objects

Written on August 21, 2019. Posted in Reddit MachineLearning.

I am a high school student, and I’ve been working on a very niche field featuring the intersection of 3D deep learning and adversarial robustness.

My recent paper is on generating adversarial point sets against neural networks like PointNet. In total, there are four novel attacks in two categories: distributional and shape attacks. Distributional attacks barely change the shape of a point set because we use the shape-aware Hausdorff distance instead of a p-norm. Shape attacks are focused on changing the shape (that results in changes to the point clouds), which is realistic and robust against point removal defenses to restore an adversarial point cloud that some other groups and I have previously proposed. Note that these shape attacks do not need any extra info other than the raw 3D points.

You can read about point set adversarial attacks in my blog post: https://blog.liudaniel.com/birth-of-a-new-sub-sub-field

Or read my latest paper: https://arxiv.org/abs/1908.06062

submitted by /u/c0deb0t
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Category: Reddit MachineLearning

[P] r/datascienceproject is live

[R] A collection of notes on 150 papers in neural dialog modeling and related areas

[D] How do you handle the high uncertainty of your timeline/deadline for delivering a ml/dl product?

[D] Attention layer yields inconsistent results

Automatic Data Augmentation for NLP task

[D] Where can I find CNN baseline architectures for MNIST, CIFAR-10?

[P] OpenGPT-2: We Replicated GPT-2 Because You Can Too

[D] Can you use LBFGS to solve Huber linear regression problems?

[N] Announcing the xView 2 Prize Challenge: Assess Building Damage

[R] Adversarial point perturbations on 3D objects