Author: torontoai

[P] Loss for restricted boltzmann machines

Written on November 28, 2019. Posted in Reddit MachineLearning.

Hello guys, I am trying to implement this paper. They used restricted boltzmann machines to get evolution of a system. Due to nature of the problem “activation” function is not sigmoid but tanh which is later transformed into probability. Different activation function means that gradient is different, although in the paper they wrote “standard” gradient equations. When I tried to train model with standard gradient equations I couldn’t get any meaningful results.I am aware that RBM minimizes KL divergence, but I couldn’t figure out how to implement that (how to get probability function). After trying to find what should I put for loss function when training with eg. Adam I stumbled upon some random github code which for loss function had:

L = energy(data_vec) – energy(genrated_vec)

(yes without absolute value). Well, I plugged that in and it worked like a charm. When I tried to put absolute value over loss it did not worked. I am having trouble understanding why is this correct loss for RBM and I would appreciate help 🙂

submitted by /u/matej1408
[link] [comments]

[D] What is SOTA for image manipulation detection?

Written on November 28, 2019. Posted in Reddit MachineLearning.

What is sota for image manipulation detection? I have not seen much work in this field happening, does anybody know the reason for that? The only recent paper have seen is from CVPR2018 but I thought there would be more work being done?

Thanks for your help!

submitted by /u/davidblyrics
[link] [comments]

[D]What is the latest on this Fujitsu Laboratories announcement from Oct?

Written on November 28, 2019. Posted in Reddit MachineLearning.

I found this :

https://www.fujitsu.com/global/about/resources/news/press-releases/2019/1015-01.html

Looks like it was posted about a month ago, I haven’t really found much more info on it, was trying to see if it was anything groundbreaking, the details provided are slim to none. Just gives a basic overview of how neural networks seem to work, and some fancy terms for normalization/transformations/landmark detection. Thanks.

They make some bold claims:

This technology has achieved a high detection accuracy rate of 81% even with limited training data.

This technology is also more accurate than other existing technologies according to certain facial expression recognition technology benchmarks

submitted by /u/FreckledMil
[link] [comments]

[D] Amd or Intel CPU for deep learning server?

Written on November 28, 2019. Posted in Reddit MachineLearning.

Building a Deep Learning home server with 4x 2080 ti blower style GPUs and I am wondering what CPU to get for this machine. The delima is that I am using python Pytorch and Numpy which has a lot of support with Intels MLK packages that sabotage AMD performance. However, I am utilizing the GPUs and CUDA technology for the matrix multiplication and can get away with Numpy Openblas for my environments on the AMD machine. I’m trying to keep this project under $8000 and want the best performance. Also, I am primary using DDPG and DQN networks. What would you do?

Type	Item	Price
CPU	AMD Threadripper 3960X 3.8 GHz 24-Core Processor	–
CPU Cooler	be quiet! Dark Rock Pro TR4 59.5 CFM CPU Cooler	–
Motherboard	ASRock TRX40 Creator ATX sTRX4 Motherboard	–
Memory	G.Skill Ripjaws V Series 16 GB (2 x 8 GB) DDR4-3200 Memory	$59.99 @ Newegg
Storage	HP EX950 1 TB M.2-2280 NVME Solid State Drive	$129.99 @ Newegg
Video Card	PNY GeForce RTX 2080 Ti 11 GB Blower Video Card	–
Case	Corsair Air 540 ATX Mid Tower Case	$129.98 @ Newegg
Power Supply	EVGA SuperNOVA P2 1600 W 80+ Platinum Certified Fully Modular ATX Power Supply	–
Prices include shipping, taxes, rebates, and discounts
Total (before mail-in rebates)	$339.96
Mail-in rebates	-$20.00
Total	$319.96
Generated by PCPartPicker 2019-11-29 12:22 EST-0500

submitted by /u/thinking_computer
[link] [comments]

Software Developer (Machine Learning) – Luci.AI Inc. – North York, ON

Written on November 28, 2019. Posted in Toronto Job Postings.

Experience with machine learning frameworks – PyTorch, Caffe2. Grounding in Deep Learning theory and recent advances, e.g. CNN, Yolo.
From Indeed – Fri, 29 Nov 2019 17:11:23 GMT – View all North York, ON jobs

[P] Detecting Sarcasm – How good are humans compared to machines?

Written on November 28, 2019. Posted in Reddit MachineLearning.

I’m developing a tool to use machine learning and sentiment analysis to determine the usefulness of context when parsing sarcasm.

In order to evaluate the success of the tool, I’m looking for human responses to a few examples from SARC 2.0 (https://arxiv.org/abs/1704.05579), a corpus of Reddit comments.

If you can spare a few minutes to answer this Google form (no login required), please help me out!

submitted by /u/CHR1597
[link] [comments]

[P] (Early Stage) kaggledatasets will make things easy – Looking for Contributions & Reviews

Written on November 28, 2019. Posted in Reddit MachineLearning.

kaggledatasets: Collection of Kaggle Datasets ready for everyone to use.

It is a Python package which will contain several high voted kaggle datasets made available in easy to use format with special support for frameworks like Tensorflow 2.0 and PyTorch.

Every beginner in ML/DL/Data Science struggles in the initial stages of Data Loading and Pre-processing. Kaggle being the gold mine of datasets, it’s necessary to make it easy to use for everyone.

For the folks who have used tf.datasets or torchvision datasets, this is something similar to that and only for Kaggle 🙂

And hold on, support for more datasets and more types is on the way. Feel free to contribute and add datasets. Also, looking for experienced developers who can contribute to the architecture of the project or provide reviews for improvement.

GitHub: https://github.com/kaggledatasets/kaggledatasets

submitted by /u/op_prabhuomkar
[link] [comments]

[D] ML Paper Notes: My notes of various ML research papers (DL – CV – NLP)

Written on November 28, 2019. Posted in Reddit MachineLearning.

Hello,

As a PhD student, I read quite a lot of papers, and sometimes I make short summaries with a simple latex template to get a better understanding and have clearer idea of the paper’s contributions. For a while I stored them in private Github repo, so I tought why note share them, some people might find them helpful.

PS: Sorry for the (sometimes frequent) spelling mistakes.

Here is the Github link: https://github.com/yassouali/ML_paper_notes

submitted by /u/youali
[link] [comments]

[P] Introducing Valohai – The managed platform for building production-scale machine learning.

Written on November 28, 2019. Posted in Reddit MachineLearning.

Hi everyone!

We’re Valohai, a platform for running experiments in the cloud with full version control.

It would be great if you’d sign up for a free account and gave us your feedback.

Some of the features of Valohai:

Tech-agnostic by design. Whatever language or library you are using, we can run it
Cloud-agnostic by design. Run the same project easily in Azure, AWS, GCP or on-premise hardware
Experiments are fully versioned and reproducible. All code, data, logs, metrics, models, and environments are stored for later use and other team members to see
Cloud instances are started and stopped automatically. Don’t waste your money on those idle machines!
Link your git or GitHub repository directly for automatic code pull
Jupyter notebook support using the Jupyhai plugin
For more advanced users, we have a robust CLI and a fully documented API

More info in our website and docs.

submitted by /u/keely
[link] [comments]

[R] Graph Neural Ordinary Differential Equations

Written on November 28, 2019. Posted in Reddit MachineLearning.

Paper: https://arxiv.org/abs/1911.07532

Github, blog

Abstract: We extend the framework of graph neural networks (GNN) to continuous time. Graph neural ordinary differential equations (GDEs) are introduced as the counterpart to GNNs where the input–output relationship is determined by a continuum of GNN layers. The GDE framework is shown to be compatible with the majority of commonly used GNN models with minimal modification to the original formulations. We evaluate the effectiveness of GDEs on both static as well as dynamic datasets: results prove their general effectiveness even in cases where the data is not generated by continuous time processes.

submitted by /u/soshiheart
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Author: torontoai

[P] Loss for restricted boltzmann machines

[D] What is SOTA for image manipulation detection?

[D]What is the latest on this Fujitsu Laboratories announcement from Oct?

[D] Amd or Intel CPU for deep learning server?

Software Developer (Machine Learning) – Luci.AI Inc. – North York, ON

[P] Detecting Sarcasm – How good are humans compared to machines?

[P] (Early Stage) kaggledatasets will make things easy – Looking for Contributions & Reviews

[D] ML Paper Notes: My notes of various ML research papers (DL – CV – NLP)

[P] Introducing Valohai – The managed platform for building production-scale machine learning.

[R] Graph Neural Ordinary Differential Equations