Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

Author: torontoai

[P] Loss for restricted boltzmann machines

Hello guys, I am trying to implement this paper. They used restricted boltzmann machines to get evolution of a system. Due to nature of the problem “activation” function is not sigmoid but tanh which is later transformed into probability. Different activation function means that gradient is different, although in the paper they wrote “standard” gradient equations. When I tried to train model with standard gradient equations I couldn’t get any meaningful results.I am aware that RBM minimizes KL divergence, but I couldn’t figure out how to implement that (how to get probability function). After trying to find what should I put for loss function when training with eg. Adam I stumbled upon some random github code which for loss function had:

L = energy(data_vec) – energy(genrated_vec)

(yes without absolute value). Well, I plugged that in and it worked like a charm. When I tried to put absolute value over loss it did not worked. I am having trouble understanding why is this correct loss for RBM and I would appreciate help 🙂

submitted by /u/matej1408
[link] [comments]

[D]What is the latest on this Fujitsu Laboratories announcement from Oct?

I found this :

https://www.fujitsu.com/global/about/resources/news/press-releases/2019/1015-01.html

Looks like it was posted about a month ago, I haven’t really found much more info on it, was trying to see if it was anything groundbreaking, the details provided are slim to none. Just gives a basic overview of how neural networks seem to work, and some fancy terms for normalization/transformations/landmark detection. Thanks.

They make some bold claims:

This technology has achieved a high detection accuracy rate of 81% even with limited training data.

This technology is also more accurate than other existing technologies according to certain facial expression recognition technology benchmarks

submitted by /u/FreckledMil
[link] [comments]

[D] Amd or Intel CPU for deep learning server?

Building a Deep Learning home server with 4x 2080 ti blower style GPUs and I am wondering what CPU to get for this machine. The delima is that I am using python Pytorch and Numpy which has a lot of support with Intels MLK packages that sabotage AMD performance. However, I am utilizing the GPUs and CUDA technology for the matrix multiplication and can get away with Numpy Openblas for my environments on the AMD machine. I’m trying to keep this project under $8000 and want the best performance. Also, I am primary using DDPG and DQN networks. What would you do?

Type Item Price
CPU AMD Threadripper 3960X 3.8 GHz 24-Core Processor
CPU Cooler be quiet! Dark Rock Pro TR4 59.5 CFM CPU Cooler
Motherboard ASRock TRX40 Creator ATX sTRX4 Motherboard
Memory G.Skill Ripjaws V Series 16 GB (2 x 8 GB) DDR4-3200 Memory $59.99 @ Newegg
Storage HP EX950 1 TB M.2-2280 NVME Solid State Drive $129.99 @ Newegg
Video Card PNY GeForce RTX 2080 Ti 11 GB Blower Video Card
Case Corsair Air 540 ATX Mid Tower Case $129.98 @ Newegg
Power Supply EVGA SuperNOVA P2 1600 W 80+ Platinum Certified Fully Modular ATX Power Supply
Prices include shipping, taxes, rebates, and discounts
Total (before mail-in rebates) $339.96
Mail-in rebates -$20.00
Total $319.96
Generated by PCPartPicker 2019-11-29 12:22 EST-0500

submitted by /u/thinking_computer
[link] [comments]

[P] Detecting Sarcasm – How good are humans compared to machines?

I’m developing a tool to use machine learning and sentiment analysis to determine the usefulness of context when parsing sarcasm.

In order to evaluate the success of the tool, I’m looking for human responses to a few examples from SARC 2.0 (https://arxiv.org/abs/1704.05579), a corpus of Reddit comments.

If you can spare a few minutes to answer this Google form (no login required), please help me out!

submitted by /u/CHR1597
[link] [comments]

[P] (Early Stage) kaggledatasets will make things easy – Looking for Contributions & Reviews

kaggledatasets: Collection of Kaggle Datasets ready for everyone to use.

It is a Python package which will contain several high voted kaggle datasets made available in easy to use format with special support for frameworks like Tensorflow 2.0 and PyTorch.

Every beginner in ML/DL/Data Science struggles in the initial stages of Data Loading and Pre-processing. Kaggle being the gold mine of datasets, it’s necessary to make it easy to use for everyone.

For the folks who have used tf.datasets or torchvision datasets, this is something similar to that and only for Kaggle 🙂

And hold on, support for more datasets and more types is on the way. Feel free to contribute and add datasets. Also, looking for experienced developers who can contribute to the architecture of the project or provide reviews for improvement.

GitHub: https://github.com/kaggledatasets/kaggledatasets

submitted by /u/op_prabhuomkar
[link] [comments]

[D] ML Paper Notes: My notes of various ML research papers (DL – CV – NLP)

Hello,

As a PhD student, I read quite a lot of papers, and sometimes I make short summaries with a simple latex template to get a better understanding and have clearer idea of the paper’s contributions. For a while I stored them in private Github repo, so I tought why note share them, some people might find them helpful.

PS: Sorry for the (sometimes frequent) spelling mistakes.

Here is the Github link: https://github.com/yassouali/ML_paper_notes

submitted by /u/youali
[link] [comments]

[P] Introducing Valohai – The managed platform for building production-scale machine learning.

Hi everyone!

We’re Valohai, a platform for running experiments in the cloud with full version control.

It would be great if you’d sign up for a free account and gave us your feedback.

Some of the features of Valohai:

  • Tech-agnostic by design. Whatever language or library you are using, we can run it
  • Cloud-agnostic by design. Run the same project easily in Azure, AWS, GCP or on-premise hardware
  • Experiments are fully versioned and reproducible. All code, data, logs, metrics, models, and environments are stored for later use and other team members to see
  • Cloud instances are started and stopped automatically. Don’t waste your money on those idle machines!
  • Link your git or GitHub repository directly for automatic code pull
  • Jupyter notebook support using the Jupyhai plugin
  • For more advanced users, we have a robust CLI and a fully documented API

More info in our website and docs.

submitted by /u/keely
[link] [comments]

[R] Graph Neural Ordinary Differential Equations

Paper: https://arxiv.org/abs/1911.07532

Github, blog

Abstract: We extend the framework of graph neural networks (GNN) to continuous time. Graph neural ordinary differential equations (GDEs) are introduced as the counterpart to GNNs where the input–output relationship is determined by a continuum of GNN layers. The GDE framework is shown to be compatible with the majority of commonly used GNN models with minimal modification to the original formulations. We evaluate the effectiveness of GDEs on both static as well as dynamic datasets: results prove their general effectiveness even in cases where the data is not generated by continuous time processes.

submitted by /u/soshiheart
[link] [comments]