Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

Author: torontoai

[R] Meta-Learning Deep Energy-Based Memory Models

Interesting research from DeepMind:

“Our new work on memory uses a neural network’s weights as fast and compressive associative storage. Reading from the memory is performed by approximate minimization of the energy modeled by the network.”

“Unlike classical associative memory models such as Hopfield networks, we are not limited in the expressivity of our energy model, and make use of the deep architectures with fully-connected, convolutional and recurrent layers.”

“For this to work, stored patterns must be local minima of the energy. We use recent advances in gradient-based meta-learning to write into the memory such that this requirement approximately holds.”

https://arxiv.org/abs/1910.02720

submitted by /u/Quantum_Network
[link] [comments]

Bengio, Hinton et LeCun acceptent le prix A.M. Turing 2018 de l’ACM à San Francisco

Les lauréats du prix A.M. Turing 2018 de l’ACM, Geoffrey Hinton, Yoshua Bengio et Yann LeCun, ont reçu le prix A.M. Turing de l’ACM au banquet de remise des prix 2019 de l’ACM, à San Francisco, au mois de juin.

 

À la conférence FCRC de l’ACM à Phœnix, Hinton et LeCun ont prononcé leur conférence Turing : « La révolution de l’apprentissage profond » et « La révolution de l’apprentissage numérique : La suite ». | Vidéo

 

Les trois lauréats ont fait la couverture du numéro de juin 2019 de la revue Communications of the ACM | Lisez ici

 

Cette année, la Banque d’Angleterre a rendu hommage à Alan Turing pour ses travaux d’avant-garde sur les ordinateurs, ainsi que pour ses contributions lors de la Deuxième Guerre mondiale, y compris la « bombe de Turing », l’un des outils principaux utilisés pour décrypter les messages codés à l’aide d’Enigma; son portrait figurera sur les billets de 50 £. Pendant le mois de la fierté gaie cette année, le New York Times a aussi rendu hommage à Alan Turing pour ses idées qui ont contribué à la victoire pendant la Deuxième Guerre mondiale, et les épreuves qu’il a traversées relativement à sa sexualité : Overlooked No More: Alan Turing, Condemned Code Breaker and Computer Visionary

Cet article a été publié dans le Bulletin IACan. Abonnez-vous à la publication électronique bimestrielle pour rester au fait des plus récentes nouvelles en IA au Canada.

Collaborating with Humans Requires Understanding Them

AI agents have learned to play Dota, StarCraft, and Go, by training to beat an
automated system that increases in difficulty as the agent gains skill at the
game: in vanilla self-play, the AI agent plays games against itself, while in
population-based training, each agent must play against a population of other
agents, and the entire population learns to play the game.

This technique has a lot going for it. There is a natural curriculum in
difficulty: as the agent improves, the task it faces gets harder, which leads
to efficient learning. It doesn’t require any manual design of opponents, or
handcrafted features of the environment. And most notably, in all of the games
above, the resulting agents have beaten human champions.

The technique has also been used in collaborative settings: OpenAI had one
public match where each team was composed of three OpenAI Five agents alongside
two human experts, and the For The Win (FTW) agents trained to play Quake were
paired with both humans and other agents during evaluation. In the Quake
case
, humans rated the FTW agents as more collaborative than fellow humans
in a participant survey.

Continue reading

[Project] pgANN Fast Approximate Nearest Neighbor (ANN) searches with a PostgreSQL database.

We are open-sourcing pgANN – an ANN (approx nearest neighbor) approach with a PostgreSQL backend. The key differentiator between pgANN and the rest (FAISS, Annoy,NearPy etc) is:

  1. this enables “online” learning i.e. doesn’t require retraining with every CRUD, and
  2. works with extremely large datasets, since it’s not held in RAM like the others

We use it internally to QA images and find it consistently provides sub-second query performance with a few million rows of vectors on a 32GB.8 vcpu Ubuntu box and can reasonably be expected to scale-up with normal pgsql scaling techniques. We invite the community to give this a try and share feedback.

submitted by /u/bluzkluz
[link] [comments]

[D] Possible privacy attack method on data shrunk by autoencoder?

http://arxiv.org/abs/1910.08489

Hello, this is my first paper preprint:

Privacy-preserving Federated Bayesian Learning of a Generative Model for Imbalanced Classification of Clinical Data

Though it is not fully biased toward deep learning, or federated learning on a edge device, I made a new framework for learning a global model in a horizontally distributed setting, especially in a clinical field.

AFAIK, it is the first trial to apply Approximate Bayesian Computation(ABC) on federated learning.
(If not, please let me know!)

Without complicated perturbation techniques e.g. Differential Privacy, Homomorphic encryption, Hashing, etc., the proposed method can preserve privacy.

As I said in the paper, unless each local site reveals trained weights and the structure of Autoencoder, shrunk data CANNOT be recovered in the central server. Also not possible even if some local sites conspire against the other site to disclose the information.

  • But this is my hypothesis and expectation, so I want to listen to some feedback or opinions on this.
    Is it really impossible to make leakage on data shrunk by local autoencdoer?

Plus, a global model can be learned in the central server with the minimal information (merely with a distance between local data (perturbed via Autoencoder) and generated data (same dim. with the perturbed one)).

Welcome and thank you in advance for any feedback and questions!

submitted by /u/vaseline555
[link] [comments]

[D] How do you keep up with latest advances in ML which are not directly relevant to your work?

In recent years I have heard of key advances made in NLP by transformer models and BERT. Then there was this paper on neural ODEs by DeepMind.

I have been wanting to dig deeper into the details and understand the key ideas behind these hot topics. They are not directly relevant to my work (which is focused mainly on images/videos) but I still feel it is important as an ML engineer to keep myself up-to-date with key developments in diverse areas. However because it does not directly relate to my work, I find it hard to find the time to get a deeper understanding of these papers.

Has anyone found themselves in a similar situation? How do you deal with it?

submitted by /u/nivter
[link] [comments]

[D] Design methodology for ML engineering.

I am currently preparing for ML Engineer interviews and wanted to learn a design methodology to crack the design rounds. I have looked for standard resources but couldn’t find any. So I thought of coming up with one on my own from my own experiences and material that I have read online. So below is an overview of the approach I have in my mind. Please let me know what you think of it.

  1. Understand the use case.
    1. Figure out what can be solved deterministically
    2. Figure out what needs to be solved using ‘Data + Machine learning’
  2. Pose the problem(1.b) as a math problem.
    1. Come up with an objective that you want to optimize.
  3. Select the right data set.
    1. What data is needed. How is the labeling done/derived?
    2. How do you deal with bias, skewed classes,
  4. Feature Engineering.
    1. Think about what(information) you need to solve the problem optimally.
    2. What aspects from the data could you use to replicate findings in 4.a?
    3. Transform data into features and a form more apt for model’s learning
  5. Model selection:
    1. What model is best suited to solve the problem(2.a) at hand.
    2. Are there any off the shelf(direct or transfer learning) that would help.
  6. Training:
    1. How will you train
    2. How will you handle: skewed classes and other problems associated with the dataset.
    3. Validation: what metrics, experiments do you conduct to validate the model’s learning.
  7. Productionizing:
    1. How will the solution be deployed?
    2. Performance monitoring, feedback loop/retrain,
    3. Application scaling and model maintenance.

submitted by /u/kireeti_
[link] [comments]