Skip to main content


Learn About Our Meetup

5000+ Members



Join our meetup, learn, connect, share, and get to know your Toronto AI community. 



Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.



Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

[D] Requirements for a fast model-building algorithm in one-shot model-based reinforcement learning

Comparision of algorithms for the fast extraction of a model from real world observations to be used for predicting rewards at different future timespans.

Requirements:​ * Time – Has memory of at least 20 steps so that it can handle temporal sequences * 1sht – Can learn from a single example so that it doesn’t need hundreds of training samples for each class * Hier – Is hierarchical so that it generalizes well (not just flat memorization) * Arch – Can learn the architecture from data so that it doesn’t need to be predefined by the developers * Curr – Has curriculum learning so that it can be trained successively and doesn’t suffer from catastrophic forgetting * Scal – Can be scaled up to at least 1 million inputs so that it’s not limited to toy environments

Algo Time 1sht Hier Arch Curr Scal
NNGP 🚫 🚫
THSOM 🚫 🚫 🚫
BPTT 🚫 🚫 🚫
GA 🚫 🚫 🚫
HTM 🚫 🚫

Candidate algorithms: * NNGP – Nearest Neighbor Gaussian Processes * GHSOM – Growing Hierarchical Self-Organizing Map * THSOM – Temporal Hebbian Self-organizing Map * BPTT – Recurrent Neural Networks trained with Backpropagation Through Time, for example * GA – Genetic Algorithms * HTM – Hierarchical Temporal Memory or in German

The table probably has errors because I’m not an expert and just wanna watch progress in AGI. But the current backprop winter is boring me, and if no one else is taking the initiative then an outsider from the audience has to.

As I don’t understand the math in the paper for NNGPs, I’m assuming that they are just a hierarchical version of the simple nearest neighbor algorithm. Or that the two SOM-descendants are just standard self-organizing maps plus some fancy extensions for hierarchical architecture and time.

Drop a note if you find an error and I will fix the table.

submitted by /u/wlorenz65
[link] [comments]