Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

[R] Mental Simulation with Self-Supervised Spatiotemporal Learning

Hi all, I’d like to share my undergraduate thesis on Mental Simulation with Self-Supervised Spatiotemporal Learning. We propose that one way to understand mental simulation in humans is to approach it as a problem of video prediction. The code is based off the recent ICLR 2019 paper Eidetic 3D LSTM: A Model for Video Prediction and Beyond.

Paper: https://github.com/kevinstan/video_prediction/blob/master/paper/mental_sim.pdf

Code: https://github.com/kevinstan/video_prediction

Abstract: Mental simulation — the capacity to imagine objects and scenes in order to make decisions, predictions, and inferences about the world — is a key feature of human cognition. Evidence from behavioral studies suggest that representations of visual imagery are spatial and sensitive to the causal structure of the world. Inspired by how humans anticipate future scenes, we aim to leverage state-of-the-art techniques in deep learning and computer vision to tackle the problem of spatiotemporal predictive learning in a self-supervised manner. We perform explorations across three architectural design choices: (i) the importance of 2D-convolution vs. 3D-convolution inside the cell of recurrent neural networks, (ii) the effectiveness of residual connections in stacked long short-term memory models for remembering spatial information over long time horizons, and (iii) the balance between $l_1$ norm and $l_2$ norm components in the objective function. Our extensive evaluations demonstrate that finetuning with residual connections achieves state-of-the-art performance on the Moving MNIST and KTH Action benchmark datasets. Potential application areas include weather forecasting, traffic flow prediction, and physical interaction simulation.

Comments and feedback are highly appreciated. Thanks!

submitted by /u/computeisallyouneed
[link] [comments]