Machine Learning (OpenCV) Engineer – Step by Step Professional Services – North York, ON
From Step by Step Professional Services – Mon, 21 Oct 2019 16:53:19 GMT – View all North York, ON jobs
Are there embeddings for seq2seq tasks? I have a very large vocab and I want to feed a fixed-length vector rather than one-hot vector to the input of the neural network (and for the output)
submitted by /u/omgDoYouKnow
[link] [comments]
Interesting research from DeepMind:
“Our new work on memory uses a neural network’s weights as fast and compressive associative storage. Reading from the memory is performed by approximate minimization of the energy modeled by the network.”
“Unlike classical associative memory models such as Hopfield networks, we are not limited in the expressivity of our energy model, and make use of the deep architectures with fully-connected, convolutional and recurrent layers.”
“For this to work, stored patterns must be local minima of the energy. We use recent advances in gradient-based meta-learning to write into the memory such that this requirement approximately holds.”
submitted by /u/Quantum_Network
[link] [comments]
Les lauréats du prix A.M. Turing 2018 de l’ACM, Geoffrey Hinton, Yoshua Bengio et Yann LeCun, ont reçu le prix A.M. Turing de l’ACM au banquet de remise des prix 2019 de l’ACM, à San Francisco, au mois de juin.
À la conférence FCRC de l’ACM à Phœnix, Hinton et LeCun ont prononcé leur conférence Turing : « La révolution de l’apprentissage profond » et « La révolution de l’apprentissage numérique : La suite ». | Vidéo
Les trois lauréats ont fait la couverture du numéro de juin 2019 de la revue Communications of the ACM | Lisez ici
Cette année, la Banque d’Angleterre a rendu hommage à Alan Turing pour ses travaux d’avant-garde sur les ordinateurs, ainsi que pour ses contributions lors de la Deuxième Guerre mondiale, y compris la « bombe de Turing », l’un des outils principaux utilisés pour décrypter les messages codés à l’aide d’Enigma; son portrait figurera sur les billets de 50 £. Pendant le mois de la fierté gaie cette année, le New York Times a aussi rendu hommage à Alan Turing pour ses idées qui ont contribué à la victoire pendant la Deuxième Guerre mondiale, et les épreuves qu’il a traversées relativement à sa sexualité : Overlooked No More: Alan Turing, Condemned Code Breaker and Computer Visionary
Cet article a été publié dans le Bulletin IACan. Abonnez-vous à la publication électronique bimestrielle pour rester au fait des plus récentes nouvelles en IA au Canada.
I’d like to annotate source code (mainly PHP/HTML/JS), highlight some parts of the docs and apply tags. Is anyone aware of this kind of tool?
submitted by /u/data-soup
[link] [comments]
AI agents have learned to play Dota, StarCraft, and Go, by training to beat an
automated system that increases in difficulty as the agent gains skill at the
game: in vanilla self-play, the AI agent plays games against itself, while in
population-based training, each agent must play against a population of other
agents, and the entire population learns to play the game.
This technique has a lot going for it. There is a natural curriculum in
difficulty: as the agent improves, the task it faces gets harder, which leads
to efficient learning. It doesn’t require any manual design of opponents, or
handcrafted features of the environment. And most notably, in all of the games
above, the resulting agents have beaten human champions.
The technique has also been used in collaborative settings: OpenAI had one
public match where each team was composed of three OpenAI Five agents alongside
two human experts, and the For The Win (FTW) agents trained to play Quake were
paired with both humans and other agents during evaluation. In the Quake
case, humans rated the FTW agents as more collaborative than fellow humans
in a participant survey.
We are open-sourcing pgANN – an ANN (approx nearest neighbor) approach with a PostgreSQL backend. The key differentiator between pgANN and the rest (FAISS, Annoy,NearPy etc) is:
We use it internally to QA images and find it consistently provides sub-second query performance with a few million rows of vectors on a 32GB.8 vcpu Ubuntu box and can reasonably be expected to scale-up with normal pgsql scaling techniques. We invite the community to give this a try and share feedback.
submitted by /u/bluzkluz
[link] [comments]
http://arxiv.org/abs/1910.08489
Hello, this is my first paper preprint:
Privacy-preserving Federated Bayesian Learning of a Generative Model for Imbalanced Classification of Clinical Data
Though it is not fully biased toward deep learning, or federated learning on a edge device, I made a new framework for learning a global model in a horizontally distributed setting, especially in a clinical field.
AFAIK, it is the first trial to apply Approximate Bayesian Computation(ABC) on federated learning.
(If not, please let me know!)
Without complicated perturbation techniques e.g. Differential Privacy, Homomorphic encryption, Hashing, etc., the proposed method can preserve privacy.
As I said in the paper, unless each local site reveals trained weights and the structure of Autoencoder, shrunk data CANNOT be recovered in the central server. Also not possible even if some local sites conspire against the other site to disclose the information.
Plus, a global model can be learned in the central server with the minimal information (merely with a distance between local data (perturbed via Autoencoder) and generated data (same dim. with the perturbed one)).
Welcome and thank you in advance for any feedback and questions!
submitted by /u/vaseline555
[link] [comments]
In recent years I have heard of key advances made in NLP by transformer models and BERT. Then there was this paper on neural ODEs by DeepMind.
I have been wanting to dig deeper into the details and understand the key ideas behind these hot topics. They are not directly relevant to my work (which is focused mainly on images/videos) but I still feel it is important as an ML engineer to keep myself up-to-date with key developments in diverse areas. However because it does not directly relate to my work, I find it hard to find the time to get a deeper understanding of these papers.
Has anyone found themselves in a similar situation? How do you deal with it?
submitted by /u/nivter
[link] [comments]
I am currently preparing for ML Engineer interviews and wanted to learn a design methodology to crack the design rounds. I have looked for standard resources but couldn’t find any. So I thought of coming up with one on my own from my own experiences and material that I have read online. So below is an overview of the approach I have in my mind. Please let me know what you think of it.
submitted by /u/kireeti_
[link] [comments]