For those struggling to understand automatic differentiation (AD) and dual numbers, this article might help :)
If I understand correctly, AD essentially computes the derivative of each primitive operator with an analytically exact method (dual numbers, for example) and combines the results using the chain rule. This is feasible because programmable functions are built from “primitive” operators whose derivatives we can compute algorithmically (or analytically).
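To make that concrete, here is a rough sketch of forward-mode AD with dual numbers in Python. It is not taken from the linked article; the names `Dual`, `_lift`, `sin`, `derivative`, and the sample function `f` are all made up for illustration, and only `+`, `*`, and `sin` are supported. Each primitive carries its own exact derivative rule, and the rules compose through the chain rule as the function is evaluated.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Dual:
    """Hypothetical dual number val + der*eps, where eps**2 == 0."""
    val: float        # value of the expression
    der: float = 0.0  # derivative of the expression w.r.t. the input

    def __add__(self, other):
        other = _lift(other)
        # Sum rule: (u + v)' = u' + v'
        return Dual(self.val + other.val, self.der + other.der)

    __radd__ = __add__

    def __mul__(self, other):
        other = _lift(other)
        # Product rule: (u * v)' = u' * v + u * v'
        return Dual(self.val * other.val,
                    self.der * other.val + self.val * other.der)

    __rmul__ = __mul__


def _lift(x):
    """Treat plain numbers as constants (derivative 0)."""
    return x if isinstance(x, Dual) else Dual(float(x))


def sin(x):
    """Primitive with a known derivative: (sin u)' = cos(u) * u'."""
    x = _lift(x)
    return Dual(math.sin(x.val), math.cos(x.val) * x.der)


def derivative(f, x):
    """Seed the input with derivative 1 and read off the result's derivative."""
    return f(Dual(x, 1.0)).der


def f(x):
    # Sample function built only from the primitives defined above.
    return x * x + 3 * x + sin(x)
```

Calling `derivative(f, 2.0)` should agree with the hand-derived `2*x + 3 + cos(x)` at `x = 2.0` up to floating-point rounding, with no finite-difference step size involved.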
edit:
noticed the author has another multivariate example.
edit 2:
This was also helpful.
submitted by /u/kevinccccccc