[D] Two questions about deep q learning

https://i.redd.it/9eifb59zaoe31.png

I have two questions (in bold).

I want to use two neural networks to calculate the Q-values for my current state (a board game). The output uses a sigmoid function. I only correct the output for the action with the highest value, max Q, which I obtain from the target network (the targets for the remaining outputs are set to the original outputs of the DQN). **Is this the correct approach, or should I correct all output values in one iteration?** Second question: how do I calculate my target value r + b max Q(s(prime….)? Should I use fixed reward values? **Do I have to choose reward values so that the target cannot exceed the possible output range of the sigmoid function?** (Like 0.4 or 0.2.) Thank you.
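For reference, here is a minimal NumPy sketch of how the targets described above are commonly built in a textbook DQN setup. It is not taken from the post: the function name `build_targets`, the argument names, and the use of `gamma` for the discount factor (written `b` in the question) are assumptions made for illustration. In this sketch only the output for the action actually taken receives the target r + gamma * max Q_target(s', a'); every other output keeps the network's own prediction, so its error is zero. With a sigmoid output, Q-values are at most 1, so keeping non-negative rewards at or below 1 - gamma is one sufficient way to keep the target inside the sigmoid's range.

```python
import numpy as np

def build_targets(q_online, q_target_next, actions, rewards, dones, gamma=0.9):
    """Build training targets for a network with one Q output per action.

    q_online:      (batch, n_actions) online-network outputs for state s
    q_target_next: (batch, n_actions) target-network outputs for state s'
    actions:       (batch,) index of the action taken in s
    rewards:       (batch,) reward received for that action
    dones:         (batch,) True where s' is terminal
    gamma:         discount factor (assumed to be the "b" in the question)
    """
    q_online = np.asarray(q_online, dtype=float)
    q_target_next = np.asarray(q_target_next, dtype=float)
    actions = np.asarray(actions, dtype=int)
    rewards = np.asarray(rewards, dtype=float)
    dones = np.asarray(dones, dtype=bool)

    # Start from the network's own outputs: untouched actions get zero error.
    targets = q_online.copy()

    # Bootstrapped value of the next state; zero if the episode ended.
    bootstrap = np.where(dones, 0.0, gamma * q_target_next.max(axis=1))

    # Overwrite only the entry of the action that was actually taken.
    targets[np.arange(len(actions)), actions] = rewards + bootstrap
    return targets

def max_safe_reward(gamma):
    """Largest non-negative reward that keeps r + gamma * Q <= 1 when Q <= 1."""
    return 1.0 - gamma

example = build_targets(
    q_online=[[0.2, 0.5, 0.3]],
    q_target_next=[[0.4, 0.6, 0.1]],
    actions=[1],
    rewards=[0.1],
    dones=[False],
    gamma=0.9,
)
print(example)
```

In the example call, the reward of 0.1 equals `max_safe_reward(0.9)` up to floating-point rounding, so the target stays inside (0, 1). An alternative that avoids the constraint is a linear output layer, at the cost of unbounded Q-value estimates.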

submitted by /u/Kralex68


Toronto AI is a social and collaborative hub that unites AI innovators in Toronto and the surrounding areas. We explore AI technologies in digital art and music, healthcare, marketing, fintech, VR, robotics, and more. Toronto AI was founded by Dave MacDonald and Patrick O'Mara.