[D] Prior knowledge on Actor-critic / policy gradient methods for portfolio allocation

Hey guys,

So, I have to solve a portfolio allocation problem, which can be formulated as:

Given an input (financial indicators), output a vector of asset weights (that sum to 1) in order to maximize a “performance function”.
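To make the formulation concrete, here is a minimal sketch of what I mean. Everything in it is an assumption for illustration: the linear policy `allocate`, the parameter names `W` and `b`, the Sharpe-like `performance` function, and the random data all stand in for whatever the real model and objective turn out to be. The softmax is just one simple way to guarantee the weights are positive and sum to 1.

```python
import numpy as np

def softmax(z):
    # Subtract the max for numerical stability; the output sums to 1.
    e = np.exp(z - np.max(z))
    return e / e.sum()

def allocate(indicators, W, b):
    """Hypothetical linear policy: financial indicators -> asset weights."""
    return softmax(W @ indicators + b)

def performance(weights, asset_returns):
    """Assumed performance function: mean over std of portfolio returns."""
    portfolio = asset_returns @ weights  # one portfolio return per period
    return portfolio.mean() / (portfolio.std() + 1e-8)

# Synthetic placeholder data, not real market data.
rng = np.random.default_rng(0)
n_indicators, n_assets = 5, 3
W = rng.normal(scale=0.1, size=(n_assets, n_indicators))
b = np.zeros(n_assets)
x = rng.normal(size=n_indicators)

w = allocate(x, W, b)
returns = rng.normal(loc=0.001, scale=0.01, size=(50, n_assets))
score = performance(w, returns)
print(w, w.sum(), score)
```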

Translating this formulation into an RL problem seems pretty straightforward. However, I don’t have much data (a couple of hundred data points). So I was wondering whether it is possible to incorporate prior knowledge in order to train better with less data.

Can I incorporate knowledge by using a “custom” advantage function in Actor-critic? What about using Bayesian policy gradient / Actor-critic?
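One possible reading of a “custom” advantage, sketched below under loud assumptions: measure each sampled allocation against a fixed prior allocation (here equal weights, chosen purely as an example of prior knowledge) instead of against a learned critic. The Gaussian-on-logits policy, the one-period `reward`, and all names (`prior_advantage`, `policy_gradient_step`, `sigma`, `lr`) are hypothetical. Because the prior's return does not depend on the sampled action (assuming the allocation does not move prices), it acts as a baseline and should not bias the policy gradient. This does not cover the Bayesian variants.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - np.max(z))
    return e / e.sum()

def reward(weights, next_returns):
    # Assumed reward: one-period portfolio return.
    return float(weights @ next_returns)

def prior_advantage(weights, next_returns):
    """Advantage relative to a prior allocation (equal weights, an assumption)."""
    prior = np.full(len(weights), 1.0 / len(weights))
    return reward(weights, next_returns) - reward(prior, next_returns)

def policy_gradient_step(W, x, next_returns, rng, sigma=0.5, lr=0.1):
    """One REINFORCE-style update for a Gaussian policy over logits."""
    mu = W @ x                                  # mean logits from indicators
    z = mu + sigma * rng.normal(size=mu.shape)  # sampled logits (the action)
    adv = prior_advantage(softmax(z), next_returns)
    # Gradient of log N(z; mu, sigma^2 I) with respect to W, where mu = W x.
    grad_log_pi = np.outer((z - mu) / sigma**2, x)
    return W + lr * adv * grad_log_pi, adv

# Synthetic placeholder inputs, not real market data.
rng = np.random.default_rng(0)
W = np.zeros((3, 4))
x = np.ones(4)
r = np.array([0.01, -0.02, 0.03])
W_new, adv = policy_gradient_step(W, x, r, rng)
print(adv)
```

With only a few hundred data points, the appeal of this kind of baseline is that it has no parameters to fit, whereas a learned critic would need data of its own.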

Does that make sense?

Thanks!

submitted by /u/tutorialboys