Journal of Computers, Vol 2, No 1 (2007), 12-19, Feb 2007
doi:10.4304/jcp.2.1.12-19

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

Nicolas Chapados, Yoshua Bengio

Abstract


We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an application in financial portfolio management where we can train a controller to directly optimize a Sharpe Ratio (or other risk-averse non-additive) utility function. We illustrate the approach by demonstrating experimental results using a kernel-based controller architecture that would not normally be considered in traditional reinforcement learning or approximate dynamic programming. We further show that using a non-additive criterion (incremental Sharpe Ratio) yields a noisy K-best-paths extraction problem, that can give substantially improved performance.



References



Full Text: PDF


Journal of Computers (JCP, ISSN 1796-203X)

Copyright @ 2006-2012 by ACADEMY PUBLISHER – All rights reserved.