Recap
- quick recap of bug fixes:
- train eval plot with and without market bound now works
- optimal strat value now is correct
- action space now relaxed to
[-1_000, 1_000]
- show market bound plot
- show sharpe ratio on train eval plot
- show grid plot of fbm opt strats
- show recreated sharpe ratio plot from Rasonyi Nika paper
- show experiments with T being bigger than num_steps
TODO
- show sharpe ratio of the optimal strategy on the train eval plot
- do the same as above but split the plot into two instead of a new y scale
- show the asimptotic optimal strategy on the market bound plot
- do an experiment where we train on
H = 0.1with increasingTand we keep ratio ofT / num_stepsconstant - show side by side of H0.1 T1024 with and without clipped action space, that is
[-10, 10]and[-1_000, 1_000] - szeretnenk latni a hisztogramjat az actionoknek
- if the training works for small
Tthen we can try to incrementally extend the time horizon
Remaining from last week
- when running parallell envs try to shuffle the starting point to cover more of the episode length
- pump up the
num_stepsnumbers - train model on with with big
num_steps_evalnumber (5_000) - fix training for
- Liquidation test env continue
- make a grid search where there are three axis:
clip_coef,learning_rate,H = 0.1, 0.7 - make an env that is a child of FBMEnv and has a price process of a linear function with random slope