Quantopian's community platform is shutting down. Please read this post for more information and download your code.
Back to Community
Help with creating a feature set for a machine learning algo

Hey everyone,

I am coding a simple machine learning algorithm for trading and I could use some help with picking features to use in my code. I am under the impression that I can use chart technical indicators; examples in the "Chart Technical Indicators" section in this link to the MathWorks financial toolbox. I want to use a feature set to predict future price direction, but I'm not sure which features are most relevant and what number or periods they should be calculated for.

I already coded the majority of the algorithm in MATLAB, and the algorithm is reasonably good at making predictions, but I am running into some issues with overfitting, and I suspect this is because my feature set could use some improvement. I have started translating my code to Python because I want to backtest it here, but my feature set just isn't quite good enough yet for me to post my Python code. Any help with what features I should focus on would be greatly appreciated.

4 responses

Machine learning is curve-fitting by nature. You need to do a lot of out of sample testing.

Zach I'd be interested to know if you have any positive results once you've coded it in Python. I did a quick run of this a while ago, using a very basic selection of RSI, ATR, ROC, etc, and found it had absolutely zero predictive power using a wide selection of classifiers.

I thought about combining into an event study but never got round to it

Some say TA had some predictive power 15 years ago, not anymore. But this is just anecdotal evidence, most research I read that says TA worked were very poor and didn't count for all the over-fitting factors and were not rigorous enough.

James, I've had fairly positive results in MATLAB trading Bitcoin, but I think this is partly because there are so many inefficiencies in Bitcoin markets so they are easy to predict.