Hoping for some advice in regards to training a single Random Forest classifier model on 200+ assets, that will predict the direction of the next bar/candle. This is a learning experiment for me where accuracy is not the primary goal - i'm more interested in how to tackle the data processing for building a single generalised model for multiple assets.
I have a dozen or so features which are derived from price (simple technical indicators and statistical calculations), the label/prediction is the direction of the next candle/bar.
- What is the preferred way to scale the data (standardize/normalize/log etc) across all the asset features? The price, volatility, and returns are wildy different between assets eg. Asset 1 is $0.30 with +/- 10% stddev of returns, and Asset 2 is $2500 with a +/- 2% stddev returns.
Many thanks,
J