Hi all,
I recently learned about Quantopian after taking a data mining course in university. I've been developing a screener, or classifier, for predicting daily stock movement and have seen some really great results so far. In order to understand how well the predictions would factor into a trading strategy, I've built-in a simple strategy that buys at market close if the stock is predicted to go up tomorrow or sells at market close if the stock is predicted to go down tomorrow. Then, the next day, it undoes the position from the day before, makes a new prediction, and either buys or sells based on that prediction.
My motivation for coming to Quantopian was to both get a more rigorous calculation of return (incl. commission fees, slippage/market impact of order) and to hopefully use the platform for live trading in the future.
My process is rather simple:
I build a pandas DataFrame using get_data_yahoo for every ticker I want to experiment with. I append multiple columns using pandas transforms, simple algebra, boolean expressions, etc. (most of which are essential to my model, so I will need some way to bring my custom data frame function or rebuild the exact columns i have been working with in Quantopian). I build a scikit-learn model that trains on certain columns in the DF and tests on those columns, comparing the prediction vector to the true value vector. I have stored prediction vectors for historical data (1 for up, 0 for down) for multiple stocks. I have text files that contain pickled sklearn models--these are able to make predictions on current or future data. I just don't really know how to get started within Quantopian. I've read through the API & Documentation multiple times and even tried messing around with zipline in my development environment. No luck so far in getting the big picture.
Any guidance would be very much appreciated.