Quantopian's community platform is shutting down. Please read this post for more information and download your code.
Back to Community
missing split, PPG?

Per https://www.splithistory.com/ppg/ there was a PPG split on 6/15/2015. The split appears to be unaccounted for in the Quantopian database.

2015-06-10 09:31 PRINT PPG
2015-06-10 09:31 PRINT current price: 230.381
2015-06-10 09:31 PRINT portfolio value: 0.0
2015-06-11 09:31 PRINT PPG
2015-06-11 09:31 PRINT current price: 231.56
2015-06-11 09:31 PRINT portfolio value: 9957.08
2015-06-12 09:31 PRINT PPG
2015-06-12 09:31 PRINT current price: 233.65
2015-06-12 09:31 PRINT portfolio value: 10046.95
2015-06-15 09:31 PRINT PPG
2015-06-15 09:31 PRINT current price: 115.48
2015-06-15 09:31 PRINT portfolio value: 4965.64
2015-06-16 09:31 PRINT PPG
2015-06-16 09:31 PRINT current price: 115.37
2015-06-16 09:31 PRINT portfolio value: 4960.91
2015-06-17 09:31 PRINT PPG
2015-06-17 09:31 PRINT current price: 116.106
2015-06-17 09:31 PRINT portfolio value: 4992.558
2015-06-18 09:31 PRINT PPG
2015-06-18 09:31 PRINT current price: 117.23
2015-06-18 09:31 PRINT portfolio value: 5040.89
2015-06-19 09:31 PRINT PPG
2015-06-19 09:31 PRINT current price: 117.78
2015-06-19 09:31 PRINT portfolio value: 5064.54
End of logs.

4 responses

Hi Grant,

You're right. It looks like we incorrectly have the split occurring on 5/7/2015. A fix is in the works. Thanks for reporting the problem.

Disclaimer

The material on this website is provided for informational purposes only and does not constitute an offer to sell, a solicitation to buy, or a recommendation or endorsement for any security or strategy, nor does it constitute an offer to provide investment advisory services by Quantopian. In addition, the material offers no opinion with respect to the suitability of any security or specific investment. No information contained herein should be regarded as a suggestion to engage in or refrain from any investment-related course of action as none of Quantopian nor any of its affiliates is undertaking to provide investment advice, act as an adviser to any plan or entity subject to the Employee Retirement Income Security Act of 1974, as amended, individual retirement account or individual retirement annuity, or give advice in a fiduciary capacity with respect to the materials presented herein. If you are an individual retirement or other investor, contact your financial advisor or other fiduciary unrelated to Quantopian about whether any given investment idea, strategy, product or service described herein may be appropriate for your circumstances. All investments involve risk, including loss of principal. Quantopian makes no guarantees as to the accuracy or completeness of the views expressed in the website. The views are subject to change, and may have become unreliable for various reasons, including changes in market conditions or economic circumstances.

Hello Jamie -

Thanks for the follow-up. A few questions/comments:

  1. Any idea how missing split errors end up in the data? It is very mysterious, looking in from the outside. Presumably, splits are public knowledge, and are handled openly and systematically (and efficiently, since https://www.splithistory.com/ offers up the data for free, without even ads on their website). Is there something about how Quantopian works that makes addressing splits particularly challenging?
  2. At one point, I started to do a head-scratch on how the research platform might be used to search for such errors across all stocks and time. Apparently there are some challenges (or you or one of your 90,000 users would have done it by now). Any guidance? Naively, I'd think that correlating changes in shares outstanding with overnight price jumps would do the trick, but maybe since the data aren't exactly 1:1, one needs a "fuzzy" algorithm and every case needs to be reviewed in detail (but then, data such as is available on https://www.splithistory.com/ could be used to firm up the decision).
  3. At the risk of beating a dead horse, I continue to recommend a user-facing database of known gross data errors, and the means to access the database from within the research platform and algos. Is this on the engineering task list, or is it a non-starter? And if the latter, why is it a bad idea (it just seems so logical and practical in my mind)?

Hi Grant,

  1. Unfortunately, the data from our provider is simply just missing a split sometimes. We are in the process of improving the robustness of our corporate actions data. Regarding https://www.splithistory.com, it's a good source for confirming splits, but in my experience, it should not be relied on as a primary source for what happened. The sensitivity is good on that site (they don't miss many events), but they frequently list events as splits that aren't really splits (for example, spinoffs). I also cannot speak to whether the data they post is good live. I also think that the text on each symbol is auto-generated so it's not that descriptive.
  2. Tracking down missed/incorrect events simply by looking at pricing data is not trivial, but it is something that we would like to do. It's not something that we're doing yet, but it's on our radar.
  3. This is not something that we are really considering right now. Instead, we're focusing our efforts on fixing the existing problems and doing what we can to prevent these from happening in the future. The fix to PPG also includes fixes to KGC, BIS, HBI, XHS, XHE, XSW, XTN, NTN, and VBFC which are all the tickers with incorrect/missing split info that we are aware of.

Thanks Jamie -

This is all still very mysterious to me. When a company decides to do a stock split, presumably it has to file paperwork, and the corporate action ends up in an authoritative database. It's not like the market opens and a company could say "Oops, we forgot to mention that we decided to do a stock split today." What am I missing here?

Regarding data errors, it seems that it would be a very simple change to your API that could be cranked into https://www.quantopian.com/help#ide-trading-guards . For example, add:

security_lists.missing_splits  

As Dan says on https://www.quantopian.com/posts/discrepancy-in-quantopian-input-data-sets , "Data errors are always present in financial data; I'm sure we haven't found them all." I'd encourage you to at least have it on the list as a project to be more systematic in publishing problems as they are found, and making the information accessible to users in their code. Seems pretty straightforward, but I can also appreciate that you have lots of competing priorities.

Grant