I have been sifting through the fundamental data quite a bit over the last week and have collected a few questions
There are stock symbols with ebit, ebitda as None. What are the conditions under which I can expect to get ebit/ebitda to be None? I can think of just one, when there are multiple classes of shares, all except one could be expected to have None. Are there others.
I see "nan" for some values, including morningstart_sector_code. For example, if I picked up all the stocks in the fundamental data base like I did and do a union of all the sector codes, I get this 'morningstar_sector_code': [nan, 101.0, 102.0, 103.0, 104.0, 205.0, 206.0, 207.0, 308.0, 309.0, 310.0, 311.0]. Whats throwing the nan? and Why?
3.If I am trying to build and index based on match filter, how can I eliminate
3.1 Stocks that are of different share classed of the same company? Example, I don't want to see BRK.A and BRK.B. Is there a simple way to filter these out
3.2 What symbols should be eliminated? I find symbols with ebit/ebitda as None or NAN in some cases. Are there a class of symbols that are reasonably expected not to have ebit/ebitda, operating margins, ebitda_margin, etc
Here is some data on NAN checks across the database. I've looked at some of them, and they look like a regular stock, such as CEL or AGM. I don't understand why their ebit/ebitda is NAN, it can be -ve, +ive or 0, right ?
2003-01-02: NAN Check: {'LIFE': ['operation_margin', 'ebit', 'ev_to_ebitda', 'ebitda_margin', 'ebitda', 'ebit_margin', 'net_margin'], 'CEL': ['operation_margin', 'ebit', 'ev_to_ebitda', 'ebitda_margin', 'ebitda'], 'ATYT': ['ev_to_ebitda'], 'SSPI': ['ev_to_ebitda'], 'AGE': ['operation_margin', 'ebit', 'ev_to_ebitda', 'ebitda_margin', 'ebitda', 'ebit_margin'], 'AGM': ['ebit', 'ebit_margin'], 'CSNT': ['ebit', 'ebit_margin'], 'EPEX': ['ev_to_ebitda'], 'AGY': ['ebit', 'ebitda'], 'AGT': ['ev_to_ebitda'], 'AGR': ['ev_to_ebitda'], 'AGP': ['ev_to_ebitda'], 'KYO': ['ebit', 'ebitda'], 'MROE': ['ebit', 'ev_to_ebitda', 'ebit_margin'], 'CTZN': ['ebit', 'ebit_margin'], 'TINY': ['ebit', 'ebit_margin'], 'PTNR': ['ebit', 'ebitda'], 'SPI': ['ebit', 'ebitda'], 'SPM': ['revenue_growth', 'operation_margin', 'ebit', 'ebitda_margin', 'ebitda', 'ebit_margin', 'net_margin'], 'CRBC': ['ebit', 'ebit_margin'], 'GS': ['ebit', 'ebit_margin'], 'HAND': ['ev_to_ebitda'], 'BYH': ['operation_margin', 'ebit', 'ev_to_ebitda', 'ebitda_margin', 'ebitda', 'ebit...Here is some data on None checks. I can see why avg5_yrs_roic could be None if there stock wasn't public long enough, at that point to be able to calculate 5 year averages. But how about "equity_per_share_growth". And also SPG below, according to yahoo has been public since early 1990, so there should be plenty to calculate both of the fields that are none below
2003-01-02: None Check: {'MSSN': ['equity_per_share_growth', 'avg5_yrs_roic'], 'SPP': ['equity_per_share_growth', 'avg5_yrs_roic'], 'CTZN': ['equity_per_share_growth', 'avg5_yrs_roic'], 'SPW': ['equity_per_share_growth', 'avg5_yrs_roic'], 'SPH': ['equity_per_share_growth', 'avg5_yrs_roic'], 'SPI': ['equity_per_share_growth', 'avg5_yrs_roic'], 'SPM': ['equity_per_share_growth', 'avg5_yrs_roic'], 'SPN': ['equity_per_share_growth', 'avg5_yrs_roic'], 'SPA': ['equity_per_share_growth', 'avg5_yrs_roic'], 'SPC': ['equity_per_share_growth', 'avg5_yrs_roic'], 'SPF': ['equity_per_share_growth', 'avg5_yrs_roic'], 'SPG': ['equity_per_share_growth', 'avg5_yrs_roic'], 'TISA': ['equity_per_share_growth', 'avg5_yrs_roic'], 'ARTN_A': ['equity_per_share_growth', 'avg5_yrs_roic'], 'REMX': ['equity_per_share_growth', 'avg5_yrs_roic'], 'CGPI': ['equity_per_share_growth', 'avg5_yrs_roic'], 'PQE': ['equity_per_share_growth', 'avg5_yrs_roic'], 'JNY': ['equity_per_share_growth', 'avg5_yrs_roic'], 'JNC': ['equity_per_share_growth', 'avg5_yrs_roic...
thanks,
Sarvi