Portfolio Analysis using `pyfolio`¶

There are many ways to evaluate and analyze an algorithm. The Quantopian backtester already provides some of these measures, such as a cumulative returns plot, but you may want to dive deeper into what your algorithm is doing. For example, you might want to look at how your portfolio allocation changes over time, or what your exposure to certain risk factors is.

At Quantopian, we built and open-sourced pyfolio for exactly that purpose. In this notebook you will learn how you can use this library from within the Quantopian research environment (you can also use this library independently, see the pyfolio website for more information on that).

At the core of pyfolio, we have tear sheets that summarize information about a backtest. Each tear sheet returns a number of plots, as well as other information, about a given topic. There are five main ones:

Cumulative returns tear sheet
Shock event returns tear sheet
Positional tear sheet
Transactional tear sheet
Bayesian tear sheet

We have added an interface to the object returned by get_backtest() to create these various tear sheets. To generate all tear sheets at once, it's as simple as generating a backtest object and calling create_full_tear_sheet on it:

# Get backtest object
bt = get_backtest('56bb3f8d3ce1db11952648c0')

# Create all tear sheets
bt.create_full_tear_sheet()

100% Time: 0:00:05|###########################################################|
Entire data start date: 2010-12-01
Entire data end date: 2016-02-04


Backtest Months: 62
                   Backtest
annual_return          0.33
annual_volatility      0.23
sharpe_ratio           1.37
calmar_ratio           1.74
stability              0.93
max_drawdown          -0.19
omega_ratio            1.28
sortino_ratio          1.96
skewness              -0.54
kurtosis               3.49
information_ratio      0.07
alpha                  0.23
beta                   0.77

Worst Drawdown Periods
   net drawdown in %  peak date valley date recovery date duration
0              19.24 2015-09-17  2016-01-11           NaT      NaN
4              15.53 2012-03-26  2012-05-18    2012-08-13      101
1              14.99 2015-06-23  2015-07-09    2015-08-11       36
2              14.71 2014-07-16  2014-12-16    2015-03-20      178
3              13.39 2013-08-05  2013-08-30    2014-03-24      166


2-sigma returns daily    -0.028
2-sigma returns weekly   -0.050
dtype: float64

Stress Events
                                    mean    min    max
US downgrade/European Debt Crisis  0.001 -0.047  0.020
Fukushima                          0.003 -0.015  0.019
EZB IR Event                       0.000 -0.029  0.033
Apr14                              0.001 -0.019  0.015
Oct14                              0.001 -0.019  0.029
Fall2015                          -0.003 -0.063  0.031
Recovery                           0.002 -0.067  0.053
New Normal                         0.001 -0.090  0.056


Top 10 long positions of all time (and max%)
[u'XIV' u'RSP' u'EDV' u'TLT']
[ 0.947  0.921  0.836  0.793]


Top 10 short positions of all time (and max%)
[]
[]


Top 10 positions of all time (and max%)
[u'XIV' u'RSP' u'EDV' u'TLT']
[ 0.947  0.921  0.836  0.793]


All positions ever held
[u'XIV' u'RSP' u'EDV' u'TLT']
[ 0.947  0.921  0.836  0.793]

Interpreting the output¶

There are many metrics being reported in all the tear sheets above. At the top, there are tables that tell you about summary performance statistics like the Sharpe ratio, Sortino ratio, and worst drawdown periods. The following plots are hopefully pretty self-explanatory, but more information can be found on the pyfolio website.
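
To make two of the summary statistics concrete, here is a minimal sketch of how an annualized Sharpe ratio and a maximum drawdown can be computed from a list of daily returns. This is not pyfolio's implementation: the function names, the toy returns, the 252 trading days per year, and the zero risk-free rate are all assumptions made for illustration.

```python
import math
import statistics

def annualized_sharpe(daily_returns, periods_per_year=252):
    """Mean over sample standard deviation of daily returns, scaled to a year.

    Assumes a risk-free rate of 0 (an illustrative simplification).
    """
    mean = statistics.mean(daily_returns)
    std = statistics.stdev(daily_returns)
    return mean / std * math.sqrt(periods_per_year)

def max_drawdown(daily_returns):
    """Largest peak-to-valley decline of the compounded equity curve (a negative fraction)."""
    equity = 1.0
    peak = 1.0
    worst = 0.0
    for r in daily_returns:
        equity *= 1.0 + r
        peak = max(peak, equity)
        worst = min(worst, equity / peak - 1.0)
    return worst

# Hypothetical toy data, not taken from the backtest above.
toy_returns = [0.01, -0.02, 0.015, -0.03, 0.02, 0.01]
sharpe = annualized_sharpe(toy_returns)
mdd = max_drawdown(toy_returns)
print("sharpe_ratio", round(sharpe, 2))
print("max_drawdown", round(mdd, 4))
```

The drawdown is reported as a negative fraction, matching the sign convention of the max_drawdown row in the table above.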

More fine-grained access¶

As the name suggests, create_full_tear_sheet() creates all tear sheets available (except for the Bayesian one, see below). You can also create individual tear sheets. For example, let's create one that only uses the returns of your strategy.

In addition, we will pass in a keyword argument called live_start_date. The use case for this feature is that you might have deployed this algorithm and want to see how the out-of-sample period measures up to your backtest. Although it is currently not possible to access returns from live-traded algorithms in research, you can still note the date when you deployed the algorithm and run a new backtest over the full time period. This date can be passed with live_start_date. Let's pretend that we developed and deployed this algorithm on 2014-1-1. As I had access to 10 years of historical data, I could easily have overfit my algorithm to work well only on that time period. In fact, it is very difficult not to overfit, so comparing in-sample and out-of-sample (OOS) performance is a good way to check for it.

This time, we will create just the returns tear sheet on the same backtest object from above:

bt.create_returns_tear_sheet(live_start_date='2014-1-1')

Entire data start date: 2010-12-01
Entire data end date: 2016-02-04


Out-of-Sample Months: 25
Backtest Months: 36
                   Backtest  Out_of_Sample  All_History
annual_return          0.48           0.14         0.33
annual_volatility      0.22           0.24         0.23
sharpe_ratio           1.88           0.68         1.37
calmar_ratio           3.11           0.74         1.74
stability              0.93           0.75         0.93
max_drawdown          -0.16          -0.19        -0.19
omega_ratio            1.39           1.14         1.28
sortino_ratio          2.77           0.93         1.96
skewness              -0.48          -0.59        -0.54
kurtosis               2.34           4.72         3.49
information_ratio      0.08           0.04         0.07
alpha                  0.32           0.13         0.23
beta                   0.66           0.98         0.77

Worst Drawdown Periods
   net drawdown in %  peak date valley date recovery date duration
0              19.24 2015-09-17  2016-01-11           NaT      NaN
4              15.53 2012-03-26  2012-05-18    2012-08-13      101
1              14.99 2015-06-23  2015-07-09    2015-08-11       36
2              14.71 2014-07-16  2014-12-16    2015-03-20      178
3              13.39 2013-08-05  2013-08-30    2014-03-24      166


2-sigma returns daily    -0.028
2-sigma returns weekly   -0.050
dtype: float64

There are a few differences in the returns tear sheet that was created. Note for example that the performance table at the top now has 3 columns: Backtest, Out_of_Sample, and All_History.
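
The three columns come from evaluating the same statistic on three slices of the returns. The sketch below shows the idea in plain Python for annual volatility. The toy data, the helper name, and the choice to count the live start date itself as out-of-sample are assumptions for illustration, not a description of pyfolio's internals.

```python
from datetime import date
import math
import statistics

# Hypothetical toy data: (date, daily return) pairs standing in for a backtest's returns.
dated_returns = [
    (date(2013, 12, 27), 0.004),
    (date(2013, 12, 30), -0.002),
    (date(2013, 12, 31), 0.006),
    (date(2014, 1, 2), -0.009),
    (date(2014, 1, 3), 0.003),
    (date(2014, 1, 6), -0.004),
]
live_start = date(2014, 1, 1)

def annual_volatility(returns, periods_per_year=252):
    """Sample standard deviation of daily returns, scaled to a year."""
    return statistics.stdev(returns) * math.sqrt(periods_per_year)

# Assumption: the live start date itself counts as out-of-sample.
backtest = [r for d, r in dated_returns if d < live_start]
out_of_sample = [r for d, r in dated_returns if d >= live_start]
all_history = [r for _, r in dated_returns]

table = {
    "Backtest": annual_volatility(backtest),
    "Out_of_Sample": annual_volatility(out_of_sample),
    "All_History": annual_volatility(all_history),
}
for column, value in table.items():
    print(column, round(value, 3))
```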

The cumulative returns plot also differentiates between in-sample and OOS time periods. In addition, there is a cone that gives you an indication of how your algorithm is performing OOS compared to its backtest.
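
One simple way to think about such a cone is as an expected path plus and minus a band that widens with the square root of time. The sketch below uses that idea with additive (non-compounded) returns. It is a simplified illustration under those assumptions, with a hypothetical function name and toy data, and it may differ from how pyfolio actually draws its cone.

```python
import math
import statistics

def simple_cone(backtest_returns, n_oos_days, num_std=1.0):
    """Return (lower, center, upper) cumulative-return bounds for each OOS day.

    Assumes daily returns are independent with the backtest's mean and
    standard deviation, and ignores compounding.
    """
    mean = statistics.mean(backtest_returns)
    std = statistics.stdev(backtest_returns)
    cone = []
    for t in range(1, n_oos_days + 1):
        center = mean * t
        width = num_std * std * math.sqrt(t)
        cone.append((center - width, center, center + width))
    return cone

# Hypothetical in-sample daily returns.
backtest_returns = [0.004, -0.002, 0.006, -0.009, 0.003, 0.005, -0.001]
cone = simple_cone(backtest_returns, n_oos_days=5)
for day, (lower, center, upper) in enumerate(cone, start=1):
    print(day, round(lower, 4), round(center, 4), round(upper, 4))
```

An OOS cumulative return that drifts below the lower bound suggests the live period is underperforming what the backtest would lead you to expect.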

At the bottom we also see 3 distribution plots comparing the in-sample and OOS returns distributions. The first one standardizes both distributions so that they share the same mean and have a standard deviation of 1. The other two plots relax this standardization.

Bayesian analysis¶

There are also a few more advanced (and still experimental) analysis methods in pyfolio based on Bayesian statistics.

The main benefit of these methods is uncertainty quantification. All the values you saw above, like the Sharpe ratio, are just single numbers. These estimates are noisy because they have been computed over a limited number of data points. So how much can you trust these numbers? You can't tell, because a single number carries no sense of uncertainty. That is where Bayesian statistics helps: instead of single values, we deal with probability distributions that assign degrees of belief to all possible parameter values.
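
To get a feel for how noisy a single-number estimate can be, the following simulation draws many return series from the same distribution and looks at how much the estimated Sharpe ratio varies for short versus long histories. This is a plain simulation, not the Bayesian analysis pyfolio runs, and the distribution parameters, sample sizes, and function names are assumptions chosen for illustration.

```python
import math
import random
import statistics

random.seed(42)  # make the simulation reproducible

def annualized_sharpe(daily_returns, periods_per_year=252):
    """Mean over sample standard deviation, scaled to a year (risk-free rate assumed 0)."""
    return (statistics.mean(daily_returns) / statistics.stdev(daily_returns)
            * math.sqrt(periods_per_year))

def sharpe_spread(n_days, n_trials=200):
    """Standard deviation of Sharpe estimates across simulated histories of n_days each."""
    estimates = []
    for _ in range(n_trials):
        # Hypothetical "true" process: normal daily returns, mean 0.05%, std 1%.
        sample = [random.gauss(0.0005, 0.01) for _ in range(n_days)]
        estimates.append(annualized_sharpe(sample))
    return statistics.stdev(estimates)

spread_short = sharpe_spread(60)    # roughly three months of data
spread_long = sharpe_spread(1000)   # roughly four years of data
print("spread with 60 days:", round(spread_short, 2))
print("spread with 1000 days:", round(spread_long, 2))
```

Even though every simulated history shares the same underlying Sharpe ratio, the short histories produce far more scattered estimates, which is the uncertainty a single reported value hides.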

Let's create the Bayesian tear sheet. Under the hood, this runs MCMC sampling in PyMC3 to estimate the posteriors, which can take quite a while (that's why we don't generate this by default in create_full_tear_sheet()).

bt.create_bayesian_tear_sheet(live_start_date='2014-1-1')

Running T model
 [-----------------100%-----------------] 2000 of 2000 complete in 3.8 sec
Finished T model (required 33.74 seconds).

Running BEST model
 [-----------------100%-----------------] 2000 of 2000 complete in 7.1 sec
Finished BEST model (required 41.33 seconds).

Finished plotting Bayesian cone (required 0.42 seconds).

Finished plotting BEST results (required 0.76 seconds).

Finished computing Bayesian predictions (required 0.14 seconds).

Finished plotting Bayesian VaRs estimate (required 0.06 seconds).

Running alpha beta model
 [-----------------100%-----------------] 2000 of 2000 complete in 3.4 sec
Finished running alpha beta model (required 26.17 seconds).

Finished plotting alpha beta model (required 0.16 seconds).

Total runtime was 102.78 seconds.

Let's go through these row by row:

The first one is the Bayesian cone plot, the result of a summer internship project by Sepideh Sadeghi here at Quantopian. It is similar to the cone plot you already saw in the tear sheet above but has two critical additions: (i) it takes uncertainty into account (i.e. a short backtest results in a wider cone), and (ii) it does not assume normality of returns but instead uses a Student-T distribution with heavier tails.
The next row compares mean returns of the in-sample (backtest) and OOS (forward) periods. Mean returns are not a single number but a (posterior) distribution that indicates how certain we can be in our estimates. The green distribution on the left side is much wider, reflecting our increased uncertainty due to having less OOS data. We can then calculate the difference between these two distributions, as shown on the right side. The grey lines denote the 2.5% and 97.5% percentiles. Intuitively, if the right grey line is lower than 0, you can say that with probability > 97.5% the OOS mean returns are below what is suggested by the backtest. The model used here is called BEST and was developed by John Kruschke.
The next couple of rows follow the same pattern but show estimates of annual volatility and Sharpe ratio, along with their respective differences.
The 5th row shows the effect size, that is, the difference of means normalized by the standard deviation, which gives you a general sense of how far apart the two distributions are. Intuitively, even if the means are significantly different, the difference may not be very meaningful if the standard deviation is huge, because the two returns distributions then barely differ.
The 6th row shows predicted returns (based on the backtest) for tomorrow and for 5 days from now. The blue line indicates the probability of losing more than 5% of your portfolio value and can be interpreted as a Bayesian VaR estimate.
Lastly, the final row shows a Bayesian estimate of annual alpha and beta. In addition to providing uncertainty estimates, this model, like all the ones above, assumes returns to be T-distributed, which leads to more robust estimates than a standard linear regression would.
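
The percentile lines and the effect size described in the rows above can be computed directly from posterior draws. The sketch below shows the arithmetic, using random numbers as stand-ins for MCMC samples. All values and names are hypothetical, and pooling the two standard deviations as the square root of their average variance is an assumption about how the normalization is done.

```python
import random

random.seed(0)  # reproducible stand-in draws

# Hypothetical stand-ins for posterior draws; a real analysis would take these from the sampler.
n_draws = 2000
mu_backtest = [random.gauss(0.0015, 0.0004) for _ in range(n_draws)]
mu_oos = [random.gauss(0.0006, 0.0008) for _ in range(n_draws)]  # wider: less OOS data
sigma_backtest = [abs(random.gauss(0.014, 0.001)) for _ in range(n_draws)]
sigma_oos = [abs(random.gauss(0.015, 0.002)) for _ in range(n_draws)]

def percentile(sorted_values, q):
    """Simple rank-based percentile of an already sorted list, q in [0, 100]."""
    idx = round(q / 100 * (len(sorted_values) - 1))
    return sorted_values[idx]

# Posterior of the difference in mean returns (OOS minus backtest).
diff = sorted(o - b for o, b in zip(mu_oos, mu_backtest))
lower = percentile(diff, 2.5)
upper = percentile(diff, 97.5)
prob_oos_worse = sum(d < 0 for d in diff) / n_draws

# Effect size per draw: difference of means over a pooled standard deviation (assumed form).
effect_size = [
    (o - b) / (((so ** 2 + sb ** 2) / 2) ** 0.5)
    for o, b, so, sb in zip(mu_oos, mu_backtest, sigma_oos, sigma_backtest)
]

print("95% interval of difference:", lower, upper)
print("P(OOS mean < backtest mean):", prob_oos_worse)
```

If the upper bound of the interval lies below 0, the draws support the statement that OOS mean returns fall short of the backtest with probability above 97.5%.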

For more information on Bayesian statistics, check out these resources:

My personal blog: http://twiecki.github.io/
A talk I gave in Singapore on Probabilistic Programming in Quantitative Finance: http://blog.quantopian.com/probabilistic-programming-for-non-statisticians/
The IPython NB book Bayesian Methods for Hackers.

Using pyfolio directly¶

Above, we saw how we can easily create a variety of tear sheets. These are all created using a thin wrapper on top of the pyfolio OSS library. You might also want more fine-grained access to the functionality provided by this library. For this, you can import pyfolio and use it directly.

import pyfolio as pf

returns = bt.daily_performance.returns
pf.timeseries.cum_returns(returns).plot();
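
For intuition about what a cumulative returns curve represents, here is a minimal plain-Python sketch of compounding daily returns. It is an illustration under the assumption that cumulative return means compounded growth minus 1, not a copy of pyfolio's code, and the function name and toy data are hypothetical.

```python
from itertools import accumulate
import operator

def cumulative_returns(daily_returns):
    """Compound simple daily returns into a cumulative return series."""
    growth = accumulate((1.0 + r for r in daily_returns), operator.mul)
    return [g - 1.0 for g in growth]

# Hypothetical toy returns: up 10%, down 50%, up 100%.
toy = [0.10, -0.50, 1.00]
cum = cumulative_returns(toy)
print([round(c, 4) for c in cum])
```

Note that a 50% loss followed by a 100% gain only brings the portfolio back to where it stood before the loss, which is why returns are compounded rather than summed.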

For more information on the usage of the library, check out the pyfolio website or our GitHub repo.

Contributing¶

pyfolio is still a very new project, so there will be bugs and there are many rough edges. Your help is greatly appreciated.

If you find bugs or have other questions, please report them to our issue tracker. We also appreciate any contributions. For some ideas on where to start, see the 'help wanted' tag.