.. _metrics:

Metrics
-------

The risk and performance metrics are summarizing values calculated by Zipline
when running a simulation. These metrics can be about the performance of an
algorithm, like returns or cash flow, or the riskiness of an algorithm, like
volatility or beta. Metrics may be reported minutely, daily, or once at the end
of a simulation. A single metric may choose to report at multiple time-scales
where appropriate.

Metrics Sets
~~~~~~~~~~~~

Zipline groups risk and performance metrics into collections called "metrics
sets". A single metrics set defines all of the metrics to track during a single
backtest. A metrics set may contain metrics that report at different time
scales. The default metrics set will compute a host of metrics, such as
algorithm returns, volatility, Sharpe ratio, and beta.

Selecting the Metrics Set
~~~~~~~~~~~~~~~~~~~~~~~~~

When running a simulation, the user may select the metrics set to report. How
you select the metrics set depends on the interface being used to run the
algorithm.

Command Line and IPython Magic
``````````````````````````````

When running with the command line or IPython magic interfaces, the metrics set
may be selected by passing the ``--metrics-set`` argument. For example:

.. code-block:: bash

   $ zipline run algorithm.py -s 2014-01-01 -e 2014-02-01 --metrics-set my-metrics-set

``run_algorithm``
`````````````````

When running through the :func:`~zipline.run_algorithm` interface, the metrics
set may be passed with the ``metrics_set`` argument. This may either be the name
of a registered metrics set, or a set of metric object. For example:

.. code-block:: python

   run_algorithm(..., metrics_set='my-metrics-set')
   run_algorithm(..., metrics_set={MyMetric(), MyOtherMetric(), ...})

Running Without Metrics
~~~~~~~~~~~~~~~~~~~~~~~

Computing risk and performance metrics is not free, and contributes to the total
runtime of a backtest. When actively developing an algorithm, it is often
helpful to skip these computations to speed up the debugging cycle. To disable
the calculation and reporting of all metrics, users may select the built-in
metrics set ``none``. For example:

.. code-block:: bash

   $ zipline run algorithm.py -s 2014-01-01 -e 2014-02-01 --metrics-set none

Defining New Metrics
~~~~~~~~~~~~~~~~~~~~

A metric is any object that implements some subset of the following methods:

- ``start_of_simulation``
- ``end_of_simulation``
- ``start_of_session``
- ``end_of_session``
- ``end_of_bar``

These functions will be called at the time indicated by their name, at which
point the metric object may collect any needed information and optionally report
a computed value. If a metric does not need to do any processing at one of these
times, it may omit a definition for the given method.

A metric should be reusable, meaning that a single instance of a metric class
should be able to be used across multiple backtests. Metrics do not need to
support multiple simulations at once, meaning that internal caches and data are
consistent between ``start_of_simulation`` and ``end_of_simulation``.

``start_of_simulation``
```````````````````````

The ``start_of_simulation`` method should be thought of as a per-simulation
constructor. This method should initialize any caches needed for the duration of
a single simulation.

The ``start_of_simulation`` method should have the following signature:

.. code-block:: python

   def start_of_simulation(self,
                           ledger,
                           emission_rate,
                           trading_calendar,
                           sessions,
                           benchmark_source):
       ...

``ledger`` is an instance of :class:`~zipline.finance.ledger.Ledger` which is
maintaining the simulation's state. This may be used to lookup the algorithm's
starting portfolio values.

``emission_rate`` is a string representing the smallest frequency at which
metrics should be reported. ``emission_rate`` will be either ``minute`` or
``daily``. When ``emission_rate`` is ``daily``, ``end_of_bar`` will not be
called at all.

``trading_calendar`` is an instance of
:class:`~zipline.utils.calendars.TradingCalendar` which is the trading calendar
being used by the simulation.

``sessions`` is a :class:`pandas.DatetimeIndex` which is holds the session
labels, in sorted order, that the simulation will execute.

``benchmark_source`` is an instance of
:class:`~zipline.sources.benchmark_source.BenchmarkSource` which is the
interface to the returns of the benchmark specified by
:func:`~zipline.api.set_benchmark`.

``end_of_simulation``
`````````````````````

The ``end_of_simulation`` method should have the following signature:

.. code-block:: python

   def end_of_simulation(self,
                         packet,
                         ledger,
                         trading_calendar,
                         sessions,
                         data_portal,
                         benchmark_source):
       ...

``ledger`` is an instance of :class:`~zipline.finance.ledger.Ledger` which is
maintaining the simulation's state. This may be used to lookup the algorithm's
final portfolio values.

``packet`` is a dictionary to write the end of simulation values for the given
metric into.

``trading_calendar`` is an instance of
:class:`~zipline.utils.calendars.TradingCalendar` which is the trading calendar
being used by the simulation.

``sessions`` is a :class:`pandas.DatetimeIndex` which is holds the session
labels, in sorted order, that the simulation has executed.

``data_portal`` is an instance of :class:`~zipline.data.data_portal.DataPortal`
which is the metric's interface to pricing data.

``benchmark_source`` is an instance of
:class:`~zipline.sources.benchmark_source.BenchmarkSource` which is the
interface to the returns of the benchmark specified by
:func:`~zipline.api.set_benchmark`.

``start_of_session``
````````````````````

The ``start_of_session`` method may see a slightly different view of the
``ledger`` or ``data_portal`` than the previous ``end_of_session`` if the price
of any futures owned move between trading sessions or if a capital change
occurs.

The ``start_of_session`` method should have the following signature:

.. code-block:: python

   def start_of_session(self,
                        ledger,
                        session_label,
                        data_portal):
       ...

``ledger`` is an instance of :class:`~zipline.finance.ledger.Ledger` which is
maintaining the simulation's state. This may be used to lookup the algorithm's
current portfolio values.

``session_label`` is a :class:`~pandas.Timestamp` which is the label of the
session which is about to run.

``data_portal`` is an instance of :class:`~zipline.data.data_portal.DataPortal`
which is the metric's interface to pricing data.

``end_of_session``
``````````````````

The ``end_of_session`` method should have the following signature:

.. code-block:: python

   def end_of_session(self,
                      packet,
                      ledger,
                      session_label,
                      session_ix,
                      data_portal):

``packet`` is a dictionary to write the end of session values. This dictionary
contains two sub-dictionaries: ``daily_perf`` and ``cumulative_perf``. When
applicable, the ``daily_perf`` should hold the current day's value, and
``cumulative_perf`` should hold a cumulative value for the entire simulation up
to the current time.

``ledger`` is an instance of :class:`~zipline.finance.ledger.Ledger` which is
maintaining the simulation's state. This may be used to lookup the algorithm's
current portfolio values.

``session_label`` is a :class:`~pandas.Timestamp` which is the label of the
session which is has just completed.

``session_ix`` is an :class:`int` which is the index of the current trading
session being run. This is provided to allow for efficient access to the daily
returns through ``ledger.daily_returns_array[:session_ix + 1]``.

``data_portal`` is an instance of :class:`~zipline.data.data_portal.DataPortal`
which is the metric's interface to pricing data

``end_of_bar``
``````````````

.. note::

   ``end_of_bar`` is only called when ``emission_mode`` is ``minute``.

The ``end_of_bar`` method should have the following signature:

.. code-block:: python

   def end_of_bar(self,
                  packet,
                  ledger,
                  dt,
                  session_ix,
                  data_portal):

``packet`` is a dictionary to write the end of session values. This dictionary
contains two sub-dictionaries: ``minute_perf`` and ``cumulative_perf``. When
applicable, the ``minute_perf`` should hold the current partial day's value, and
``cumulative_perf`` should hold a cumulative value for the entire simulation up
to the current time.

``ledger`` is an instance of :class:`~zipline.finance.ledger.Ledger` which is
maintaining the simulation's state. This may be used to lookup the algorithm's
current portfolio values.

``dt`` is a :class:`~pandas.Timestamp` which is the label of bar that has just
completed.

``session_ix`` is an :class:`int` which is the index of the current trading
session being run. This is provided to allow for efficient access to the daily
returns through ``ledger.daily_returns_array[:session_ix + 1]``.

``data_portal`` is an instance of :class:`~zipline.data.data_portal.DataPortal`
which is the metric's interface to pricing data.

Defining New Metrics Sets
~~~~~~~~~~~~~~~~~~~~~~~~~

Users may use :func:`zipline.finance.metrics.register` to register a new metrics
set. This may be used to decorate a function taking no arguments which returns a
new set of metric object instances. For example:

.. code-block:: python

   from zipline.finance import metrics

   @metrics.register('my-metrics-set')
   def my_metrics_set():
       return {MyMetric(), MyOtherMetric(), ...}


This may be embedded in the user's ``extension.py``.

The reason that a metrics set is defined as a function which produces a set,
instead of just a set, is that users may want to fetch external data or
resources to construct their metrics. By putting this behind a callable, users
do not need to fetch the resources when the metrics set is not being used.