# Analyzing bot data

You can analyze the results of backtests and trading history easily using Jupyter notebooks. A sample notebook is located at `user_data/notebooks/analysis_example.ipynb`. For usage instructions, see [jupyter.org](https://jupyter.org/documentation).

*Pro tip - Don't forget to start a Jupyter notebook server from within your conda or venv environment, or use [nb_conda_kernels](https://github.com/Anaconda-Platform/nb_conda_kernels)*

## Example snippets

### Load backtest results into a pandas dataframe

```python
from freqtrade.data.btanalysis import load_backtest_data
# Load backtest results
df = load_backtest_data("user_data/backtest_data/backtest-result.json")

# Show value-counts per pair
df.groupby("pair")["sell_reason"].value_counts()
```

This allows you to drill deeper into your backtest results and run analyses that would make the regular backtest output very difficult to digest, due to information overload, if they were included there.
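
As one possible way to drill deeper, the sketch below aggregates profit per pair and sell reason. It runs on a small hand-made dataframe so it works without a backtest file. The `pair` and `sell_reason` columns come from the snippet above, while the `profit_percent` column name is an assumption, so check `df.columns` on your own results first.

```python
import pandas as pd

# Hand-made stand-in for the dataframe returned by load_backtest_data().
# "profit_percent" is an assumed column name; verify it with df.columns.
trades = pd.DataFrame({
    "pair": ["BTC_USDT", "BTC_USDT", "ETH_USDT", "ETH_USDT", "ETH_USDT"],
    "sell_reason": ["roi", "stop_loss", "roi", "roi", "sell_signal"],
    "profit_percent": [0.02, -0.10, 0.01, 0.03, -0.01],
})

# Number of trades plus mean and total profit per pair and sell reason
summary = (
    trades.groupby(["pair", "sell_reason"])["profit_percent"]
    .agg(["count", "mean", "sum"])
)
print(summary)
```

Replacing `trades` with the dataframe loaded above should give the same kind of table for real results, provided the column names match.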

### Load live trading results into a pandas dataframe

```python
from freqtrade.data.btanalysis import load_trades_from_db

# Fetch trades from database
df = load_trades_from_db("sqlite:///tradesv3.sqlite")

# Display results
df.groupby("pair")["sell_reason"].value_counts()
```

## Strategy debugging example

Debugging a strategy can be time-consuming. Freqtrade offers helper functions to visualize raw data.

### Import requirements and define variables used in analyses

```python
# Imports
from pathlib import Path
from freqtrade.data.history import load_pair_history
from freqtrade.resolvers import StrategyResolver

# You can override strategy settings as demonstrated below.
# Customize these according to your needs.

# Define some constants
ticker_interval = "5m"
# Name of the strategy class
strategy_name = 'AwesomeStrategy'
# Path to user data
user_data_dir = 'user_data'
# Location of the strategy
strategy_location = Path(user_data_dir, 'strategies')
# Location of the data
data_location = Path(user_data_dir, 'data', 'binance')
# Pair to analyze 
# Only use one pair here
pair = "BTC_USDT"
```

### Load exchange data

```python
# Load data using values set above
bt_data = load_pair_history(datadir=Path(data_location),
                            ticker_interval=ticker_interval,
                            pair=pair)

# Confirm success
print(f"Loaded {len(bt_data)} rows of data for {pair} from {data_location}")
```

### Load and run strategy

* Rerun this step each time the strategy file is changed.

```python
# Load strategy using values set above
strategy = StrategyResolver({'strategy': strategy_name,
                            'user_data_dir': user_data_dir,
                            'strategy_path': strategy_location}).strategy

# Generate buy/sell signals using strategy
df = strategy.analyze_ticker(bt_data, {'pair': pair})
```

### Display the trade details

* The snippet at the end of this section uses `data.tail()`. Using `data.head()` would also work, but most indicators have some "startup" data at the top of the dataframe.

#### Some possible problems

* Columns with NaN values at the end of the dataframe
* Columns used in `crossed*()` functions with completely different units
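
The first problem in the list above can be checked programmatically. The sketch below uses a hypothetical helper, `columns_with_trailing_nan()`, which is not part of Freqtrade, and a toy dataframe standing in for the output of `analyze_ticker()`.

```python
import numpy as np
import pandas as pd


def columns_with_trailing_nan(frame: pd.DataFrame, rows: int = 5) -> list:
    """Return the names of columns that contain NaN in the last `rows` rows."""
    tail = frame.tail(rows)
    return [col for col in tail.columns if tail[col].isna().any()]


# Toy dataframe standing in for the output of analyze_ticker()
toy = pd.DataFrame({
    "close": [10.0, 10.5, 11.0, 10.8, 10.9, 11.2],
    # NaN only in the "startup" rows at the top: expected for indicators
    "sma": [np.nan, np.nan, 10.5, 10.77, 10.9, 10.97],
    # NaN in the last row: the kind of problem worth investigating
    "shifted": [10.5, 11.0, 10.8, 10.9, 11.2, np.nan],
})

print(columns_with_trailing_nan(toy, rows=3))
```

Only the `shifted` column is reported here, because the NaN values in `sma` sit in the startup rows at the top of the dataframe.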

#### Comparison with full backtest

Having 200 buy signals as output for one pair from `analyze_ticker()` does not necessarily mean that 200 trades will be made during backtesting.

Assuming you use only one condition, such as `df['rsi'] < 30`, as the buy condition, this will generate multiple "buy" signals in sequence for each pair (until RSI rises back to 30 or above).
The bot will only buy on the first of these signals (and only if a trade slot (`max_open_trades`) is still available), or on one of the middle signals, as soon as a slot becomes available.

```python
# Report results
print(f"Generated {df['buy'].sum()} buy signals")
data = df.set_index('date', drop=True)
data.tail()
```
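
To illustrate why the number of buy signals can exceed the number of trades, here is a deliberately simplified toy model in plain Python. It is not Freqtrade's backtesting logic: the hypothetical `count_entries()` function assumes every trade is held for a fixed number of candles and that a new trade opens only when a slot is free.

```python
def count_entries(buy_signals, hold_candles, max_open_trades=1):
    """Count trades opened in a toy model with fixed-length trades."""
    open_until = []  # candle index at which each open trade closes
    entries = 0
    for i, signal in enumerate(buy_signals):
        # Free the slots of trades that have closed by this candle
        open_until = [end for end in open_until if end > i]
        # Open a trade only on a signal and only if a slot is available
        if signal and len(open_until) < max_open_trades:
            open_until.append(i + hold_candles)
            entries += 1
    return entries


# Two runs of consecutive buy signals, as a condition like rsi < 30 would produce
signals = [0, 1, 1, 1, 1, 0, 0, 1, 1, 0]

print(f"Buy signals: {sum(signals)}")
print(f"Trades opened: {count_entries(signals, hold_candles=3)}")
```

In this toy run, six signals lead to three trades: one on the first signal of each run, and one on a middle signal of the first run once the slot has been freed.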

Feel free to submit an issue or Pull Request enhancing this document if you would like to share ideas on how to best analyze the data.