stable/docs/freqai.md

![freqai-logo](assets/freqai_doc_logo.svg)

# FreqAI 

##Introduction

FreqAI is a software designed to automate a variety of tasks associated with training a predictive machine learning model to generate market forecasts given a set of input features.

Features include:

* **Self-adaptive retraining** - Retrain models during [live deployments](freqai-running.md#running-the-model-live) to self-adapt to the market in a supervised manner
* **Rapid feature engineering** - Create large rich [feature sets](freqai-feature-engineering.md#feature-engineering) (10k+ features) based on simple user-created strategies
* **High performance** - Threading allows for adaptive model retraining on a separate thread (or on GPU if available) from model inferencing (prediction) and bot trade operations. Newest models and data are kept in RAM for rapid inferencing
* **Realistic backtesting** - Emulate self-adaptive training on historic data with a [backtesting module](freqai-running.md#backtesting) that automates retraining
* **Extensibility** - The generalized and robust architecture allows for incorporating any [machine learning library/method](freqai-configuration.md#using-different-prediction-models) available in Python. Eight examples are currently available, including classifiers, regressors, and a convolutional neural network
* **Smart outlier removal** - Remove outliers from training and prediction data sets using a variety of [outlier detection techniques](freqai-outlier-detection.md)
* **Crash resilience** - Store trained models to disk to make reloading from a crash fast and easy, and [purge obsolete files](freqai-data-handling.md#purging-old-model-data) for sustained dry/live runs
* **Automatic data normalization** - [Normalize the data](freqai-feature-engineering.md#feature-normalization) in a smart and statistically safe way
* **Automatic data download** - Compute timeranges for data downloads and update historic data (in live deployments)
* **Cleaning of incoming data** - Handle NaNs safely before training and model inferencing
* **Dimensionality reduction** - Reduce the size of the training data via [Principal Component Analysis](freqai-feature-engineering.md#data-dimensionality-reduction-with-principal-component-analysis)
* **Deploying bot fleets** - Set one bot to train models while a fleet of [follower bots](freqai-running.md#setting-up-a-follower) inference the models and handle trades

## Quick start

The easiest way to quickly test FreqAI is to run it in dry mode with the following command:

```bash
freqtrade trade --config config_examples/config_freqai.example.json --strategy FreqaiExampleStrategy --freqaimodel LightGBMRegressor --strategy-path freqtrade/templates
```

The user will see the boot-up process of automatic data downloading, followed by simultaneous training and trading.

An example strategy, prediction model, and config to use as a starting points can be found in
`freqtrade/templates/FreqaiExampleStrategy.py`, `freqtrade/freqai/prediction_models/LightGBMRegressor.py`, and
`config_examples/config_freqai.example.json`, respectively.

## General approach

The user provides FreqAI with a set of custom *base indicators* (the same way as in a [typical Freqtrade strategy](strategy-customization.md)) as well as target values (*labels*). For each pair in the whitelist, FreqAI trains a model to predict the target values based on the input of custom indicators. The models are then consistently retrained, with a frequency set by the user, to adapt to market conditions. FreqAI offers the ability to both backtest strategies (emulating reality with periodic retraining on historic data) and deploy dry/live runs. In dry/live conditions, FreqAI can be set to constant retraining in a background thread to keep models as up to date as possible.

An overview of the algorithm, explaining the data processing pipeline and model usage, is shown below.

![freqai-algo](assets/freqai_algo.jpg)

### Important machine learning vocabulary

**Features** - the parameters, based on historic data, on which a model is trained. All features for a single candle is stored as a vector. In FreqAI, the user builds a feature data sets from anything they can construct in the strategy.

**Labels** - the target values that a model is trained toward. Each feature vector is associated with a single label that is defined by the user within the strategy. These labels intentionally look into the future, and are not available to the model during dry/live/backtesting.

**Training** - the process of "teaching" the model to match the feature sets to the associated labels. Different types of models "learn" in different ways. More information about the different models can be found [here](freqai-configuration.md#using-different-prediction-models).

**Train data** - a subset of the feature data set that is fed to the model during training. This data directly influences weight connections in the model.

**Test data** - a subset of the feature data set that is used to evaluate the performance of the model after training. This data does not influence nodal weights within the model.

**Inferencing** - the process of feeding a trained model new data on which it will make a prediction.

## Install prerequisites

The normal Freqtrade install process will ask the user if they wish to install FreqAI dependencies. The user should reply "yes" to this question if they wish to use FreqAI. If the user did not reply yes, they can manually install these dependencies after the install with:

``` bash
pip install -r requirements-freqai.txt
```

!!! Note
    Catboost will not be installed on arm devices (raspberry, Mac M1, ARM based VPS, ...), since it does not provide wheels for this platform.

### Usage with docker

For docker users, a dedicated tag with FreqAI dependencies is available as `:freqai`. As such - you can replace the image line in your docker-compose file with `image: freqtradeorg/freqtrade:develop_freqai`. This image contains the regular FreqAI dependencies. Similar to native installs, Catboost will not be available on ARM based devices.

## Common pitfalls

FreqAI cannot be combined with dynamic `VolumePairlists` (or any pairlist filter that adds and removes pairs dynamically). This is for performance reasons - FreqAI relies on making quick predictions/retrains. To do this effectively, it needs to download all the training data at the beginning of a dry/live instance. FreqAI stores and appends new candles automatically for future retrains. This means that if new pairs arrive later in the dry run due to a volume pairlist, it will not have the data ready. However, FreqAI does work with the `ShufflePairlist` or a `VolumePairlist` which keeps the total pairlist constant (but reorders the pairs according to volume).

## Credits

FreqAI is developed by a group of individuals who all contribute specific skillsets to the project.

Conception and software development:
Robert Caulk @robcaulk

Theoretical brainstorming and data analysis:
Elin Törnquist @th0rntwig

Code review and software architecture brainstorming:
@xmatthias

Software development:
Wagner Costa @wagnercosta

Beta testing and bug reporting:
Stefan Gehring @bloodhunter4rc, @longyu, @paranoidandy, @smidelis, Ryan McMullan @smarmau,
Juha Nykänen @suikula, Johan van der Vlugt @jooopiert, Richárd Józsa @richardjosza
improve docs 2022-08-10 09:56:42 +00:00			`![freqai-logo](assets/freqai_doc_logo.svg)`
add freqai logo to top of doc 2022-07-21 22:02:07 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`# FreqAI`

			`##Introduction`
add freqao backend machinery, user interface, documentation 2022-05-03 08:14:17 +00:00
improve class diagram 2022-09-22 19:32:12 +00:00			`FreqAI is a software designed to automate a variety of tasks associated with training a predictive machine learning model to generate market forecasts given a set of input features.`
Restructure and improve doc, add fiq 2022-08-17 20:35:26 +00:00
			`Features include:`

Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`* Self-adaptive retraining - Retrain models during [live deployments](freqai-running.md#running-the-model-live) to self-adapt to the market in a supervised manner`
			`* Rapid feature engineering - Create large rich [feature sets](freqai-feature-engineering.md#feature-engineering) (10k+ features) based on simple user-created strategies`
			`* High performance - Threading allows for adaptive model retraining on a separate thread (or on GPU if available) from model inferencing (prediction) and bot trade operations. Newest models and data are kept in RAM for rapid inferencing`
			`* Realistic backtesting - Emulate self-adaptive training on historic data with a [backtesting module](freqai-running.md#backtesting) that automates retraining`
			`* Extensibility - The generalized and robust architecture allows for incorporating any [machine learning library/method](freqai-configuration.md#using-different-prediction-models) available in Python. Eight examples are currently available, including classifiers, regressors, and a convolutional neural network`
			`* Smart outlier removal - Remove outliers from training and prediction data sets using a variety of [outlier detection techniques](freqai-outlier-detection.md)`
			`* Crash resilience - Store trained models to disk to make reloading from a crash fast and easy, and [purge obsolete files](freqai-data-handling.md#purging-old-model-data) for sustained dry/live runs`
			`* Automatic data normalization - [Normalize the data](freqai-feature-engineering.md#feature-normalization) in a smart and statistically safe way`
			`* Automatic data download - Compute timeranges for data downloads and update historic data (in live deployments)`
			`* Cleaning of incoming data - Handle NaNs safely before training and model inferencing`
			`* Dimensionality reduction - Reduce the size of the training data via [Principal Component Analysis](freqai-feature-engineering.md#data-dimensionality-reduction-with-principal-component-analysis)`
			`* Deploying bot fleets - Set one bot to train models while a fleet of [follower bots](freqai-running.md#setting-up-a-follower) inference the models and handle trades`
improve docs 2022-08-10 09:56:42 +00:00
			`## Quick start`

Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`The easiest way to quickly test FreqAI is to run it in dry mode with the following command:`
improve docs 2022-08-10 09:56:42 +00:00
			```bash
			`freqtrade trade --config config_examples/config_freqai.example.json --strategy FreqaiExampleStrategy --freqaimodel LightGBMRegressor --strategy-path freqtrade/templates`
			```

Restructure and improve doc, add fiq 2022-08-17 20:35:26 +00:00			`The user will see the boot-up process of automatic data downloading, followed by simultaneous training and trading.`
improve docs 2022-08-10 09:56:42 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`An example strategy, prediction model, and config to use as a starting points can be found in`
Restructure and improve doc, add fiq 2022-08-17 20:35:26 +00:00			`freqtrade/templates/FreqaiExampleStrategy.py`, `freqtrade/freqai/prediction_models/LightGBMRegressor.py`, and
improve docs 2022-08-10 09:56:42 +00:00			`config_examples/config_freqai.example.json`, respectively.
add freqao backend machinery, user interface, documentation 2022-05-03 08:14:17 +00:00
doc update thanks matthias 2022-05-17 18:41:42 +00:00			`## General approach`
Fix some typos 2022-05-15 14:25:08 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			The user provides FreqAI with a set of custom base indicators (the same way as in a [typical Freqtrade strategy](strategy-customization.md)) as well as target values (labels). For each pair in the whitelist, FreqAI trains a model to predict the target values based on the input of custom indicators. The models are then consistently retrained, with a frequency set by the user, to adapt to market conditions. FreqAI offers the ability to both backtest strategies (emulating reality with periodic retraining on historic data) and deploy dry/live runs. In dry/live conditions, FreqAI can be set to constant retraining in a background thread to keep models as up to date as possible.
add image of algorithmic overview to doc 2022-07-30 16:51:00 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`An overview of the algorithm, explaining the data processing pipeline and model usage, is shown below.`
add image of algorithmic overview to doc 2022-07-30 16:51:00 +00:00
Reduce image sizes in freqai doc (#7304) 2022-08-28 21:27:12 +00:00			`![freqai-algo](assets/freqai_algo.jpg)`
add freqao backend machinery, user interface, documentation 2022-05-03 08:14:17 +00:00
Restructure and improve doc, add fiq 2022-08-17 20:35:26 +00:00			`### Important machine learning vocabulary`
add freqao backend machinery, user interface, documentation 2022-05-03 08:14:17 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`Features - the parameters, based on historic data, on which a model is trained. All features for a single candle is stored as a vector. In FreqAI, the user builds a feature data sets from anything they can construct in the strategy.`

			`Labels - the target values that a model is trained toward. Each feature vector is associated with a single label that is defined by the user within the strategy. These labels intentionally look into the future, and are not available to the model during dry/live/backtesting.`
add freqao backend machinery, user interface, documentation 2022-05-03 08:14:17 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`Training - the process of "teaching" the model to match the feature sets to the associated labels. Different types of models "learn" in different ways. More information about the different models can be found [here](freqai-configuration.md#using-different-prediction-models).`
add freqao backend machinery, user interface, documentation 2022-05-03 08:14:17 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`Train data - a subset of the feature data set that is fed to the model during training. This data directly influences weight connections in the model.`
add freqao backend machinery, user interface, documentation 2022-05-03 08:14:17 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`Test data - a subset of the feature data set that is used to evaluate the performance of the model after training. This data does not influence nodal weights within the model.`
add freqao backend machinery, user interface, documentation 2022-05-03 08:14:17 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`Inferencing - the process of feeding a trained model new data on which it will make a prediction.`
add freqao backend machinery, user interface, documentation 2022-05-03 08:14:17 +00:00
give beta testers more information in the doc 2022-05-15 12:01:53 +00:00			`## Install prerequisites`

slightly update doc wording 2022-08-14 15:08:29 +00:00			`The normal Freqtrade install process will ask the user if they wish to install FreqAI dependencies. The user should reply "yes" to this question if they wish to use FreqAI. If the user did not reply yes, they can manually install these dependencies after the install with:`
give beta testers more information in the doc 2022-05-15 12:01:53 +00:00
Update some docs wording 2022-07-22 18:27:25 +00:00			``` bash
			`pip install -r requirements-freqai.txt`
			```
give beta testers more information in the doc 2022-05-15 12:01:53 +00:00
Exclude aarch64 from catboost requirements 2022-08-01 07:32:25 +00:00			`!!! Note`
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`Catboost will not be installed on arm devices (raspberry, Mac M1, ARM based VPS, ...), since it does not provide wheels for this platform.`
Exclude aarch64 from catboost requirements 2022-08-01 07:32:25 +00:00
Update docs for freqai docker container 2022-08-17 08:35:56 +00:00			`### Usage with docker`

Reorganise multipage doc 2022-09-17 15:43:39 +00:00			For docker users, a dedicated tag with FreqAI dependencies is available as `:freqai`. As such - you can replace the image line in your docker-compose file with `image: freqtradeorg/freqtrade:develop_freqai`. This image contains the regular FreqAI dependencies. Similar to native installs, Catboost will not be available on ARM based devices.
add record of contribution to doc and source 2022-07-23 11:04:06 +00:00
Add Common pitfalls 2022-09-18 12:51:11 +00:00			`## Common pitfalls`

			FreqAI cannot be combined with dynamic `VolumePairlists` (or any pairlist filter that adds and removes pairs dynamically). This is for performance reasons - FreqAI relies on making quick predictions/retrains. To do this effectively, it needs to download all the training data at the beginning of a dry/live instance. FreqAI stores and appends new candles automatically for future retrains. This means that if new pairs arrive later in the dry run due to a volume pairlist, it will not have the data ready. However, FreqAI does work with the `ShufflePairlist` or a `VolumePairlist` which keeps the total pairlist constant (but reorders the pairs according to volume).

add record of contribution to doc and source 2022-07-23 11:04:06 +00:00			`## Credits`
slightly update doc wording 2022-08-14 15:08:29 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`FreqAI is developed by a group of individuals who all contribute specific skillsets to the project.`
add record of contribution to doc and source 2022-07-23 11:04:06 +00:00
			`Conception and software development:`
			`Robert Caulk @robcaulk`

Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`Theoretical brainstorming and data analysis:`
Restructure and improve doc, add fiq 2022-08-17 20:35:26 +00:00			`Elin Törnquist @th0rntwig`
add record of contribution to doc and source 2022-07-23 11:04:06 +00:00
Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`Code review and software architecture brainstorming:`
add record of contribution to doc and source 2022-07-23 11:04:06 +00:00			`@xmatthias`

Reorganise multipage doc 2022-09-17 15:43:39 +00:00			`Software development:`
reorganize freqai docs for easier reading, add detailed file structure description 2022-09-11 15:29:14 +00:00			`Wagner Costa @wagnercosta`

add record of contribution to doc and source 2022-07-23 11:04:06 +00:00			`Beta testing and bug reporting:`
reorganize freqai docs for easier reading, add detailed file structure description 2022-09-11 15:29:14 +00:00			`Stefan Gehring @bloodhunter4rc, @longyu, @paranoidandy, @smidelis, Ryan McMullan @smarmau,`
			`Juha Nykänen @suikula, Johan van der Vlugt @jooopiert, Richárd Józsa @richardjosza`