Author | Commit | Message | Date
robcaulk | 4b9499e321 | improve nomenclature and fix short exit bug | 2022-08-24 13:00:55 +02:00
sonnhfit | 4baa36bdcf | fix persist a single training environment for PPO | 2022-08-24 13:00:55 +02:00
robcaulk | f95602f6bd | persist a single training environment. | 2022-08-24 13:00:55 +02:00
robcaulk | 5d4e5e69fe | reinforce training with state info, reinforce prediction with state info, restructure config to accommodate all parameters from any user imported model type. Set 5Act to default env on TDQN. Clean example config. | 2022-08-24 13:00:55 +02:00
sonnhfit | 7962a1439b | remove keep low profit | 2022-08-24 13:00:55 +02:00
sonnhfit | 81b5aa66e8 | make env keep current position when low profit | 2022-08-24 13:00:55 +02:00
sonnhfit | 45218faeb0 | fix coding convention | 2022-08-24 13:00:55 +02:00
robcaulk | b90da46b1b | improve price df handling to enable backtesting | 2022-08-24 13:00:55 +02:00
MukavaValkku | 2080ff86ed | 5ac base fixes in logic | 2022-08-24 13:00:55 +02:00
sonnhfit | 0475b7cb18 | remove unuse code and fix coding conventions | 2022-08-24 13:00:55 +02:00
robcaulk | bf7ceba958 | set cpu threads in config | 2022-08-24 13:00:55 +02:00
robcaulk | acf3484e88 | add multiprocessing variant of ReinforcementLearningPPO | 2022-08-24 13:00:55 +02:00
robcaulk | 926023935f | make base 3ac and base 5ac environments. TDQN defaults to 3AC. | 2022-08-24 13:00:55 +02:00
robcaulk | 91683e1dca | restructure RL so that user can customize environment | 2022-08-24 13:00:55 +02:00