robcaulk
|
81fd2e588f
|
ensure typing, remove unsued code
|
2022-11-26 12:11:59 +01:00 |
|
robcaulk
|
9f13d99b99
|
improve parameter table, add better documentation for custom calculate_reward, add various helpful notes in docstrings etc
|
2022-11-26 11:32:39 +01:00 |
|
Matthias
|
8f1a8c752b
|
Add freqairl docker build process
|
2022-11-24 07:00:12 +01:00 |
|
Matthias
|
3d26659d5e
|
Fix some doc typos
|
2022-11-23 20:09:55 +01:00 |
|
robcaulk
|
d02da279f8
|
document the simplifications of the training environment
|
2022-11-19 13:20:20 +01:00 |
|
robcaulk
|
60fcd8dce2
|
fix skipped mac test, fix RL bug in add_state_info, fix use of __import__, revise doc
|
2022-11-17 21:50:02 +01:00 |
|
robcaulk
|
c8d3e57712
|
add note that these environments are designed for short-long bots only.
|
2022-11-13 17:30:56 +01:00 |
|
robcaulk
|
c76afc255a
|
explain how to choose environments, and how to customize them
|
2022-11-13 17:26:11 +01:00 |
|
robcaulk
|
90f168d1ff
|
remove more user references. cleanup dataprovider
|
2022-11-13 17:06:06 +01:00 |
|
robcaulk
|
f8f553ec14
|
remove references to "the user"
|
2022-11-13 16:58:36 +01:00 |
|
robcaulk
|
388ca21200
|
update docs, fix bug in environment
|
2022-11-13 16:56:31 +01:00 |
|
robcaulk
|
9c6b97c678
|
ensure normalization acceleration methods are employed in RL
|
2022-11-12 12:01:59 +01:00 |
|
robcaulk
|
e5204101d9
|
add tensorboard back to reqs to keep default integration working (and for docker)
|
2022-10-05 21:34:10 +02:00 |
|
robcaulk
|
ab4705efd2
|
provide background and goals for RL in doc
|
2022-10-05 16:39:38 +02:00 |
|
robcaulk
|
936ca24482
|
separate RL install from general FAI install, update docs
|
2022-10-05 15:58:54 +02:00 |
|
robcaulk
|
292d72d593
|
automatically handle model_save_type for user
|
2022-10-03 18:42:20 +02:00 |
|
robcaulk
|
cf882fa84e
|
fix tests
|
2022-10-01 20:26:41 +02:00 |
|
robcaulk
|
ab9d781b06
|
add reinforcement learning page to docs
|
2022-10-01 17:50:05 +02:00 |
|