This website requires JavaScript.
Explore
Help
Sign In
adelorenzo
/
stable
Watch
1
Star
0
Fork
0
You've already forked stable
Code
Issues
Pull Requests
Packages
Projects
Releases
Wiki
Activity
17,491
Commits
14
Branches
74
Tags
197
MiB
acf3484e88
Commit Graph
3 Commits
Author
SHA1
Message
Date
robcaulk
acf3484e88
add multiprocessing variant of ReinforcementLearningPPO
2022-08-24 13:00:55 +02:00
robcaulk
926023935f
make base 3ac and base 5ac environments. TDQN defaults to 3AC.
2022-08-24 13:00:55 +02:00
robcaulk
91683e1dca
restructure RL so that user can customize environment
2022-08-24 13:00:55 +02:00