Tags: johnolven/bsuite

0.3.5

Relax versioning constraints for tf/tfp, bump patch version to 0.3.5. PiperOrigin-RevId: 358134728 Change-Id: Iba086222e5c97d8e6410097b38668a3935c923dc

0.3.4

Bump version again (incorrect tag was used before) and use PyPI's rlax. PiperOrigin-RevId: 334553236 Change-Id: I88eca16269fabf975dd0c8f7ccd8a8c2374b78c9

0.3.3

Internal change. PiperOrigin-RevId: 334334765 Change-Id: I1080d6b302fb77b3d75633c4d2ed487e6b26730d

0.3.2

Bump version to 0.3.2. PiperOrigin-RevId: 315450863 Change-Id: I23a258dd001aae14bde3edd7c29df79579030eaf

0.3.1

Drop meaningless "level" columns from the correct dataframe. (Does not affect performance) PiperOrigin-RevId: 313763435 Change-Id: I8973213e969be65a0a59ca73b07a1b8a628751a2

0.3.0

Calculate best episode using full episode return in cartpole_swingup. Return is non-monotonic in this problem; currently this cherry-picks the peak of return during the episode. Also applied same change to base cartpole for consistency and efficiency, but cartpole return is monotonic (so not a bug). PiperOrigin-RevId: 308033113 Change-Id: I9add00d41f8e87d518e00c3fef9cd9ad7ad18d0b
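
The difference described in this commit message can be illustrated with a minimal sketch. This is not the bsuite code; the function names and reward values below are hypothetical and only show why taking the peak of the running return overstates performance when rewards can be negative.

```python
from itertools import accumulate


def peak_running_return(rewards):
    """Largest cumulative return reached at any step within the episode."""
    return max(accumulate(rewards))


def full_episode_return(rewards):
    """Cumulative return at the end of the episode."""
    return sum(rewards)


# Hypothetical per-step rewards for two episodes (illustrative values only).
episodes = [
    [1.0, 1.0, -1.0, -1.0],  # running return rises to 2.0, then falls back to 0.0
    [0.5, 0.5, 0.0, 0.5],    # running return never decreases, ends at 1.5
]

best_by_peak = max(peak_running_return(ep) for ep in episodes)
best_by_full = max(full_episode_return(ep) for ep in episodes)

print(best_by_peak)  # 2.0, taken from mid-episode in the first episode
print(best_by_full)  # 1.5, the best return actually achieved over a whole episode
```

When every reward is non-negative the running return is monotonic, so both functions agree; this matches the note that the base cartpole change was for consistency rather than a bug fix.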

0.2.0

Re-organize baselines into subdirectories according to their provenance/libraries used.
- tf: TensorFlow 2/Sonnet 2/TRFL-based agents.
- jax: JAX/Haiku/rlax-based agents.
- third_party: Agents created by third parties (not DeepMind).
Also adopt more standard naming practice within each agent folder (agent.py). PiperOrigin-RevId: 305674544 Change-Id: I3d4f076fb96d2e0250cfbb3f1adf163ce6932e97

0.2

Re-organize baselines into subdirectories according to their provenance/libraries used.
- tf: TensorFlow 2/Sonnet 2/TRFL-based agents.
- jax: JAX/Haiku/rlax-based agents.
- third_party: Agents created by third parties (not DeepMind).
Also adopt more standard naming practice within each agent folder (agent.py). PiperOrigin-RevId: 305674544 Change-Id: I3d4f076fb96d2e0250cfbb3f1adf163ce6932e97