Tags: thu-ml/tianshou

v1.0.0

Update publish.yaml, typo [skip ci]

v0.5.0

update version to 0.5.0 (#826)

v0.4.11

fix info not being passed in PGPolicy (#787)

close #775

v0.4.10

bump version to 0.4.10 (#757)

v0.4.9

bump version to 0.4.9 (#684)

v0.4.8

Add vecenv wrappers for obs_norm to support running mujoco experiment with envpool (#628)

- add VectorEnvWrapper and VectorEnvNormObs
- store obs_rms in policy save/load
- align mujoco scripts with atari: obs_norm, envpool, wandb and README
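
The observation normalization mentioned above relies on running statistics (`obs_rms`) that are updated from batches of observations and saved alongside the policy. The following is a minimal, dependency-free sketch of that idea, not the code from this commit: the class name `RunningMeanStd`, its methods, and the `eps` parameter are assumptions made for illustration, and the actual `VectorEnvNormObs` API may differ.

```python
import math


class RunningMeanStd:
    """Running per-dimension mean/variance (hypothetical helper for illustration)."""

    def __init__(self, dim, eps=1e-8):
        self.mean = [0.0] * dim
        self.var = [1.0] * dim
        self.count = 0
        self.eps = eps  # assumed small constant to avoid division by zero

    def update(self, batch):
        """Merge a batch of observations (a list of equal-length lists)."""
        n = len(batch)
        if n == 0:
            return
        total = self.count + n
        for d in range(len(self.mean)):
            col = [obs[d] for obs in batch]
            b_mean = sum(col) / n
            b_var = sum((x - b_mean) ** 2 for x in col) / n
            delta = b_mean - self.mean[d]
            # Combine the old and new sums of squared deviations.
            m2 = (self.var[d] * self.count + b_var * n
                  + delta ** 2 * self.count * n / total)
            self.mean[d] += delta * n / total
            self.var[d] = m2 / total
        self.count = total

    def normalize(self, obs):
        """Scale one observation using the current statistics."""
        return [(x - m) / math.sqrt(v + self.eps)
                for x, m, v in zip(obs, self.mean, self.var)]

    def state_dict(self):
        """Return the statistics so they can be saved with a policy."""
        return {"mean": list(self.mean), "var": list(self.var),
                "count": self.count}

    def load_state_dict(self, state):
        """Restore statistics previously produced by state_dict()."""
        self.mean = list(state["mean"])
        self.var = list(state["var"])
        self.count = state["count"]


rms = RunningMeanStd(dim=2)
rms.update([[1.0, 10.0], [3.0, 30.0]])
rms.update([[5.0, 50.0]])

# Restoring the statistics reproduces the same normalization at test time.
restored = RunningMeanStd(dim=2)
restored.load_state_dict(rms.state_dict())
```

Saving the statistics with the policy matters because a policy trained on normalized observations expects the same scaling when it is loaded for evaluation.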

v0.4.7

rename save_fn to save_best_fn to avoid ambiguity (#575)

This PR also introduces `tianshou.utils.deprecation` for a unified deprecation wrapper.
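
A rename like this is typically kept backward compatible by accepting the old keyword for a while and emitting a warning. The sketch below shows one way that could look using only the standard library; it is not the implementation of `tianshou.utils.deprecation`, and the decorator `renamed_kwarg` and the function `train` are hypothetical names introduced for illustration.

```python
import functools
import warnings


def renamed_kwarg(old, new):
    """Accept a deprecated keyword `old` and forward it as `new` (hypothetical helper)."""

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            if old in kwargs:
                if new in kwargs:
                    raise TypeError(
                        f"pass only {new!r}, not both {old!r} and {new!r}"
                    )
                warnings.warn(
                    f"{old} is deprecated; use {new} instead",
                    DeprecationWarning,
                    stacklevel=2,  # point the warning at the caller
                )
                kwargs[new] = kwargs.pop(old)
            return func(*args, **kwargs)

        return wrapper

    return decorator


@renamed_kwarg("save_fn", "save_best_fn")
def train(save_best_fn=None):
    """Stand-in for a trainer entry point; returns the callback it received."""
    return save_best_fn


def checkpoint(policy):
    """Placeholder callback used only for the demonstration."""
    return "saved"


with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    received = train(save_fn=checkpoint)  # old name still works, with a warning
```

Callers using the new name, `train(save_best_fn=checkpoint)`, go straight through without a warning.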

v0.4.6

Add VizDoom PPO example and results (#533)

* update vizdoom ppo example

* update README with results

v0.4.6.post1

fix conda support and keep API compatibility (#536)

* loosen constraints

* fix nni issue (#478)

* fix coverage

v0.4.5

Fix critic network for Discrete CRR (#485)

- Fix an inconsistency in the implementation of Discrete CRR: it now uses the `Critic` class for its critic, following the convention of other actor-critic policies;
- Update several offline policies to use the `ActorCritic` class for their optimizers, eliminating randomness caused by parameter sharing between actor and critic;
- Add `writer.flush()` in TensorboardLogger so that results are written in real time;
- Allow `test_collector=None` in 3 trainers to turn off testing during training;
- Update the Atari offline results in README.md;
- Move Atari offline RL examples to `examples/offline` and tests to `test/offline`, per review comments.