Enhancing Twin Delayed Deep Deterministic Policy Gradient with Cross-Entropy Method

MIT licensed

Hieu Trung Nguyen, Khang Tran and Ngoc Hoang Luong

Setup

$ git clone https://github.com/junhill-2000/eTD3.git
$ cd eTD3
  • The following packages are needed:

tianshou==0.4.4
gym==0.19.0
mujoco_py==2.0.2.13 (need to install mujoco200 from http://www.mujoco.org/)

Usage

train agent with eTD3

python3 mujoco_td3_mod_v3.py --task [environment_name] --seed [seed] --logdir [log_directory]

Visualize (comming soon)

Acknowledgement

Our source code is

 

 

 

To finish reading, please visit source site