LibtorchDRLControlTrainer

Overview

This object is supposed to train a Deep Reinforcement Learning (DRL) controller using the Proximal Policy Optimization (PPO) algorithm Schulman et al. (2017).

Example Input File Syntax

warning

The detailed documentation of this object is only available when Moose is compiled with Libtorch. For instructions on how to compile Moose with Libtorch, visit the general installation webpage or click here.

References

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017.

BibTeX

@article{schulman2017proximal,
    author = "Schulman, John and Wolski, Filip and Dhariwal, Prafulla and Radford, Alec and Klimov, Oleg",
    title = "Proximal policy optimization algorithms",
    journal = "arXiv preprint arXiv:1707.06347",
    year = "2017"
}

RIS

TY  - JOUR
AU  - Schulman, John
AU  - Wolski, Filip
AU  - Dhariwal, Prafulla
AU  - Radford, Alec
AU  - Klimov, Oleg
TI  - Proximal policy optimization algorithms
JO  - arXiv preprint arXiv:1707.06347
PY  - 2017
ER  -

Plain Text

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017.

Overview
Example Input File Syntax
References