No more hand-tuning rewards: Masked constrained policy optimization for safe reinforcement learning

Stef Van Havermaet, Yara Khaluf, Pieter Simoens

Research output: Chapter in Book/Report/Conference proceeding; Conference paper; Academic; peer-review

Fingerprint

Computer Science

Engineering

Chemical Engineering