evorl.algorithms.erl.erl_td3.erl_es¶
Module Contents¶
Classes¶
ERL w/ ES. |
|
API¶
- class evorl.algorithms.erl.erl_td3.erl_es.ERLESWorkflow(**kwargs)[source]¶
Bases:
evorl.algorithms.erl.erl_td3.erl_td3_workflow.ERLTD3WorkflowTemplateERL w/ ES.
Configs:
EC: n actors
RL: k actors + k critics
Shared replay buffer
- evaluate(state: evorl.types.State) tuple[evorl.metrics.MetricBase, evorl.types.State][source]¶
- learn(state: evorl.types.State) evorl.types.State[source]¶
- step(state: evorl.types.State) tuple[evorl.metrics.MetricBase, evorl.types.State][source]¶