evorl.algorithms.erl.cemrl_td3.cemrl¶
Module Contents¶
Classes¶
1 critic + n actors + 1 replay buffer. |
|
Functions¶
API¶
- class evorl.algorithms.erl.cemrl_td3.cemrl.CEMRLWorkflow(**kwargs)[source]¶
Bases:
evorl.algorithms.erl.cemrl_td3.cemrl_td3_workflow.CEMRLTD3WorkflowTemplate1 critic + n actors + 1 replay buffer.
We use shard_map to split and parallel the population.
- evaluate(state: evorl.types.State) tuple[evorl.metrics.MetricBase, evorl.types.State][source]¶
- learn(state: evorl.types.State) evorl.types.State[source]¶
- step(state: evorl.types.State) tuple[evorl.metrics.MetricBase, evorl.types.State][source]¶
- class evorl.algorithms.erl.cemrl_td3.cemrl.EvaluateMetric[source]¶
Bases:
evorl.metrics.MetricBase- pop_center_episode_lengths: chex.Array¶
None
- pop_center_episode_returns: chex.Array¶
None