evorl.algorithms.erl.cemrl_td3.cemrl

Module Contents

Classes

CEMRLWorkflow

1 critic + n actors + 1 replay buffer.

EvaluateMetric

Functions

API

class evorl.algorithms.erl.cemrl_td3.cemrl.CEMRLWorkflow(**kwargs)[source]

Bases: evorl.algorithms.erl.cemrl_td3.cemrl_td3_workflow.CEMRLTD3WorkflowTemplate

1 critic + n actors + 1 replay buffer.

We use shard_map to split and parallel the population.

evaluate(state: evorl.types.State) tuple[evorl.metrics.MetricBase, evorl.types.State][source]
learn(state: evorl.types.State) evorl.types.State[source]
classmethod name()[source]
step(state: evorl.types.State) tuple[evorl.metrics.MetricBase, evorl.types.State][source]
class evorl.algorithms.erl.cemrl_td3.cemrl.EvaluateMetric[source]

Bases: evorl.metrics.MetricBase

pop_center_episode_lengths: chex.Array

None

pop_center_episode_returns: chex.Array

None

evorl.algorithms.erl.cemrl_td3.cemrl.get_std_statistics(variance)[source]