evorl.algorithms.erl.erl_td3.erl_eda¶
Module Contents¶
Classes¶
ERL w/ EDA. |
|
Functions¶
API¶
- class evorl.algorithms.erl.erl_td3.erl_eda.ERLEDAWorkflow(**kwargs)[source]¶
Bases:
evorl.algorithms.erl.erl_td3.erl_td3_workflow.ERLTD3WorkflowTemplateERL w/ EDA.
Configs:
EC: n actors
RL: 1 (actor,critic)
Shared replay buffer
RL will be injected into the pop mean. Support all EDA based ES algorithms.
- evaluate(state: evorl.types.State) tuple[evorl.metrics.MetricBase, evorl.types.State][source]¶
- learn(state: evorl.types.State) evorl.types.State[source]¶
- step(state: evorl.types.State) tuple[evorl.metrics.MetricBase, evorl.types.State][source]¶
- class evorl.algorithms.erl.erl_td3.erl_eda.EvaluateMetric[source]¶
Bases:
evorl.metrics.MetricBase- pop_center_episode_lengths: chex.Array¶
None
- pop_center_episode_returns: chex.Array¶
None
- rl_episode_lengths: chex.Array¶
None
- rl_episode_returns: chex.Array¶
None
- evorl.algorithms.erl.erl_td3.erl_eda.build_rl_update_fn(agent: evorl.agent.Agent, optimizer: optax.GradientTransformation, config: omegaconf.DictConfig)[source]¶