evorl.algorithms.erl.erl_td3.erl_origin¶
Module Contents¶
Classes¶
Original ERL impl. |
Functions¶
API¶
- class evorl.algorithms.erl.erl_td3.erl_origin.ERLTrainMetric[source]¶
Bases:
evorl.algorithms.erl.erl_workflow.ERLTrainMetric- num_updates_per_iter: chex.Array¶
‘zeros(…)’
- time_cost_per_iter: float¶
0.0
- class evorl.algorithms.erl.erl_td3.erl_origin.ERLWorkflow(**kwargs)[source]¶
Bases:
evorl.algorithms.erl.erl_td3.erl_ga.ERLGAWorkflowOriginal ERL impl.
Have dynamic training updates per iteration, i.e., #rl_updates = #sampled_timesteps_this_iter. Therefore the
step()function cannot be directly jitted.- step(state: evorl.types.State) tuple[evorl.metrics.MetricBase, evorl.types.State][source]¶
The basic step function for the workflow to update agent.