evorl.evaluators.mo_brax_evaluator¶
Module Contents¶
Classes¶
Mutli-objective evaluator for Brax environments. |
Functions¶
Evaulate a batch of episodic trajectories. |
|
Fast evaulate a batch of episodic trajectories. |
API¶
- class evorl.evaluators.mo_brax_evaluator.BraxEvaluator[source]¶
Bases:
evorl.evaluators.evaluator.EvaluatorMutli-objective evaluator for Brax environments.
- Variables:
metric_names – The names of the metrics to evaluate, default is (“reward”, “episode_lengths”)
- metric_names: tuple[str]¶
‘pytree_field(…)’
- evorl.evaluators.mo_brax_evaluator.eval_env_step(env_fn: evorl.envs.EnvStepFn, action_fn: evorl.agent.AgentActionFn, env_state: evorl.envs.EnvState, agent_state: evorl.agent.AgentState, key: chex.PRNGKey, metric_names: tuple[str] = ()) tuple[evorl.rollout.SampleBatch, evorl.envs.EnvState][source]¶
- evorl.evaluators.mo_brax_evaluator.eval_metrics(env_fn: collections.abc.Callable[[evorl.envs.EnvState, evorl.types.Action], evorl.envs.EnvState], action_fn: collections.abc.Callable[[evorl.agent.AgentState, evorl.rollout.SampleBatch, chex.PRNGKey], tuple[evorl.types.Action, evorl.types.PolicyExtraInfo]], env_state: evorl.envs.EnvState, agent_state: evorl.agent.AgentState, key: chex.PRNGKey, rollout_length: int, discount: float, metric_names: tuple[str] = ()) tuple[evorl.types.PyTreeDict, evorl.envs.EnvState][source]¶
- evorl.evaluators.mo_brax_evaluator.eval_rollout_episode(env_fn: collections.abc.Callable[[evorl.envs.EnvState, evorl.types.Action], evorl.envs.EnvState], action_fn: collections.abc.Callable[[evorl.agent.AgentState, evorl.rollout.SampleBatch, chex.PRNGKey], tuple[evorl.types.Action, evorl.types.PolicyExtraInfo]], env_state: evorl.envs.EnvState, agent_state: evorl.agent.AgentState, key: chex.PRNGKey, rollout_length: int, metric_names: tuple[str] = ()) tuple[evorl.rollout.SampleBatch, evorl.envs.EnvState][source]¶
Evaulate a batch of episodic trajectories.
The retruned metrics are defined by
metric_names.
- evorl.evaluators.mo_brax_evaluator.fast_eval_metrics(env_fn: collections.abc.Callable[[evorl.envs.EnvState, evorl.types.Action], evorl.envs.EnvState], action_fn: collections.abc.Callable[[evorl.agent.AgentState, evorl.rollout.SampleBatch, chex.PRNGKey], tuple[evorl.types.Action, evorl.types.PolicyExtraInfo]], env_state: evorl.envs.EnvState, agent_state: evorl.agent.AgentState, key: chex.PRNGKey, rollout_length: int, metric_names: tuple[str] = ()) tuple[evorl.types.PyTreeDict, evorl.envs.EnvState][source]¶
Fast evaulate a batch of episodic trajectories.
The retruned metrics are defined by
metric_names.