evorl.envs.wrappers.reward_wrapper
Module Contents
Classes
RewardScaleWrapper | Scale the reward by a factor.
SparseRewardWrapper | Convert dense reward to sparse reward.
API
- class evorl.envs.wrappers.reward_wrapper.RewardScaleWrapper(env: evorl.envs.env.Env, reward_scale: float)
Bases: evorl.envs.wrappers.wrapper.Wrapper
Scale the reward by a factor.
Usage:
Use EpisodeWrapper(RewardScaleWrapper(env)) to get the scaled info.episode_return.
Use RewardScaleWrapper(EpisodeWrapper(env)) to get the original info.episode_return.
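The effect of wrapper order on info.episode_return can be illustrated with a small, self-contained sketch. It does not import evorl: ToyState, ToyEnv, ToyRewardScale, ToyEpisode, and rollout are hypothetical stand-ins written for this example, and the toy keeps episode_return directly on the state rather than under info. It only shows why the outer wrapper decides which reward gets accumulated.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class ToyState:
    """Hypothetical stand-in for EnvState (not the evorl class)."""
    reward: float = 0.0
    episode_return: float = 0.0


class ToyEnv:
    """Hypothetical environment that pays a reward of 1.0 on every step."""

    def reset(self):
        return ToyState()

    def step(self, state, action):
        return replace(state, reward=1.0)


class ToyRewardScale:
    """Stand-in for RewardScaleWrapper: multiplies the reward by a factor."""

    def __init__(self, env, reward_scale):
        self.env = env
        self.reward_scale = reward_scale

    def reset(self):
        return self.env.reset()

    def step(self, state, action):
        state = self.env.step(state, action)
        return replace(state, reward=state.reward * self.reward_scale)


class ToyEpisode:
    """Stand-in for EpisodeWrapper: accumulates whatever reward it observes."""

    def __init__(self, env):
        self.env = env

    def reset(self):
        return self.env.reset()

    def step(self, state, action):
        prev_return = state.episode_return
        state = self.env.step(state, action)
        return replace(state, episode_return=prev_return + state.reward)


def rollout(env, num_steps):
    """Step the (wrapped) toy environment num_steps times."""
    state = env.reset()
    for _ in range(num_steps):
        state = env.step(state, action=None)
    return state


# Scaling inside the episode wrapper: the accumulated return is scaled.
scaled = rollout(ToyEpisode(ToyRewardScale(ToyEnv(), 0.5)), 4)
# Scaling outside the episode wrapper: the accumulated return is unscaled.
original = rollout(ToyRewardScale(ToyEpisode(ToyEnv()), 0.5), 4)
print(scaled.episode_return, original.episode_return)
```

In both orderings the per-step reward seen by the caller is scaled; only the accumulated return differs, because the episode stand-in sums the reward produced by whatever it wraps.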
- reset(key: chex.PRNGKey) -> evorl.envs.env.EnvState
- step(state: evorl.envs.env.EnvState, action: evorl.types.Action) -> evorl.envs.env.EnvState
- class evorl.envs.wrappers.reward_wrapper.SparseRewardWrapper(env: evorl.envs.env.Env, sparse_length: int)
Bases: evorl.envs.wrappers.wrapper.Wrapper
Convert dense reward to sparse reward.
Dense rewards are accumulated and emitted as a single sum, so the reward sequence becomes: 0, 0, …, sum(rewards), 0, 0, …, sum(rewards)
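The dense-to-sparse pattern above can be sketched on a plain list of rewards. This is an illustration only, not the wrapper's implementation: sparsify is a hypothetical helper, and it assumes the accumulated sum is released on every sparse_length-th step. How the real wrapper treats a trailing partial window (for example at episode termination) is not stated in this page, so the sketch simply leaves those steps at 0.

```python
def sparsify(rewards, sparse_length):
    """Accumulate rewards and release their sum every `sparse_length` steps.

    Hypothetical helper for illustration; rewards in a trailing partial
    window are never released (an assumption of this sketch).
    """
    if sparse_length < 1:
        raise ValueError("sparse_length must be >= 1")
    sparse = []
    accumulated = 0.0
    for t, r in enumerate(rewards, start=1):
        accumulated += r
        if t % sparse_length == 0:
            # End of a window: emit the accumulated sum and start over.
            sparse.append(accumulated)
            accumulated = 0.0
        else:
            # Inside a window: the agent sees no reward.
            sparse.append(0.0)
    return sparse


print(sparsify([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], sparse_length=3))
# [0.0, 0.0, 6.0, 0.0, 0.0, 15.0]
```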
- reset(key: chex.PRNGKey) -> evorl.envs.env.EnvState
- step(state: evorl.envs.env.EnvState, action: evorl.types.Action) -> evorl.envs.env.EnvState