evorl.networks.linear¶
Module Contents¶
Classes¶
Functions¶
Creates a Q network for discrete action space: (obs) -> q_values. |
|
Creates an MLP network. |
|
Creates a policy network. |
|
Creates a Q network: (obs, action) -> value. |
|
Creates a V network: (obs) -> value. |
|
Creates multiple MLP networks in parallel. |
Data¶
API¶
- evorl.networks.linear.ActivationFn¶
None
- evorl.networks.linear.Initializer¶
None
- class evorl.networks.linear.MLP[source]¶
Bases:
flax.linen.ModuleMLP module.
- activation: evorl.networks.linear.ActivationFn¶
None
- activation_final: evorl.networks.linear.ActivationFn | None¶
None
- kernel_init: evorl.networks.linear.Initializer¶
‘lecun_uniform(…)’
- layer_sizes: collections.abc.Sequence[int]¶
None
- norm_layer: flax.linen.Module | None¶
None
- use_bias: bool¶
True
- class evorl.networks.linear.SNMLP[source]¶
Bases:
flax.linen.ModuleMLP module with Spectral Normalization.
- activation: evorl.networks.linear.ActivationFn¶
None
- activation_final: evorl.networks.linear.ActivationFn | None¶
None
- kernel_init: evorl.networks.linear.Initializer¶
‘lecun_uniform(…)’
- layer_sizes: collections.abc.Sequence[int]¶
None
- use_bias: bool¶
True
- evorl.networks.linear.make_discrete_q_network(action_size: int, n_stack: int = 1, hidden_layer_sizes: collections.abc.Sequence[int] = (256, 256), activation: evorl.networks.linear.ActivationFn = nn.relu, kernel_init: evorl.networks.linear.Initializer = jax.nn.initializers.lecun_uniform(), norm_layer_type: str = 'none', obs_key: str = '') flax.linen.Module[source]¶
Creates a Q network for discrete action space: (obs) -> q_values.
- evorl.networks.linear.make_mlp(layer_sizes: collections.abc.Sequence[int], activation: evorl.networks.linear.ActivationFn = nn.relu, kernel_init: evorl.networks.linear.Initializer = jax.nn.initializers.lecun_uniform(), activation_final: evorl.networks.linear.ActivationFn | None = None, use_bias: bool = True, norm_layer_type: str = 'none') flax.linen.Module[source]¶
Creates an MLP network.
- evorl.networks.linear.make_policy_network(action_size: int, hidden_layer_sizes: collections.abc.Sequence[int] = (256, 256), use_bias: bool = True, activation: evorl.networks.linear.ActivationFn = nn.relu, activation_final: evorl.networks.linear.ActivationFn | None = None, norm_layer_type: str = 'none', obs_key: str = '') flax.linen.Module[source]¶
Creates a policy network.
- evorl.networks.linear.make_q_network(n_stack: int = 1, hidden_layer_sizes: collections.abc.Sequence[int] = (256, 256), activation: evorl.networks.linear.ActivationFn = nn.relu, kernel_init: evorl.networks.linear.Initializer = jax.nn.initializers.lecun_uniform(), norm_layer_type: str = 'none', obs_key: str = '') flax.linen.Module[source]¶
Creates a Q network: (obs, action) -> value.
- evorl.networks.linear.make_v_network(hidden_layer_sizes: collections.abc.Sequence[int] = (256, 256), activation: evorl.networks.linear.ActivationFn = nn.relu, kernel_init: evorl.networks.linear.Initializer = jax.nn.initializers.lecun_uniform(), norm_layer_type: str = 'none', obs_key: str = '') flax.linen.Module[source]¶
Creates a V network: (obs) -> value.
- evorl.networks.linear.make_vmap_mlp(layer_sizes: collections.abc.Sequence[int], activation: evorl.networks.linear.ActivationFn = nn.relu, kernel_init: evorl.networks.linear.Initializer = jax.nn.initializers.lecun_uniform(), activation_final: evorl.networks.linear.ActivationFn | None = None, use_bias: bool = True, norm_layer_type: str = 'none', out_axes: int = -2)[source]¶
Creates multiple MLP networks in parallel.