memorax.networks.heads

memorax.networks.heads#

Output heads for different RL objectives.

Categorical - Categorical policy for discrete actions.

Gaussian - Gaussian policy for continuous actions.

SquashedGaussian - Squashed Gaussian policy (tanh-bounded).

VNetwork - State value function head.

DiscreteQNetwork - Q-network for discrete actions.

ContinuousQNetwork - Q-network for continuous actions.

Alpha - Learnable temperature parameter for SAC.