# Policy Gradient

- [Off-Policy Actor-Critic](/deep-reinforcement-learning/fu-lu/policy-gradient/off-policy-actor-critic.md)
- [Generalized Advantage Estimation](/deep-reinforcement-learning/fu-lu/policy-gradient/advantage-estimation.md)
- [Soft Actor-Critic](/deep-reinforcement-learning/fu-lu/policy-gradient/soft-actor-critic.md)
- [PPO-Penalty](/deep-reinforcement-learning/fu-lu/policy-gradient/ppo-penalty.md)
