Section outline

    • Actor Critic based Deep RL: TRPO, Soft Actor Critic.

      Value based Deep RL: DQN, Double DQN, Rainbow.

      Robust RL and IRL.