Advantage Actor Critic with Recurrent Network Benchmark

This benchmark runs Advantage Actor Critic (A2C) with a recurrent network (GRU) in place of the plain feedforward Multi Layer Perceptron (MLP).
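To illustrate what changes when the feedforward network is swapped for a GRU, here is a minimal sketch in pure Python. It is not the benchmark's implementation. The class name `GRUActorCritic`, the layer sizes, the random weight initialization, and the omission of bias terms are all assumptions made for illustration. The point it shows is that the policy and value heads read from a hidden state `h` that is carried from one timestep to the next, so the same observation can produce different outputs depending on history.

```python
import math
import random


def matvec(W, v):
    """Multiply matrix W (list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]


def rand_matrix(rng, rows, cols, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class GRUActorCritic:
    """Hypothetical actor-critic whose shared body is a single GRU cell (no biases)."""

    def __init__(self, obs_dim, hidden_dim, n_actions, seed=0):
        rng = random.Random(seed)
        self.hidden_dim = hidden_dim
        # Input-to-hidden (W) and hidden-to-hidden (U) weights for the
        # update gate "z", reset gate "r", and candidate state "n".
        self.W = {g: rand_matrix(rng, hidden_dim, obs_dim) for g in "zrn"}
        self.U = {g: rand_matrix(rng, hidden_dim, hidden_dim) for g in "zrn"}
        self.policy_head = rand_matrix(rng, n_actions, hidden_dim)  # actor
        self.value_head = rand_matrix(rng, 1, hidden_dim)           # critic

    def initial_state(self):
        # Assumed convention: start each episode from a zero hidden state.
        return [0.0] * self.hidden_dim

    def step(self, obs, h):
        """One timestep: return (action probabilities, state value, new hidden state)."""
        z = [sigmoid(a + b) for a, b in zip(matvec(self.W["z"], obs), matvec(self.U["z"], h))]
        r = [sigmoid(a + b) for a, b in zip(matvec(self.W["r"], obs), matvec(self.U["r"], h))]
        reset_h = [ri * hi for ri, hi in zip(r, h)]
        n = [math.tanh(a + b) for a, b in zip(matvec(self.W["n"], obs), matvec(self.U["n"], reset_h))]
        # Blend old state and candidate; libraries differ on which side z weights.
        h_new = [(1.0 - zi) * hi + zi * ni for zi, hi, ni in zip(z, h, n)]
        probs = softmax(matvec(self.policy_head, h_new))
        value = matvec(self.value_head, h_new)[0]
        return probs, value, h_new


if __name__ == "__main__":
    model = GRUActorCritic(obs_dim=4, hidden_dim=8, n_actions=2)
    h = model.initial_state()
    obs = [0.1, 0.0, -0.2, 0.3]
    for t in range(3):
        # Feed the same observation repeatedly; the outputs drift because h changes.
        probs, value, h = model.step(obs, h)
        print(t, probs, value)
```

An MLP policy would map `obs` straight to `probs` and `value` with no `h` argument. In a recurrent setup the rollout code also has to store the hidden state alongside each transition and reset it at episode boundaries.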
